Integrative Biology Journals

JOURNAL OF FORESTRY RESEARCH ›› 2026, Vol. 37 ›› Issue (1): 1-.DOI: 10.1007/s11676-026-02013-2

• Original Paper •    

Accuracy and biological plausibility of a machine learning‑based transition matrix growth model for long‑term mixed forest projections under climate change

Xue Du1, Xiangdong Lei1, Xiao He1, Zeyu Zhou2, Hong Guo1, Yangping Qin1,3   

  1. 1State Key Laboratory of Efficient Production of Forest Resources, Institute of Forest Resource Information Techniques, Chinese Academy of Forestry, Beijing 100091, People’s Republic of China 

    2Experimental Center of Forestry in North China, Chinese Academy of Forestry, Beijing 102300, People’s Republic of China 

    3Southwest Survey and Planning Institute, National Forestry and Grassland Administration, Kunming 650031, People’s Republic of China

  • Received:2025-07-26 Accepted:2025-10-04 Online:2026-02-28 Published:2026-01-01
  • Supported by:
    This work was funded by the National Key R&D Program of China (2022YFD2200500) and the Forestry Public Welfare Scientific Research Project (201504303).

Abstract: Modeling mixed forest growth is challenging due to multiple tree species, uneven-aged structures, and complex environmental interactions. Traditional transition matrix models, widely used for predicting ingrowth, upgrowth, and mortality in mixed forests, rely on statistical approaches but have limited capacity to handle high-dimensional data. Machine learning has the potential, but there is limited application and a lack of comparison, especially in terms of model accuracy and biological plausibility. This study developed a climate-sensitive transition matrix growth model with the integration of machine learning (ML-Matrix) for mixed conifer-broad-leaved forests in northeastern China. Forest data were obtained from permanent plots in the 5th–9th National Forest Inventories. We evaluated five machine learning algorithms—random forest (RF), boosted regression trees (BRT), artificial neural network (ANN), support vector machine (SVM), and k‑nearest neighbor (KNN)—to model upgrowth, two-stage mortality, and recruitment. Results showed BRT excelled in modeling upgrowth of Pinus koraiensis and the first-stage mortality, while ANN excelled for upgrowth of other tree species. For the second-stage mortality, ANN performed best for two broad-leaved groups, while SVM outperformed for three coniferous groups. Optimal recruitment algorithms (SVM, BRT, or ANN) varied by species. Stand variables were the most important predictors with the relative important values of 85.1%–99.8% for upgrowth, 49.0%–88.4% for mortality, and 46.4%–84.3% for recruitment. Compared to the traditional statistical transition matrix (TS-Matrix), ML-Matrix achieved higher test-set accuracy for four tree species groups, excluding Pinus koraiensis. However, long-term simulations revealed TS-Matrix produced more biologically reasonable prediction of stand density, basal area and volume. This study highlighted the potential of ML-Matrix and the need for caution with its long-term projections for mixed forests under climate change.

Key words: Forest dynamics, Machine learning, Upgrowth, Ingrowth, Mortality, Climate change