Artificial neural networks as an alternative method to nonlinear mixed-effects models for tree height predictions

doi:10.1016/j.foreco.2022.120017

Forest Ecology and Management

Volume 507, 1 March 2022, 120017

https://doi.org/10.1016/j.foreco.2022.120017 Get rights and content

Highlights

•
At the plot level, mixed-effects models provided the most accurate tree height predictions.
•
By grouping similar plots the ANN predictions improved.
•
The ANNs are more competitive if enough tree height measurements are available.
•
The BAL tree competition variable increased the accuracy of ANN models.

Abstract

Tree heights are one of the most important aspects of forest mensuration, but data are often unavailable due to costly and time-consuming field measurements. Therefore, various types of models have been developed for the imputation of tree heights for unmeasured trees, with mixed-effects models being one of the most commonly applied approaches. The disadvantage here is the need of sufficient sample size per tree species for each plot, which is often not met, especially in mixed forests. To avoid this limitation, we used principal component analysis (PCA) for the grouping of similar plots based on the most relevant site descriptors. Next, we compared mixed-effects models with height-diameter models based on artificial neural networks (ANN). In terms of root mean square error (RMSE), mixed-effects models provided the most accurate tree height predictions at the plot level, especially for tree species with a smaller number of tree height measurements. When plots were grouped using the PCA and the number of observations per category increased, ANN predictions improved and became more accurate than those provided by mixed-effects models. The performance of ANN also increased when the competition index was included as an additional explanatory variable. Our results show that in the pursuit of the most accurate modelling approach for tree height predictions, ANN should be seriously considered, especially when the number of tree measurements and their distribution is sufficient.

Graphical abstract

Introduction

Forest inventories are one of the primary sources for national and international reporting schemes (e.g. FAO, 2020), with wood volume, forest biomass and carbon stock among the most important attributes to be reported (Vidal et al., 2016). Methods for wood volume estimates are usually country-specific relying on diverse volume model types, the most common being the use of different volume functions, taper-curves, and breast-height form factor functions (Gschwantner et al., 2019). Directly and indirectly all of them require diameter at breast height (DBH, or D) and tree height (H) estimates. In addition to volume and biomass estimates, tree heights are of key importance for assessment of forest components such as productivity, site indexes and forest development in general. Consequently almost all growth and yield models require information on tree height to predict forest dynamics (Barreiro and Tomé, 2017), where tree heights could be needed at the tree (e.g. Pretzsch et al., 2002, Buchacher and Ledermann, 2020), plot or stand level (e.g. Härkönen et al., 2019). Therefore, highly accurate estimates of tree heights are crucially important in several forestry subdisciplines, as well in related ecological and environmental disciplines.

Field measurements of tree heights are time-consuming and therefore often measured only for a subsample of trees, with unmeasured heights predicted using height-diameter (H-D) models (Soares and Tomé, 2002, Mehtätalo et al., 2015). There are two types of H-D model; the first requires only DBH to predict tree height (H-D function), while the second incorporates stand-level predictors in addition to DBH. The former are called simple and the latter generalised models (Mehtätalo et al., 2015). In the literature two- and three- parameter H-D functions exists (Kindermann, 2016). Simple models are particularly useful in even aged stands with a small number of species in homogenous stand and site conditions. However, the tendency in European forests and beyond is to promote uneven aged and mixed species stand structures (Bravo-Oviedo et al., 2014, Pach et al., 2018), which require more complex modelling approaches that rely on DBH combined with additional stand and tree characteristics (Temesgen and Von Gadow, 2004).

In more complex forest communities, tree heights are often predicted on the basis of information at the plot level, with the plot index entering the model as a random effect of a mixed-effects model (Zuur et al., 2009, Bronisz and Mehtätalo, 2020). Therefore, we assume the same species-specific H-D curve within each plot. Here the fixed part of the model describes the predicted H-D curve for a typical plot in the used database (fixed-effect prediction) and the random effect provides a calibrated prediction (random-effect prediction), which together describe the plot specific H-D relationship. Consequently, the use of mixed-effects models also enable prediction on new plots (Mehtätalo et al., 2015).

The usefulness of including site or plot effects in H-D relationships has been reported for many species, e.g. Pinus Sylvestris (Lappi, 1991), Quercus pagoda (Lynch et al., 2005), Pinus taeda (Trincado et al., 2007), Pseudotsuga menziesii (Temesgen et al., 2008), and Betula pendula (Bronisz and Mehtätalo, 2020). VanderSchaaf (2014) fitted mixed-effects models for ten different conifer species in the Northwest USA, while Mehtätalo et al. (2015) tested modelled H-D relationships using a dataset representing a wide range of tree species.

The advantage of such approaches is the consideration of local site conditions that importantly affect H-D relationships, while the problems with convergence of model could arise if the number of height measurements per plot is low (Harrison et al., 2018), which is often the case in uneven aged mixed forests, with greater diversity of tree species and diameter distributions.

In recent years, machine learning (ML) has seen increased application in various sciences, including forestry. Besides the decision-tree learning and support vector machine, one commonly used method is the artificial neural network (ANN) (Liu et al., 2018). ANNs have the ability to acquire and maintain information based knowledge and can be defined as a set of processing units, represented by artificial neurons, interlinked by a multitude of interconnections (artificial synapses), implemented by vectors and matrices of synaptic weights (da Silva et al., 2017). The ANN model can be applied to various kinds of problems, from classification, clustering and optimisation to function approximation, and has already been applied in various forestry disciplines, such as forest fire prediction (Safi and Bouroumi, 2013), prediction of insect outbreaks (Park and Chung, 2006), and species distribution models (Scrinzi et al., 2007). Apart from these, the ANN has also been tested in tree height modelling for eucalyptus trees (Vieira et al., 2018), common beech (Fagus sylvatica) from northwestern Spain (Castaño-Santamaría et al., 2013) and Crimean juniper (Juniperus excels) (Özçelik et al., 2013).

The primary aim of our study was to explore the application of mixed-effects H-D models for height predictions within predominantly uneven aged mixed forests using forest inventory plot data. Slovenia has a long tradition of close to nature forest management with strong emphasis on natural regeneration, and consequently, a high proportion of uneven aged and mixed stands (Diaci, 2006). Secondly, focused on less representative tree species, we explore the application of grouping plots based on site factors derived from principal component analysis (PCA) (Jolliffe and Cadima, 2016), which are later used as (nested) random effects and categorical independent variables. Finally, we test and compare a new methodological approach based on artificial neural networks (ANN) for tree height predictions for variety of different tree species specific to central Europe. Inclusion of additional explanatory variables, namely competition, was tested for the ANN. Competition is often among the most effective in explaining forest stand dynamics (Jevšenak and Skudnik, 2021, Vospernik, 2021) and studies have reported a significant effect of competition on the modelling of height (Temesgen and Von Gadow, 2004) and height growth (Sharma and Brunner, 2017).

Section snippets

Data

To compare different modelling approaches, we used 5450 tree height measurements from 685 plots from the Slovenian national forest inventory (Skudnik et al., 2021). The data used in this study is from the fourth cycle of the Slovenian nation-wide survey, which was carried out in 2018. The sample trees were measured on permanent concentric sampling plots on a 4 × 4 km grid arranged systematically across the country (Kušar et al., 2010, Skudnik and Hladnik, 2018). At each plot located within the

Comparison of mixed-effects models and ANN at the plot level

The most representative tree species was common beech (Fagus sylvatica), followed by Norway spruce (Picea abies), silver fir (Abies alba) and sessile oak (Quercus petraea) (Table 2). Tree height measurements per species are described in more detail in Supplementary Table 3.

At the plot level comparison (Table 3A), mixed-effects models showed generally more accurate predictions than the ANN models. Out of 16 tree species, ANN1 was more accurate than the mixed-effects models for 7 species.

Use of artificial neural networks for tree height predictions

With our study, we directly compared the ANN approach against the current golden standard, i.e. mixed-effects models (for example Mehtätalo et al., 2015). While there are numerous types of ANN, we decided to use the Bayesian regularised ANN, which is robust to overfitting and often results in an S-shaped curve, similar to growth functions. Nevertheless, users must optimise the complexity of the neural net, which is defined by the selected number of hidden layers and associated neurons (Gardner

Conclusions

Reliable models for tree heights are needed for the estimation of growing stock and biomass, understanding forest dynamics, and assessing site quality. In more complex forest communities, mixed-effects models are the current golden standard for tree height predictions, in which plot-level effects are included as random effects. In this study we present that also ANN can be reliably used to predict tree heights. For Slovenian NFI data using only plot IDs, the mixed-effects approach showed the

Funding

Funding for this study was provided by the Slovene Research Agency: Program and Research Group “Forest biology, ecology and technology” (P4-0107) and Target research project “Development of models for forest management in Slovenia” (V4-2014B). The collection of data used in this study (Slovenian NFI Data) was financed by the Ministry of Agriculture, Forestry and Food in the scope of the “Public Forest Service” programme. Jernej is grateful for the support by the World Federation of Scientists,

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References (70)

S. Barreiro et al.
Projection Systems in Europe and North America: Concepts and Approaches
C. Bergmeir et al.
On the use of cross-validation for time series predictor evaluation
Inf. Sci.
(2012)
Bravo-Oviedo, A., Pretzsch, H., Ammer, C., Andenmatten, E., Barbati, A., Barreiro, S., Brang, P., Bravo, F., Coll, L.,...
K. Bronisz et al.
Mixed-effects generalized height–diameter model for young silver birch stands on post-agricultural lands
For. Ecol. Manage.
(2020)
R. Buchacher et al.
Interregional Crown Width Models for Individual Trees Growing in Pure and Mixed Stands in Austria
Forests
(2020)
F. Burden et al.
Bayesian regularization of neural networks
J. Castaño-Santamaría et al.
Tree height prediction approaches for uneven-aged beech forests in northwestern Spain
For. Ecol. Manage.
(2013)
A.L. Cohen et al.
Model evaluation using grouped or individual data
Psychon. Bull. Rev.
(2008)
R.O. Curtis
Height-Diameter and Height-Diameter-Age Equations For Second-Growth Douglas-Fir
For. Sci.
(1967)

J. Diaci

Nature-based silviculture in Slovenia: origins, development and future trends

Z. Fang et al.

A multivariate simultaneous prediction system for stand growth and yield with fixed and random effects

For. Sci.

(2001)

FAO, 2020. Global Forest Resources Assessment 2020: Main report. Rome, pp....

F.D. Foresee et al.

L.A. García-Escudero et al.

A review of robust clustering methods

Adv. Data Anal. Classif.

(2010)

M.W. Gardner et al.

Artificial neural networks (the multilayer perceptron)-a review of applications in the atmospheric sciences

Atmos. Environ.

(1998)

C. Gollob et al.

A Flexible Height-Diameter Model for Tree Height Imputation on Forest Inventory Sample Plots Using Repeated Measures from the Past

Forests

(2018)

T. Gschwantner et al.

Harmonisation of stem volume estimates in European National Forest Inventories

Ann. Forest Sci.

(2019)

Hamamoto, Y.S.U., Kanaoka, T., Tomita, S., 1993. Evaluation of artificial neural network classifiers in small sample...

S. Härkönen et al.

A climate-sensitive forest model for assessing impacts of forest management in Europe

Environ. Model. Software

(2019)

H. Harmens et al.

Nitrogen concentrations in mosses indicate the spatial distribution of atmospheric nitrogen deposition in Europe

Environ. Pollut.

(2011)

H. Harmens et al.

Heavy metal and nitrogen concentrations in mosses are declining across Europe whilst some “hotspots” remain in 2010

Environ. Pollut.

(2015)

X.A. Harrison et al.

A brief introduction to mixed effects modelling and multi-model inference in ecology

PeerJ

(2018)

J. Jevšenak et al.

A random forest model for basal area increment predictions from national forest inventory data

For. Ecol. Manage.

(2021)

I.T. Jolliffe et al.

Principal component analysis: a review and recent developments

Philos. Trans. A Math. Phys. Eng. Sci.

(2016)

T. Karjalainen et al.

Field calibration of merchantable and sawlog volumes in forest inventories based on airborne laser scanning

Can. J. For. Res.

(2020)

G. Kindermann

Evaluation of growth functions for tree height modelling

Austrian J. For. Sci.

(2016)

F. Korner-Nievergelt et al.

Chapter 7 - Linear Mixed Effects Models

M. Kovač et al.

I. Gozdna inventura

G. Kušar et al.

Methodological bases of the forest and forest ecological condition survey

J. Lappi

Mixed linear models for analyzing and predicting stem form variation of scots pine

(1986)

J. Lappi

Calibration of Height and Volume Equations with Random Parameters

For. Sci.

(1991)

J. Lappi

A longitudinal analysis of height/diameter curves

For. Sci.

(1997)

Z. Liu et al.

Application of machine-learning methods in forest ecology: recent progress and future challenges

Environ. Rev.

(2018)

T.B. Lynch et al.

A Random-Parameter Height-Dbh Model for Cherrybark Oak

South. J. Appl. For.

(2005)

Cited by (15)

Uncertainty quantification for structural response field with ultra-high dimensions
2024, International Journal of Mechanical Sciences
The structural response field is crucial for understanding mechanical behavior, especially under uncertain conditions. However, current uncertainty quantification predominantly address one-dimensional output under multi-dimensional input. For the structural full field response with ultra-high dimension, quantifying corresponding uncertainties becomes a formidable task. This paper introduces an efficient uncertainty quantification method for structural response field. Initially, the manifold learning is introduced to transform uncertainty quantification of ultra-high-dimensional responses into the quantification of low-dimensional manifold projections. Subsequently, polynomial chaos expansions facilitate uncertainty propagation, allowing for the effective evaluation of statistical moments for each projection. Leveraging the established manifold structure enables obtaining analytical solutions for statistical moments of full field responses without additional computational costs. In the absence of prior information, the derivative λ-PDF is utilized to model uncertainty of response at any position, so as to realize the visualization of uncertainty. The proposed uncertainty quantification for structural response field provides a non-intrusive analysis framework, which can conveniently deal with structural multi-field coupling problems. Finally, three engineering examples involving multi field coupling are presented to demonstrate the effectiveness and practicability of proposed uncertainty quantification of structural response field.
Shifting potential for high-resolution climate reconstructions under global warming
2024, Quaternary Science Reviews
Reconstructions of climate in pre-instrumental times are a cornerstone of earth-system science that relies critically on statistical relationships between meteorological observations and natural proxy archives. Recent studies have frequently reported that these relationships are not stable in time (non-stationarity), which might be due to environmental change (climate, atmospheric CO₂), data resolution and quality, and statistical methods applied. Here, we assess the elusive impacts of these factors on the palaeoclimatological potential across the Northern Hemisphere. Scrutinizing spatiotemporal patterns in widely applied calibration metrics derived from 3781 tree-ring chronologies and 517 published dendroclimatic studies, we show that temperature and precipitation sensitivity have increased towards present. This increase was most pronounced in moisture-limited areas and accentuated when using daily rather than monthly instrumental data. An assessment of climate scenarios for 2021–2040 indicated further expansion of areas with strong water limitation (+5 ± 2%), whereas the areas with strong temperature limitation are projected to shrink by 8 ± 3% (tree-ring width proxy) and 3 ± 2% (maximum latewood density proxy). These findings indicate that further refinement of statistical methods will likely no longer compensate for trees’ decreasing temperature sensitivity. Further, a scenario of increased CO₂ fertilization may mitigate water limitation on tree growth and weaken precipitation reconstructions. Finally, we discuss the potential drivers of non-stationarity and the consequences for high-resolution paleoclimatology.
Plant functional traits and tree size inequality improved individual tree height prediction of mid-montane humid evergreen broad-leaved forests in southwest China
2024, Forest Ecology and Management
Tree height and diameter at breast height (DBH) are key survey factors in forest inventory. Compared to measuring DBH, the process of measuring tree height is time-consuming, labor-intensive and costly, especially in natural mixed forests. Tree height is therefore commonly predicted using height-diameter (H-D) models. Nevertheless, developing H-D models for natural mixed forests are hampered by the large diversity of tree species and the small number of rare tree species observed. In this study, a nonlinear mixed effects H-D model was established for a mid-montane humid evergreen broad-leaved forest in southwest China. For this purpose, we classified tree species by K-means clustering according to the specific plant functional traits. Data were collected from 5, 737 trees in 100 subplots with 86 species. The results revealed that the nonlinear mixed effects model of tree height accounted for 75% of the variation in tree height without significant heteroscedasticity, and that the model parameters had biological significance. In the 10-fold cross validation, no clear fluctuation in the model evaluation metrics was observed, indicating the absence of overfitting in the model. Of the 15 stand variables, dominant height (H_d) and Margalef’s index of DBH (D_Mg), which contributed the most to the height-diameter model, were selected to expand the model. Our findings suggest that tree height increased with increasing values of H_d and D_Mg, and vice versa. This study highlights the feasibility of categorizing natural mixed forests by utilizing plant functional traits and the influential role of tree size inequality on tree height variation.
Machine Learning Forest Simulator (MLFS): R package for data-driven assessment of the future state of forests
2023, Ecological Informatics
The Machine Learning Forest Simulator (MLFS) is the first completely data-driven model of forest dynamics organised as an R package. It is a single-tree, easy to use and freely available tool that is applicable to all forest ecosystems, from even-aged monocultures to mixed forests with diverse vertical structures. This article presents the newly developed model system and gives detailed instructions on how to use it, from data preparation to description of key arguments and algorithms and to evaluation and interpretation of simulation outputs. The sample simulations are based on the most recent national forest inventory data from Slovenia and include different mortality, harvesting and climate scenarios. We show a wide range of potential applications and input settings, which result in different growing stock, yield and forest structures. The MLFS showed reasonable accuracy and can therefore become a standard tool for decision-making in forest ecosystem management.
Prediction of tree crown width in natural mixed forests using deep learning algorithm
2023, Forest Ecosystems
Crown width (CW) is one of the most important tree metrics, but obtaining CW data is laborious and time-consuming, particularly in natural forests. The Deep Learning (DL) algorithm has been proposed as an alternative to traditional regression, but its performance in predicting CW in natural mixed forests is unclear. The aims of this study were to develop DL models for predicting tree CW of natural spruce-fir-broadleaf mixed forests in north-eastern China, to analyse the contribution of tree size, tree species, site quality, stand structure, and competition to tree CW prediction, and to compare DL models with nonlinear mixed effects (NLME) models for their reliability. An amount of total 10,086 individual trees in 192 subplots were employed in this study. The results indicated that all deep neural network (DNN) models were free of overfitting and statistically stable within 10-fold cross-validation, and the best DNN model could explain 69% of the CW variation with no significant heteroskedasticity. In addition to diameter at breast height, stand structure, tree species, and competition showed significant effects on CW. The NLME model (R² = 0.63) outperformed the DNN model (R² = 0.54) in predicting CW when the six input variables were consistent, but the results were the opposite when the DNN model (R² = 0.69) included all 22 input variables. These results demonstrated the great potential of DL in tree CW prediction.
Individual tree detection and estimation of stem attributes with mobile laser scanning along boreal forest roads
2022, ISPRS Journal of Photogrammetry and Remote Sensing
Citation Excerpt :
For instance, Kolendo et al. (2021) used a large-scale reference dataset to parameterize ITD algorithms in coniferous forests, reaching tree count RMSEs varying from approximately 6 to 13%, depending on the forest type. Skudnik and Jevšenak (2022) found that, in the presence of sufficient reference data for calibration, artificial neural network-derived tree height predictions can outperform predictions derived from mixed effect models. Generally, deep learning methods require large datasets for calibration to be used at their full potential (Hamraz et al., 2019; Xi et al., 2020).
The collection of field-reference data is a key task in remote sensing-based forest inventories. However, traditional methods of collection demand extensive personnel resources. Thus, field-reference data collection would benefit from more automated methods. In this study, we proposed a method for individual tree detection (ITD) and stem attribute estimation based on a car-mounted mobile laser scanner (MLS) operating along forest roads. We assessed its performance in six ranges with increasing mean distance from the roadside. We used a Riegl VUX-1LR sensor operating with high repetition rate, thus providing detailed cross sections of the stems. The algorithm we propose was designed for this sensor configuration, identifying the cross sections (or arcs) in the point cloud and aggregating those into single trees. Furthermore, we estimated diameter at breast height (DBH), stem profiles, and stem volume for each detected tree. The accuracy of ITD, DBH, and stem volume estimates varied with the trees’ distance from the road. In general, the proximity to the sensor of branches 0–10 m from the road caused commission errors in ITD and over estimation of stem attributes in this zone. At 50–60 m from roadside, stems were often occluded by branches, causing omissions and underestimation of stem attributes in this area. ITD’s precision and sensitivity varied from 82.8% to 100% and 62.7% to 96.7%, respectively. The RMSE of DBH estimates ranged from 1.81 cm (6.38%) to 4.84 cm (16.9%). Stem volume estimates had RMSEs ranging from 0.0800 m³ (10.1%) to 0.190 m³ (25.7%), depending on the distance to the sensor. The average proportion of detected reference volume was highly affected by the performance of ITD in the different zones. This proportion was highest from 0 to 10 m (113%), a zone that concentrated most ITD commission errors, and lowest from 50 to 60 m (66.6%), mostly due to the omission errors in this area. In the other zones, the RMSE ranged from 87.5% to 98.5%. These accuracies are in line with those obtained by other state-of-the-art MLS and terrestrial laser scanner (TLS) methods. The car-mounted MLS system used has the potential to collect data efficiently in large-scale inventories, being able to scan approximately 80 ha of forests per day depending on the survey setup. This data collection method could be used to increase the amount of field-reference data available in remote sensing-based forest inventories, improve models for area-based estimations, and support precision forestry development.

View all citing articles on Scopus

View full text

Artificial neural networks as an alternative method to nonlinear mixed-effects models for tree height predictions

Highlights

Abstract

Graphical abstract

Introduction

Section snippets

Data

Comparison of mixed-effects models and ANN at the plot level

Use of artificial neural networks for tree height predictions

Conclusions

Funding

Declaration of Competing Interest

Projection Systems in Europe and North America: Concepts and Approaches

On the use of cross-validation for time series predictor evaluation

Inf. Sci.

Mixed-effects generalized height–diameter model for young silver birch stands on post-agricultural lands

For. Ecol. Manage.

Interregional Crown Width Models for Individual Trees Growing in Pure and Mixed Stands in Austria

Forests

Bayesian regularization of neural networks

Tree height prediction approaches for uneven-aged beech forests in northwestern Spain

For. Ecol. Manage.

Model evaluation using grouped or individual data

Psychon. Bull. Rev.

Height-Diameter and Height-Diameter-Age Equations For Second-Growth Douglas-Fir

For. Sci.

Nature-based silviculture in Slovenia: origins, development and future trends

A multivariate simultaneous prediction system for stand growth and yield with fixed and random effects

For. Sci.

A review of robust clustering methods

Adv. Data Anal. Classif.

Artificial neural networks (the multilayer perceptron)-a review of applications in the atmospheric sciences

Atmos. Environ.

A Flexible Height-Diameter Model for Tree Height Imputation on Forest Inventory Sample Plots Using Repeated Measures from the Past

Forests

Harmonisation of stem volume estimates in European National Forest Inventories

Ann. Forest Sci.

A climate-sensitive forest model for assessing impacts of forest management in Europe

Environ. Model. Software

Nitrogen concentrations in mosses indicate the spatial distribution of atmospheric nitrogen deposition in Europe

Environ. Pollut.

Heavy metal and nitrogen concentrations in mosses are declining across Europe whilst some “hotspots” remain in 2010

Environ. Pollut.

A brief introduction to mixed effects modelling and multi-model inference in ecology

PeerJ

A random forest model for basal area increment predictions from national forest inventory data

For. Ecol. Manage.

Principal component analysis: a review and recent developments

Philos. Trans. A Math. Phys. Eng. Sci.

Field calibration of merchantable and sawlog volumes in forest inventories based on airborne laser scanning

Can. J. For. Res.

Evaluation of growth functions for tree height modelling

Austrian J. For. Sci.

Chapter 7 - Linear Mixed Effects Models

I. Gozdna inventura

Methodological bases of the forest and forest ecological condition survey

Mixed linear models for analyzing and predicting stem form variation of scots pine

Calibration of Height and Volume Equations with Random Parameters

For. Sci.

A longitudinal analysis of height/diameter curves

For. Sci.

Application of machine-learning methods in forest ecology: recent progress and future challenges

Environ. Rev.

A Random-Parameter Height-Dbh Model for Cherrybark Oak

South. J. Appl. For.