In this example, only tree metrics are selected in the basal area prediction model. The model seems unable to predict large values: they are under-estimated, so the prediction errors are positively correlated with basal area.
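The link between under-estimated large values and a positive error correlation can be checked on a toy example. The sketch below uses simulated data only, not the tutorial's plots. It assumes that the error is computed as observed minus predicted, a sign convention the text does not state. It also uses a plain `lm` fit without transformation, which is a simplification of the model calibrated above.

```r
# Toy illustration with simulated data (not the tutorial's field plots)
set.seed(1)
n <- 200
# hypothetical "observed" basal area values (m2/ha)
observed <- runif(n, min = 10, max = 80)
# hypothetical predictor, only partly related to the observed values
x <- observed + rnorm(n, sd = 15)
# ordinary least squares fit: predictions are pulled towards the mean,
# so large values are under-estimated and small ones over-estimated
fit <- lm(observed ~ x)
predicted <- fitted(fit)
# assumed sign convention: error = observed - predicted
error <- observed - predicted
# correlation of errors with observed values (positive)
cor_obs <- cor(error, observed)
# correlation of errors with predicted values (zero by construction for OLS)
cor_pred <- cor(error, predicted)
round(c(observed = cor_obs, predicted = cor_pred), 2)
```

For a least squares fit with an intercept, the correlation between errors and observed values equals `sqrt(1 - R^2)`, so some positive correlation is expected whenever the fit is imperfect. A large value signals that much of the variability in basal area is not captured by the selected metrics.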
```{r modelPlot, include=TRUE}
```{r modelPlot, include=TRUE, fig.height = 4.5, fig.width = 8}
# correlation between residuals and field or terrain variables (first row of the correlation matrix)
round(cor(cbind(model.ABA$values$residual, plots[subsample, c("G.m2.ha", "N.ha")], terrain.stats[subsample, 1:3])), 2)[1, ]
# significance of correlation value
When only point cloud metrics are used as candidate inputs, the errors are hardly better distributed. Coloring the points by ownership shows that plots located in private forests have the largest basal area values, which tend to be under-estimated.
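A numeric complement to the colored scatterplot is the mean error per ownership class. The following sketch relies on simulated data: the ownership labels, the basal area values and the compressed predictions are all hypothetical, and the error is again assumed to be observed minus predicted.

```r
# Simulated data only: ownership labels and values are hypothetical
set.seed(2)
n <- 150
ownership <- factor(sample(c("public", "private"), n, replace = TRUE))
# give the hypothetical private plots larger basal area values (m2/ha)
observed <- ifelse(ownership == "private", 55, 35) + rnorm(n, sd = 10)
# predictions compressed towards the overall mean, mimicking a model
# that under-estimates large values
predicted <- mean(observed) + 0.5 * (observed - mean(observed))
# assumed sign convention: error = observed - predicted
error <- observed - predicted
# mean error per ownership class
bias_by_owner <- tapply(error, ownership, mean)
round(bias_by_owner, 1)
# predicted vs observed values, colored by ownership
# (factor levels are sorted alphabetically: private, public)
owner_colors <- c("red", "blue")
plot(observed, predicted,
  col = owner_colors[as.integer(ownership)], pch = 19,
  xlab = "Observed basal area (m2/ha)", ylab = "Predicted basal area (m2/ha)"
)
abline(0, 1) # 1:1 line, points below it are under-estimated
legend("topleft", legend = levels(ownership), col = owner_colors, pch = 19)
```

With this setup the class holding the largest values shows a positive mean error, which is the pattern described above for plots in private forests.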
```{r point.metricsOnly, include=TRUE}
```{r point.metricsOnly, include=TRUE, fig.height = 4.5, fig.width = 8}
model.ABA.point.metrics <- lidaRtRee::ABAmodel(plots[subsample, variable], point.metrics[subsample, ],
  transform = "boxcox", nmax = 4, xy = plots[subsample, c("X", "Y")]
)
# rename outputs
row.names(model.ABA.point.metrics$stats) <- names(model.ABA.point.metrics$model) <- variable
