Debber validation

Debber provides probability distributions of DEB parameters. Each of its estimates (e.g., a median) is accompanied by uncertainty metrics such as percentiles. The validity of Debber results is therefore not determined by the accuracy of the single estimate, but by whether the true parameter values, if measured, are distributed as Debber suggests: do the true DEB parameters indeed have a 50% probability of falling within Debber's predicted 50% confidence interval, do they have a 95% probability of falling within Debber's predicted 95% confidence interval, etc. To verify this, we perform leave-one-out cross-validation: for each of the 2006 species in the Add-my-Pet database (version 20200514), we do the following:

We temporarily drop the species from the AmP database.
We re-calibrate the evolutionary model based on all remaining AmP entries. Effectively, we rerun the entire PhyloPars analysis to estimate phylogenetic and phenotypic covariances, ensuring that the inference model is not influenced by the values that are to be estimated.
With the new evolutionary model, we estimate the parameters of the omitted species using the usual Debber method.
For each estimated parameter, we quantify the cross-validation error by its z-score: the difference between the estimated mean and the true AmP value, divided by the estimated standard deviation. This is done in transformed parameter space, that is, after parameters have been log- or logit-transformed to make their distribution more normal-like. Thus, the z-score is a meaningful measure of the estimation error for all parameters. It considers the full [normal] probability distribution predicted by Debber.

After collecting the errors per parameter across all species, we plot the histogram (normalized to have a surface area of 1) of z-scores to check whether it resembles the expected standard normal distribution (μ=0, σ=1). In addition, we make a Q-Q plot comparing the two cumulative distributions. Click a parameter on the left to see its cross-validation results.

Summary

parameter	interpretation	# samples	bias	within 50 % CI	within 95 % CI
T_A	Arrhenius temperature	2006	-0.017 σ	89.4 %	96.3 %
{p_Am}	specific assimilation flux	2006	0.001 σ	59.0 %	94.3 %
{F_m}	maximum specific searching rate	2006	-0.005 σ	97.0 %	99.1 %
κ_X	digestion efficiency (fraction of food to reserve)	2006	0.001 σ	90.9 %	96.1 %
κ_P	faecation efficiency (fraction of food to faeces)	2006	-0.000 σ	92.9 %	96.7 %
v	energy conductance	2006	0.002 σ	57.8 %	94.1 %
κ	allocation fraction to soma	2006	-0.001 σ	57.7 %	93.2 %
κ_R	reproduction efficiency	2006	-0.004 σ	92.2 %	97.0 %
[p_M]	volume-specific somatic maintenance	2006	0.005 σ	58.7 %	93.9 %
k_J	maturity maintenance rate coefficient	2006	0.000 σ	87.2 %	95.0 %
[E_G]	specific cost for structure	2006	0.006 σ	95.9 %	98.6 %
E_Hb	maturity at birth	2006	0.002 σ	63.6 %	95.5 %
E_Hx	maturity at weaning/fledgling	2006	0.001 σ	63.5 %	95.6 %
E_Hj	maturity at metamorphosis	2006	0.003 σ	63.8 %	94.5 %
E_Hp	maturity at puberty	2006	0.001 σ	60.1 %	95.3 %
h_a	Weibull aging acceleration	2006	-0.002 σ	69.9 %	93.6 %
s_G	Gompertz stress coefficient	2006	-0.002 σ	98.4 %	99.0 %
t₀	time at start of development	391	-0.004 σ	84.9 %	94.4 %

number of samples
bias	σ
within 50% confidence interval	%
within 95% confidence interval	%