The Box-Cox transformation (Box and Cox, 1964) is a way to transform data that ordinarily do not follow to a normal distribution so that it then conforms to it. The transformation is a piecewise function of the power parameter :
The function is, given the definition, continuous at the singular point . Plotting the transformation as a function of the original data
and the parameter
produces the following:
An interesting aspect of the Box-Cox transformation is that it can be used as a metric to quantify how normal or log-normal certain data are. This has clear applications in biology. Tissue growth that depends on cellular multiplication is exponential, with a constant factor that is often normally distributed, resulting in tissue size that is log-normally distributed (for discussion, see McAlister, 1879; Koch, 1966; Koch, 1969). Measurements of the final tissue size in different individuals, however, may not perfectly conform to the log-normal due to external influences that may hinder tissue growth. Moreover, it is not always possible to measure the final amount of tissue that is the product of a single lineage of self-multiplicative cells. The combination of different cell lineages, each with their own growth rate, as well as external influences, tend to produce a distribution that is more normally (Gaussianly) distributed. A distribution gradient, therefore, may exist between these two extremes of normality and log-normality. Estimating the parameter allows one to estimate also how closer to normal or log-normal certain measurements are. If
, the original data tend to be log-normally distributed, whereas if
, the data can be considered approximately more normally distributed.
References
- Box GEP, Cox DR. An analysis of transformations. J R Stat Soc B. 1964; 26(2):211-52.
- Koch AL. The logarithm in biology. 1. Mechanisms generating the log-normal distribution exactly. J Theor Biol. 1966; 12(2):276-90.
- Koch AL. The logarithm in biology. II. Distributions simulating the log-normal. J Theor Biol. 1969; 23(2):251-68.
- McAlister D. The law of the geometric mean. Proc R Soc London. 1879; 29:367-76.