|
|
|
|||
| Home Help Feedback Subscriptions Archive Search Table of Contents | ||||
|
Fig. 5. Worked example of Felsenstein's
phylogenetically independent contrasts (IC;
Felsenstein, 1985). For the
upper tree: letters at tips (terminal nodes) are names of five species
(AE); numbers at tips are hypothetical data for two phenotypic traits
(e.g. body mass, bill length); numbers adjacent to branches indicate lengths
in units of expected variance of character evolution (which would be
proportional to divergence times under a simple Brownian motion model of
character evolution; see Fig. 2
and text); names for internal nodes are in italics. For the bottom tree:
broken lines and numbers indicate amounts by which internal branches are
lengthened during calculations. The goal of the IC algorithm is to use
phylogenetic information to transform the N original, non-independent
species values into N1 independent and identically distributed
contrasts. These values can then be used in conventional statistical
procedures, with the constraint that all calculations (e.g. correlation,
regression) are computed through the origin. The algorithm begins at the tips
of the phylogenetic tree with sister taxa (typically species). In this
example, these would be species A and B as well as C and D. For these two
contrasts (which can also be identified by their basal node (4 and
3, respectively), the difference in phenotypes between the pair
members is first computed. These `raw contrasts' are then divided by their
`standard deviations,' which are the square roots of the sums of the branch
lengths. This yields `Standardized Contrasts,' which are the values actually
used for subsequent
analyses.
Contrasts can also be calculated for deeper nodes in the tree if we estimate the phenotypes of hypothetical ancestors. For node 4, the appropriate value (assuming Brownian motion evolution with no directional trend) would be the average of its daughter species A and B, i.e. a value of 2.5 for trait 1. Similarly, for node 3, the value is 7. The raw contrast (difference) between nodes 4 and 3 is thus a value of 2.57=4.5. This value would then be divided by its standard deviation. However, this contrast is different from those calculated at the tips of the tree. The contrast of node 4 vs 3 uses two estimated phenotypes, not actual measured data. Therefore, we should devalue it relative to the tip contrasts. This is accomplished by lengthening the branches leading to internal nodes. As explained in Felsenstein (1985), Under Brownian motion, the amount of lengthening is computed as: (daughter branch length 1 x daughter branch length 2) /(daughter branch length 1 + daughter branch length 2). Thus, the branch leading to node 4 is lengthened by 1, giving a corrected length of 4. The branch to node 3 is lengthened by 2, for a corrected length of 3. These corrected values are then used to compute the standard deviation of contrast 4 vs 3.
To compute the contrast between internal node 2 and terminal node E, the value at node 2 must be estimated. It is taken as a weighted average, with weights inversely proportional to the lengths of the branches leading to its daughter nodes 3 and 4. The value computed for node 2 will, therefore, be more similar to the value at node 3 because the branch leading to it (corrected length=3) is shorter than the one leading to node 4 (corrected length=4). The branch leading to node 2 is also lengthened, with computations that use the already-lengthened branches above node 2.
The IC algorithm involves iterative calculations best accomplished by computer programs, such as the PDTREE module of the Phenotypic Diversity Analysis Programs (available from T.G.) or the corresponding module in Mesquite (http://mesquiteproject.org/mesquite/mesquite.html). Readers are cautioned that some computer programs do not properly lengthen internal branches or have other errors. One way to check whether a program is performing the correct calculations is to collapse the internal branches of the phylogeny to be length zero, and then make all of the terminal branches (those leading to tips) equal in length. This yields a `star' phylogeny (e.g. Figs 3A and 4A). This star phylogeny can then be used to compute statistics with independent contrasts, such as the Pearson correlation (through the origin) between two traits. This value should be identical to the value obtained with a conventional statistical program (not through the origin). If the values differ, the IC program is doing something incorrectly.
Most programs for independent contrasts will output the contrasts and some simple statistics, such as the correlation between two sets of contrasts. For more complicated analyses, such as multiple regression or principal components analysis, one will generally need to write out the contrasts and then read them back into a commercial statistics package, most of which will allow the user to specify that calculations of correlations, regressions, etc., be done through the origin.
If standardized contrasts are calculated for the second trait shown in the figure, then the Pearson correlation (through the origin) with trait 1 is 0.85289. This would be considered not statistically significant as compared with a critical value of 0.878 for a two-tailed test with three degrees of freedom. For comparison, the conventional Pearson correlation of the tip data (mathematically equivalent to performing the contrasts calculations on a star phylogeny) is 0.88024, which would be considered marginally significant. Obviously, one should not get too exorcised about such small differences in parameter estimates or P values, but this example serves to make the point that results will always be somewhat different when the phylogeny is specified as being hierarchical rather than a star. For the least-squares linear regression slope, values are 0.55003 for IC and 0.60057 for conventional; corresponding y-intercepts are 1.00724 (calculated as described in Garland et al., 1993; Garland and Ives, 2000) and 0.57759.
Note that the value at the very bottom of the tree, the basal or root node, can also be estimated, but for this data set there is nothing with which to contrast it. Therefore, Felsenstein (1985) did not show calculation of the root node value in his example. However, this value is of interest because it represents (1) the phylogenetically correct estimate of the mean value for all five species and also (2) an estimate of the value for the hypothetical ancestor. As discussed elsewhere (e.g. Garland et al., 1999), both interpretations hinge on the Brownian motion assumption, and the latter also requires that no directional trend in character evolution has occurred. One can also compute standard errors and confidence intervals for the root node (Garland et al., 1999). For trait 1, the value computed by our PDTREE program is 4.115 with a standard error (S.E.) of ±1.7055 and a 95% confidence interval of 0.6204 to 8.8499. Conventional values (which are identical to those obtained after collapsing the phylogeny to be a star) are 4.200 with S.E. ±1.3191 and 95% CI of 0.5376 to 7.8624. Thus, the phylogenetic point estimate is slightly different from the conventional one, but the S.E. and CI are substantially wider. As also noted in the text, this general pattern of wide S.E. values and CIs for estimates at basal nodes has been pointed out before (Garland et al., 1999), and emphasizes that it is often hard to infer ancestral values with much confidence (Schluter et al., 1997; Cunningham et al., 1998). Finally, note that the values given above, which are derived literally by the algebra of phylogenetically independent contrasts in the PDTREE program (Garland et al., 1999), are the same as those obtained by GLS (e.g. see box 3 in Cunningham et al., 1998), and can also be obtained from our REGRESSION.M Matlab program (Blomberg et al., 2003).
Importantly, for nodes not at the base of the phylogeny, the values estimated during the IC algorithm have no special meaning. Rather, they should be viewed merely as intermediate steps in the calculations. Only the value for the root node is equivalent to the maximum likelihood estimate, which can be obtained by GLS or by the `squared-change parsimony' algorithm (Schluter et al., 1997; Cunningham et al., 1998; Garland et al., 1999). However, an interesting property of the IC algorithm is that if you reroot the tree at an internal node (as can be done in our PDTREE program) then the value obtained for that node will be the same as the GLS value (Garland et al., 1999; empirical examples there and in Laurin, 2004). Although such rerooting when performing the IC algorithm affects the values estimated at nodes, it does not affect the estimated correlation or slope between two traits.
For this example data set, the diagnostic Pearson correlation (not through the origin) between the absolute values of the standardized contrasts and their standard deviations (Garland et al., 1992) is 0.663 for trait 1 and 0.144 for trait 2, neither of which is statistically significant (two-tailed critical value for two d.f.=0.950). This would suggest that the branch lengths are adequate for the Brownian motion assumption of independent contrasts, but we should be cautious with so few data points. (Note also that the hypothetical tip values shown in the figure were just made up, not actually produced by a Brownian motion process.)
Blomberg et al. (2003) derived a statistic, termed K, which indicates the amount of `phylogenetic signal' in the tip data relative to the expectation for a trait that evolved by Brownian motion along the specified topology and branch lengths. A value of 1.00 indicates exactly the amount expected. Values <1 indicate less resemblance of relatives than expected, and values >1 indicate stronger-than-expected phylogenetic `clumping' of trait values. For trait 1, K=1.084, indicating that the tendency for related species to resemble each other is, averaged over the entire tree, just about what one would expect if the traits had actually evolved by a process similar to Brownian motion. The PHYSIG.M Matlab program can be obtained from T.G. on request.
As discussed in the text, Blomberg et al. (2003) also present a randomization test for the presence of phylogenetic signal (also implemented in PHYSIG.M). For trait 1, 247 of 1000 randomized data sets (tip values shuffled randomly) yielded a mean squared error that was less than or equal to the value of 8.0264 for the data in their correct position. Thus, the presence of signal would not be considered statistically significant at the usual P<0.05. However, computer simulations demonstrate that approximately 20 species are required to achieve a statistical power of 80% for detecting signal when it is present (Blomberg et al., 2003), so this result should not be taken to suggest that phylogenetic signal is actually lacking for these example data. (Note that the K statistic does not depend on sample size and is a valid descriptive statistic for the amount of signal even for small data sets such as this.)
Finally, the matrix derived from this tree that would be used in GLS
calculations (see text) is as follows (as produced by our PDDIST program as a
DSC file):
![]() |
Other examples of this sort of matrix can be found in Cunningham et al. (1998, box 3) and Freckleton et al. (2002, fig. 1).
|