spacer gif spacer gif spacer gif spacer gif spacer gif
 QUICK SEARCH:   [advanced]


spacer gif
     Home     Help     Feedback     Subscriptions     Archive     Search     Table of Contents    

First published online August 4, 2005
Journal of Experimental Biology 208, 3015-3035 (2005)
Published by The Company of Biologists 2005
doi: 10.1242/jeb.01745
This Article
Right arrow Summary Freely available
Right arrow Figures Only
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Garland, T.
Right arrow Articles by Rezende, E. L.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Garland, T., Jr
Right arrow Articles by Rezende, E. L.

Commentary

Phylogenetic approaches in comparative physiology

Theodore Garland, Jr1, Albert F. Bennett2,* and Enrico L. Rezende1

1 Department of Biology, University of California, Riverside, CA 92521, USA
2 Department of Ecology and Evolutionary Biology, University of California, Irvine, CA 92697, USA

* Author for correspondence (e-mail: abennett{at}uci.edu)

Accepted 13 June 2005


    Summary
 TOP
 Summary
 Introduction
 Phylogeny and modern...
 An example of how...
 When and why to...
 Examples of the utility...
 Some notes of caution
 Summary and perspectives
 Appendix
 References
 
Over the past two decades, comparative biological analyses have undergone profound changes with the incorporation of rigorous evolutionary perspectives and phylogenetic information. This change followed in large part from the realization that traditional methods of statistical analysis tacitly assumed independence of all observations, when in fact biological groups such as species are differentially related to each other according to their evolutionary history. New phylogenetically based analytical methods were then rapidly developed, incorporated into `the comparative method', and applied to many physiological, biochemical, morphological and behavioral investigations. We now review the rationale for including phylogenetic information in comparative studies and briefly discuss three methods for doing this (independent contrasts, generalized least-squares models, and Monte Carlo computer simulations). We discuss when and how to use phylogenetic information in comparative studies and provide several examples in which it has been helpful, or even crucial, to a comparative analysis. We also consider some difficulties with phylogenetically based statistical methods, and of comparative approaches in general, both practical and theoretical. It is our personal opinion that the incorporation of phylogeny information into comparative studies has been highly beneficial, not only because it can improve the reliability of statistical inferences, but also because it continually emphasizes the potential importance of past evolutionary history in determining current form and function.

Key words: allometry, comparative method, evolutionary physiology, model of evolution, phylogeny, statistical analysis


    Introduction
 TOP
 Summary
 Introduction
 Phylogeny and modern...
 An example of how...
 When and why to...
 Examples of the utility...
 Some notes of caution
 Summary and perspectives
 Appendix
 References
 
Studies of organismal form and function rely on multiple types of scientific investigation, including theory, description, experimentation and comparison. Comparing species is an ancient human enterprise, done for a variety of reasons (Sanford et al., 2002Go). Since Charles Darwin, the `comparative method' – comparing populations, species or higher taxa – has been the most common and productive means of elucidating past evolutionary processes (Harvey and Pagel, 1991Go; Brooks and McClennan, 2002). Comparative methods have been used extensively to infer evolutionary adaptation, that is, changes in response to natural selection (for alternate physiological meanings of `adaptation', see Garland and Adolph, 1991Go; Bennett, 1997Go). They are most often promoted and criticized (e.g. Leroi et al., 1994Go) within this context. However, comparative methods are not used to infer adaptation alone (Garland and Adolph, 1994Go; Sanford et al., 2002Go), but are also employed to analyze the effects of sexual selection (e.g. Hosken et al., 2001Go; Nunn, 2002Go; Smith and Cheverud, 2002Go; Aparicio et al., 2003Go; Cox et al., 2003Go), which may be nonadaptive or even maladaptive with respect to natural selection. These methods can also be used to compare rates of evolution across clades or the amount of morphospace occupied by clades or by ecologically defined groups (Garland, 1992Go; Clobert et al., 1998Go; Ricklefs and Nealen, 1998Go; Garland and Ives, 2000Go; Hutcheon and Garland, 2004Go; McKechnie and Wolf, 2004Go). Of particular interest for the present review, they are also widely used to explore trade-offs (e.g. Clobert et al., 1998Go; Vanhooydonck and Van Damme, 2001Go) and to examine functional (mechanistic) relationships among traits (e.g. Lauder, 1990Go; Iwaniuk et al., 1999Go; Mottishaw et al., 1999Go; Autumn et al., 2002Go; Hale et al., 2002Go; Gibbs et al., 2003Go; Johnston et al., 2003Go; Herrel et al., 2005Go), including allometric scaling with body size (e.g. Garland, 1994Go; Reynolds and Lee, 1996Go; Williams, 1996Go; Clobert et al., 1998Go; Garland and Ives, 2000Go; Nunn and Barton, 2000Go; Herrel et al., 2002Go; Perry and Garland, 2002Go; Rezende et al., 2002Go, 2004Go; Schleucher and Withers, 2002Go; McGuire, 2003Go; Al-kahtani et al., 2004Go; McKechnie and Wolf, 2004Go; Muñoz-Garcia and Williams, in pressGo).

Comparative methods have been radically restructured over the past two decades, and now routinely incorporate both phylogenetic information and explicit models of character evolution. Indeed, Sanford et al. (2002Go) suggest that this new emphasis be termed the `comparative phylogenetic method'. As outlined in Blomberg and Garland (2002Go), this revolution in comparative phylogenetic methodology followed from several conceptual advances: (1) adaptation should not be casually inferred from comparative data; (2) the incorporation of phylogenetic information increases both the quality and even the type of inference from comparative data alone; (3) because all organisms are differentially related to each other, taxa cannot be assumed to be independent of each other for statistical purposes; (4) statistical analyses of comparative data must assume some model of character evolution for effective inference; (5) taxa used in comparative analyses should be chosen in regard to their phylogenetic affinities as well as the area of functional investigation; and (6) even phylogenetically based comparisons are purely correlational and inferences of causation drawn from them can be enhanced by other approaches, including experimental manipulations.

To expand on some of these points, `quality' in point 2 includes the simple fact that adding an independent estimate of phylogenetic relationships to a comparative analysis increases – often greatly – the amount of basic data that is brought to bear on a given question, whereas `type' refers to analyses that are simply impossible without a phylogenetic perspective, such as reconstructing ancestral values or comparing rates of evolution among lineages. Although phylogenetic information and a suitable analytical method may allow any comparative data set to be `rescued' from phylogenetic nonindependence (e.g. avoid inflated Type I error rates; point 3), phylogenetically informed choice of species (point 5) can accomplish more, such as actually increasing statistical power to detect relationships among traits (Garland et al., 1993Go; Garland, 2001Go). Finally, we note that point 6 was recognized long ago, but has been re-emphasized as phylogenetically explicit methods of statistical inference have been developed (e.g. see Lauder, 1990Go; Garland and Adolph, 1994Go; Leroi et al., 1994Go; Autumn et al., 2002Go).

The intent of this commentary is to provide a review of some advances that have occurred in the comparative method, with an emphasis on their place in comparative physiology. We examine the underlying reasons for the incorporation of phylogenetic information into comparative studies. In an Appendix, we give a brief overview of the three most commonly used and best understood phylogenetically based statistical methods: independent contrasts (IC; worked example in Fig. 5), generalized least-squares (GLS) models, and Monte Carlo computer simulations. These methods apply mainly to analysis of continuously varying (or at least quantitative) traits, which is the nature of most physiological traits (e.g. blood pressure, metabolic rate, enzyme activity). However, they can also easily incorporate independent variables that are treated as discrete categories, such as diet (e.g. insectivore, frugivore, sanguivore) or habitat (e.g. fresh or salt water). Discussions of methods for categorical traits and computer programs to implement them are available from Mark Pagel (e.g. see Pagel, 1999Go), in MacClade (Maddison and Maddison, 2000Go), and in Mesquite (http://mesquiteproject.org/mesquite/mesquite.html; see also Paradis and Claude, 2002Go). For a general listing of phylogeny-related programs, see the website maintained by Joe Felsenstein (http://evolution.genetics.washington.edu/phylip/software.html).

We discuss when phylogenetically based statistical methods should be used and give some practical examples of where a phylogenetic perspective has improved our understanding of comparative data and evolutionary processes. We also discuss some of the practical and theoretical limitations of such methods. Throughout, we try to emphasize that the incorporation of phylogeny can greatly enhance comparative studies, deliver new insights, and open new areas for research. This is of necessity only a brief summary and readers are directed to more extensive discussions of the topics and issues raised here (e.g. Ridley, 1983Go; Lauder, 1981Go, 1982Go, 1990Go; Harvey and Pagel, 1991Go; Garland et al., 1992Go, 1999Go; Garland and Adolph, 1994Go; Harvey, 1996Go; Ricklefs and Nealen, 1998Go; Ackerly, 1999Go, 2000Go, 2004Go; Pagel, 1999Go; Purvis and Webster, 1999Go; Diniz-Filho, 2000Go; Feder et al., 2000Go; Garland and Ives, 2000Go; Maddison and Maddison, 2000Go; Garland, 2001Go; Rohlf, 2001Go; Autumn et al., 2002Go; Blomberg and Garland, 2002Go; Brooks and McLennan, 2002Go; Blomberg et al., 2002, 2003Go; Rezende and Garland, 2003Go; Housworth et al., 2004Go). We have intentionally not cited some `forum' and `perspective' type papers because we felt that their rhetoric was misleading, and in some cases they contain outright errors.

The empirical examples cited here are idiosyncratic, reflecting mainly our own research interests. Thus, we emphasize examples that involve physiological phenotypes, but include others when they are lacking. Our enthusiasm for phylogenetic approaches in comparative physiology should not be taken to imply, however, that we think they are more important than other approaches, such as measurement of selection acting in natural populations, experimental evolution (e.g. see Garland and Carter, 1994Go; Bennett and Lenski, 1999Go; Ackerly et al., 2000Go; Feder et al., 2000Go; Garland, 2001Go, 2003Go; Bennett, 2003Go; Swallow and Garland, 2005Go), or more purely mechanistic investigations (e.g. Mangum and Hochachka, 1998Go; Hochachka and Somero, 2002Go).

We are concerned that some of our discussion of assumptions and intricacies of phylogenetically based statistical methods may be off-putting to those who simply want to analyze their data (see also Felsenstein, 1985Go). However, it must be acknowledged that statistical analyses in general are not always simple and have underlying assumptions that cannot be ignored. Most of the tools that we use in everyday research (e.g. correlation, regression, analysis of variance, analysis of covariance) have been around for 50 years or even a century. Nonetheless, the field of statistics (both theoretical and applied) continues to refine these methods. Such questions as what type of line is best for describing functional relationships (e.g. Rayner, 1985Go; chapter 6 in Harvey and Pagel, 1991Go; Riska, 1991Go; McGuire, 2003Go; Garland et al., 2004Go), how to deal with non-linear relationships (Quader et al., 2004Go) or random effects in ANOVA models, when to include or exclude interaction terms, how best to transform data, or when to employ nonparametric methods, still do not have simple, general answers. Moreover, new statistical methods continue to be developed, including computer-intensive approaches that were not possible 50 years ago (e.g. see Lapointe and Garland, 2001Go; Roff, in pressGo). For many statistical parameters, including comparative methodologies, several different approaches (and attendant algorithms) may be used for estimation, none of which performs `best' in all situations. We believe that it is important that a comparative biologist understand the assumptions and approaches underlying these methodologies, and does not just resort to their rote application, and that is the basis for our more detailed presentation.


    Phylogeny and modern (statistical) comparative methods
 TOP
 Summary
 Introduction
 Phylogeny and modern...
 An example of how...
 When and why to...
 Examples of the utility...
 Some notes of caution
 Summary and perspectives
 Appendix
 References
 
The beginning of the transition into modern comparative phylogenetic methods is marked by the publication of Ridley's book on mating adaptations (Ridley, 1983Go) and by Felsenstein's article entitled `Phylogenies and the comparative method' (Felsenstein, 1985Go). Both argued for the necessity of incorporating an explicitly phylogenetic perspective into analyses of comparative data. These authors were not the first to claim that comparative data generally violate the assumptions of conventional statistical methods (see Harvey and Pagel, 1991Go), but Felsenstein (1985Go) proposed the first fully phylogenetic method, i.e. one that could incorporate detailed information on topology and branch lengths, which he termed independent contrasts (IC). Although the full-blown IC method (see Appendix for description and worked example in Fig. 5) requires detailed information on phylogenetic topology, branch lengths (Fig. 1), and model of character evolution (Fig. 2) in order to be maximally reliable, Felsenstein (1985Go) also considered how one might make use of partial information, such as might be derived from a taxonomy that had some resemblance to actual phylogenetic relationships, e.g. by comparing several pairs of species within a series of genera, an approach that is now commonly used (e.g. Monkkonen, 1995Go; for a review of plant examples, see Ackerly, 1999Go). Nonetheless, the requirements of the method seemed daunting to many, and its use in comparative physiology grew slowly. Indeed, one of us even helped to develop an alternative phylogenetic method, partly because of a lack of information on branch lengths (see Figs 1, 2) in a comparative study that seemed to preclude the use of IC (Huey and Bennett, 1987Go; and see extensions in Garland et al., 1991Go; Martins and Garland, 1991Go).



View larger version (14K):
[in this window]
[in a new window]
 
Fig. 1. Hypothetical evolutionary relationships among 17 species of organisms. Vertical axis represents time in relative units, with the top of the `phylogenetic tree' representing the present. Hence, species 1–7 and 8–13 are alive now (extant), whereas species e1–e4 are extinct. Statements about phylogenetic relationships are based solely on recency of common ancestry. For example, species 1 and 2 are each other's closest relative because they share a more recent common history with each other than with any other species depicted in this figure. The horizontal axis is arbitrary, and note that nodes could be rotated for graphical convenience with no implication for evolutionary relationships. `Clades' are hierarchically arranged, `monophyletic' groups of species, including all species that have descended from a common ancestor as well as that basal ancestor. All species within a given clade are more closely related to each other (they share a more recent common ancestor) than to any species in another clade. In the strict sense, a clade includes all species that have ever existed within it. Thus, Clade B includes species 7 as well as e1–e4. However, as it is impossible to know of all extinct species within a given clade and as physiologists rarely include extinct taxa in their studies, the term `clade' is often used in a relative way with respect to a particular collection of species that are included in a given study. Consider a comparative physiological study of species 1–13. Species 1–6 might be referred to as Clade 1, while species 7–13 might be referred to as Clade 2. However, note that species 7 is relatively distantly related to the other extant species in Clade 2 (i.e. species 8–13 shared a last common ancestor much more recently than the last common ancestor of them with species 7). Hence, a researcher studying species 1–13 might prefer to write in terms of Clades A, B and C in order to highlight the fact that, a priori, she would expect species 7 to be somewhat different from species 8–13. (Importantly, a priori hypotheses about particular single species can be tested with well-established phylogenetically based statistical methods, although they may not be convincing to some regardless of the level of statistical significance; see Garland et al., 1993Go; Garland and Adolph, 1994Go; Garland and Ives, 2000Go.) Branch lengths in this figure are proportional to divergence times. All phylogenetically based statistical methods use branch lengths in their calculations, although some assume (arbitrarily) that each branch segment is equal in length. Alternatively, under the commonly assumed Brownian motion model of character evolution (see Fig. 2), branch lengths are assumed to be in units proportional to (relative) divergence times and hence to the variance of character evolution along each branch segment (i.e. longer branches imply greater variance of character change; see Felsenstein, 1985Go).

 


View larger version (17K):
[in this window]
[in a new window]
 
Fig. 2. Illustration of a Brownian motion (random walk in continuous time) model of character evolution, as might be implemented in a computer program (e.g. PDSIMUL of Garland et al., 1993Go). The goal is to simulate the evolution of two traits, beginning at the bottom of the phylogenetic tree and ending at the three tips, species A, B and C. A computer program begins at the bottom of the tree (internal node `F') with user-specified starting values, in this example 10 and 10 for Traits 1 and 2, respectively. It then draws a random datum from a bivariate normal distribution of hypothetical evolutionary changes for the two traits. This distribution is illustrated by concentric rings proportional to density of data points in the z axis (projecting out of the page), with darker indicating a higher density of points; the tails of the distribution diminish to infinity). In this example, we assume that the means of this distribution are 0 for both traits, such that no general tendency for either to increase or decrease will be modeled. We also specify 0 correlation between them, such that they will `evolve' independently, on average. For the amount of evolutionary change from node F to tip species C, we happen to draw values of –4 and –2 (red). Thus, species C has values of 6 and 8. For the change from node F to G, we draw +3 and +2 (blue). Above this, we draw two separate sets of changes: +1 and –2 leading to tip species A (green); –1 and +2 leading to tip species B (purple). Note that the amount of change tends to be greater for longer branches, reflecting a greater opportunity for evolutionary change. In practice, a computer program might achieve this by expanding or contracting the widths of the bivariate normal distribution for relatively longer or shorter branch lengths, respectively. Thus, under Brownian motion, for a given character, the variance of this distribution is set to be proportional to divergence time (along the length of each branch segment sequentially; Felsenstein, 1985Go, 1988Go). Note also that the distribution from which changes are drawn does not need to be (bivariate) normal (see Felsenstein, 1985Go, 1988Go). Moreover, the means of the distribution can be set to positive or negative values to impose directional trends in character evolution (Garland et al., 1993Go). These sorts of changes create models that are no longer simple Brownian motion.

 

Concern about the possible influence of phylogeny in comparative and ecological physiology antedated Felsenstein's (1985Go) publication. For example, explicit comparisons of marsupial with placental mammals (MacMillen and Nelson, 1969Go; Dawson and Hulbert, 1970Go) and of passerine with non-passerine birds (Lasiewski and Dawson, 1967Go) were motivated by cognizance of phylogeny, and some workers tried to partition the effects of phylogeny on physiological relationships (e.g. Andrews and Pough, 1985Go). Moreover, some workers voiced concerns about specific adaptive interpretations of characters shared more widely in their clades (e.g. Dawson and Schmidt-Nielsen, 1964Go; Dawson et al., 1977Go). What those earlier studies lacked was not necessarily a general perspective on the importance of phylogeny, but rather a formal logical and statistical methodology for incorporating detailed phylogenetic information. Analytical techniques have been greatly expanded and modified since 1985 (see below and Appendix), but Felsenstein's IC method is still the most widely used and his insights were pivotal to modernization of the comparative method. Moreover, the realization that IC is a special case of generalized least-squares (GLS) methods (see Appendix) means that the former can always serve as a useful entry point for the latter, and one that retains the major heuristic of `tree thinking' (sensu Maddison and Maddison, 2000Go).

Traditional interspecific comparative analyses applied conventional statistical methods to test for associations between traits (e.g. metabolic rate and body size), or between a trait and an environmental variable (e.g. blood oxygen carrying capacity and altitude). This approach treats all data points (e.g. mean values for a series of species) as statistically independent of each other. Unfortunately, mean phenotypes of biological taxa usually will not be statistically independent because they are all related through their hierarchical phylogenetic history. Empirically, more closely related species do indeed tend to resemble one another; put simply, hummingbirds look like hummingbirds, and turtles look like turtles, and the same is true for physiological traits (Blomberg et al., 2003Go; see below). This general tendency exists for several good biological reasons (Harvey and Pagel, 1991Go), including time lags for change to occur after speciation, occupation of similar niches by close relatives, and conservative phenotype-dependent responses to selection. Thus, the extent of these phylogenetic relationships – and hence the expected degree of resemblance – must also be figured into comparative analyses. Analytical techniques that do not incorporate phylogenetic information make the tacit statistical assumption that all the species studied are equally distantly related to each other, that is, that they descended along a `star phylogeny' (Fig. 3A), when in fact their ancestral associations are hierarchical (Fig. 3C).



View larger version (19K):
[in this window]
[in a new window]
 
Fig. 3. (A) Illustration of what conventional statistical analyses assume when applied to comparative data (`star' phylogeny with equal-length branches). This model implies that values at tips of the tree (phenotypic means for some trait for 10 species) are statistically independent and identically distributed. (B) Phylogenetic tree that might be inferred from taxonomic information, e.g. if five genera within a single family were represented that contained, from left to right, one, one, three, three and two species in the data set. This assumes that the genera are an unrelated series of `mini-stars' with no hierarchical structure within any of them. It also assumes that the taxa actually represent separate evolutionary lineages (monophyletic groups or clades), but such is not always the case for taxonomies. (C) Estimates of real phylogenies usually indicate hierarchical relationships and branches that do not necessarily line up along the tips of the tree. Non-contemporaneous tips can indicate that extinct taxa are included in the data set or that the rate of evolution has varied among branches. Real phylogenies like this cause various statistical problems, stemming from the non-independence of species' phenotypes, so phylogenetically based statistical methods are required for proper analyses. Modified from Garland (2001Go) and Rezende and Garland (2003Go).

 
The foregoing statement requires substantial amplification. First, there is an alternative way to view the tacit statistical assumption. A star phylogeny, as shown in Fig. 3A, is usually drawn to imply that a set of species all originated from a common ancestor at virtually the same point in the past, i.e. that a `big bang' of speciation occurred very rapidly for that particular set of species (and perhaps others not presently being considered). Alternatively, with respect to the assumption of statistical independence, it could imply that recent evolution of some character(s) has been so rapid that any evidence of successive speciation events is lost. In other words, some of the species may in fact be more closely related than others, phylogenetically speaking, but we would never know it just by looking at the characters we are trying to study: no `phylogenetic signal' remains. Extremely high measurement error could have a similar effect, but would also make us seriously doubt that the data were good enough for any sort of analysis. As discussed below, the issues of high trait lability and/or high measurement error can be addressed empirically, and recent studies have found that most traits do indeed exhibit phylogenetic signal, indicating that a star phylogeny does not provide a good fit to the data (Freckleton et al., 2002Go; Blomberg et al., 2003Go; Tieleman et al., 2003Go; Ackerly, 2004Go; Al-kahtani et al., 2004Go; Ashton, 2004aGo,bGo; Hutcheon and Garland, 2004Go; Laurin, 2004Go; Rezende et al., 2004Go; Rheindt et al., 2004Go; Ross et al., 2004Go; Muñoz-Garcia and Williams, in pressGo).

Second, it is important to consider what is meant by the `branch lengths' of a phylogenetic tree that is used for analysis. In general, proponents of phylogenetically based comparative methods assume that analyses of physiological and other traits will involve use of a phylogenetic tree that was inferred from other data, such as variation in DNA sequences, which is presumed to be independent of the data being analyzed. Otherwise, it seems intuitively obvious that analyses may involve some circularity. However, this is actually a complicated subject and beyond the scope of the present paper (Felsenstein, 1985Go; de Queiroz, 2000Go). Leaving aside the general issue of having available a phylogeny that is independent of the characters under study, the branch lengths of the working phylogenetic tree are confounded with the model and rates of character evolution that will be assumed for statistical analyses of most real data sets (see Figs 1, 2). In other words, we usually do not have independent information on, for instance, divergence times and selective regimes that may have prevailed along various branches of the tree. In any case, all of the main phylogenetically based statistical methods require branch lengths in units proportional to expected variance of evolution for the characters(s) under study (see Felsenstein, 1985Go, 1988Go; Garland et al., 1992Go, 1993Go, 1999Go; Garland and Ives, 2000Go; Rohlf, 2001Go; Blomberg et al., 2003Go; Housworth et al., 2004Go). Branch lengths essentially indicate our a priori expectations for how likely a given trait was to change (increase or decrease in value) from one node to another along a phylogenetic tree, and thus become an integral component of our statistical null model. Under a simple Brownian motion model, those branch lengths would necessarily be proportional to divergence times. Under any other model, such as the Ornstein–Uhlenbeck (OU) process, which is like Brownian motion while tethered to an elastic band and is used to model stabilizing selection or constraints on trait space (Felsenstein, 1988Go; Garland et al., 1993Go; Diaz-Uriarte and Garland, 1996Go; Martins and Hansen, 1997Go; Blomberg et al., 2003Go; Freckleton et al., 2003Go; Butler and King, 2004Go; Housworth et al., 2004Go), they would be more-or-less different from divergence times.

A simple hypothetical example can illustrate this distinction. Many traits evolve within limits set by physical or biological properties. Some of these are trivial. For example, body mass cannot evolve to be as small as 0 g. Others are more interesting. Apparently, for example, activity body temperatures (Tb) of squamate reptiles (lizards and snakes) cannot evolve to be more than about 42°C. We do not know the ancestral activity Tb of squamates, but it was probably substantially lower that 42°C. Thus, during their initial radiation and diversification, Tb would have been free to evolve, perhaps in a fairly Brownian motion-like fashion, with an increase or decrease about equally likely to occur along any branch of the phylogeny. However, lineages that `explored' the climate space towards higher Tb would eventually be constrained by the reduction in Darwinian fitness that can be caused by exceedingly high temperatures (e.g. via failure of spermatogenesis or outright death). Thus, if we were to depict a phylogenetic tree of squamates with branch lengths proportional to expected variance of Tb evolution, then we would need to know the Tb at the start of each branch segment and also have the branches be, in effect, different if the lineage was near a thermal limit, either upper or lower. That is, a lineage near an upper limit would have a low probability of evolving a higher Tb, but a `typical' probability of evolving a lower Tb, and vice versa. It should be obvious that our ability to specify such detailed branch-length information for any trait in any group of wild organisms is severely limited. Thus, for simplicity and/or analytical tractability, phylogenetically based statistical methods usually begin with an assumption of Brownian motion evolution along whatever branch lengths are specified in a working phylogeny (e.g. Fig. 1). And in many cases (e.g. see reviews of published studies in Blomberg et al., 2003Go; Ashton, 2004aGo), these will be arbitrary values, such as setting all segments equal to unity in length or by some other simple rule (e.g. Fig. 4B). In such cases, it is often prudent to perform computations with more than one set of branches as a sensitivity analysis for the conclusions (e.g. see Ashton, 2004bGo; Hutcheon and Garland, 2004Go; Laurin, 2004Go). Similarly, some studies use multiple phylogenies (topologies) (e.g. Bauwens et al., 1995Go; Symonds and Elgar, 2002Go; Hodges, 2004Go).



View larger version (48K):
[in this window]
[in a new window]
 
Fig. 4. Use of computer simulations to illustrate how Type I error rates for testing an association between two traits can be inflated by ignoring phylogenetic relationships. Shown are three distributions of ordinary, non-phylogenetic, Pearson product–moment correlations of tip data. In each of the three figures, data were simulated along the phylogeny shown, under a simple Brownian motion model of character evolution (see Fig. 2), with the correlation between the two traits set to zero (Martins and Garland, 1991Go; Garland et al., 1993Go). (A) Data simulated along a `star' phylogeny, here depicted as a `comb.' The upper 95th percentile is +0.504, which is statistically indistinguishable from the conventional critical value of +0.497. Compared with this distribution, the correlation for the real data on lizards (+0.585; see text) would be considered statistically significant at P<0.05. (B) Data simulated along the hierarchical topology, but with less extreme branch lengths (arbitrary values as suggested by Pagel, 1992Go) than for the real branch lengths shown in (C). For these simulated data, the 95th percentile is +0.641, which is larger than the conventional critical value. Judged against this empirical null distribution, the correlation for the real data would be considered statistically non-significant (P>0.05). (C) Simulations along the actual phylogeny used by Garland et al. (1991Go). The simulated data include an even greater number of sets for which the correlation is strongly positive, as compared with (B). The 95th percentile is +0.828, which is much larger than the nominal one-tailed critical value for testing a correlation coefficient (+0.497). If the phylogeny shown in C is close to reality, and if evolution has been similar to Brownian motion, then the results of C are more trustworthy than those of A.

 
As introduced above, for some models of evolution, including ones in which phenotypes respond essentially instantaneously (in evolutionary time) to changes in the selective regime, the appropriate branch lengths would be very long for those leading to tips of the tree and very short internally and near the base. In the limit, this becomes a star with no hierarchical structure (Fig. 3A). (A similar situation can arise if the tip data contain very large amounts of measurement error.) So, a conventional statistical analysis can be justified on first principles under some models of evolution, and computer simulations have confirmed this (Diaz-Uriarte and Garland, 1996Go; Price, 1997Go; Harvey and Rambaut, 2000Go; Martins et al., 2002Go). Furthermore, even if Brownian motion were an adequate descriptor of character evolution, we never have exact information on divergence times (and different characters likely evolve at different rates), so our branch lengths will always contain some amount of error. If that error were large enough, as in certain cases where evolution has been very much unlike Brownian motion, then we might be better off just assuming a star phylogeny, which can be accomplished by using conventional statistical methods. On the other hand, even if traits evolve very rapidly in response to altered environmental conditions (selective regimes), environments can have a phylogenetic history (ecological or niche conservatism), which would confer phylogenetic structure on trait evolution (Harvey and Pagel, 1991Go; Desdevises et al., 2003Go). Most organisms do not have infinite mobility, and hence descendant generations are likely to live fairly near the haunts of their ancestors, and habitat selection can accentuate this `inheritance' (see p. 30 in Garland et al., 1992Go). Indeed, several studies have shown that such traits as the latitude from which species (populations) were sampled can show significant phylogenetic signal (Freckleton et al., 2002Go; Hodges, 2004Go; Rezende et al., 2004Go; see also Desdevises et al., 2003Go).

These points have suggested to some that phylogenetically based analyses are so fraught with pitfalls that we should stick with non-phylogenetic ones. But a conventional statistical analysis actually has as many assumptions as a phylogenetic one. For example, it assumes that the species under analysis have not been interacting, e.g. as by character displacement (Hansen et al., 2000Go). It assumes that each species should be equally weighted, which is equivalent to saying that the heights of each branch from the root of the tree (assumed to be a star) are equal. And so forth.

In any case, it has become increasingly clear that, because we never know the true branch lengths and/or model of character evolution, we should pay careful attention to the branch lengths used, employing methods that can consider options ranging between a star and our working hierarchical phylogeny, and possibly something even more hierarchical. Thus, recent methods emphasize estimation of optimal branch length transformations as an essential part of phylogenetic analyses of comparative data (e.g. see Grafen, 1989Go; Diaz-Uriarte and Garland, 1996Go, 1998Go; Pagel, 1999Go; Harvey and Rambaut, 2000Go; Freckleton et al., 2002Go; Martins et al., 2002Go; Blomberg et al., 2003Go; Housworth et al., 2004Go). Although some researchers may be uneasy with such transformations of branch lengths, they are analogous to use of a Box–Cox procedure to find the optimal transformation of data (e.g. best approximation of normality) in conventional statistical procedures (for instance, use of a Box–Cox procedure to transform branch lengths; Reynolds and Lee, 1996Go). Moreover, aside from its benefits with computer-simulated data, such careful attention to branch lengths can sometimes improve statistical power to an important extent with real data (see below).


    An example of how phylogeny can affect statistical analyses
 TOP
 Summary
 Introduction
 Phylogeny and modern...
 An example of how...
 When and why to...
 Examples of the utility...
 Some notes of caution
 Summary and perspectives
 Appendix
 References
 
By overestimating the true number of independent observations, conventional statistical methods applied to comparative data typically lead to inflated Type I error rates, i.e. statistical significance is claimed too often (e.g. Grafen, 1989Go; Martins and Garland, 1991Go; Purvis et al., 1994Go; Diaz-Uriarte and Garland, 1996Go). A real example of the influence of phylogeny on interpretation of comparative data comes from a study that tested the hypothesis that the preferred body temperature and the optimal temperature for sprint running speed would be positively correlated among 12 species of lizards (Huey and Bennett, 1987Go; Garland et al., 1991Go). The ordinary Pearson correlation coefficient between these two temperatures, uncorrected for phylogenetic associations, is +0.585. Is this statistically significant, or might it have been obtained by chance sampling if the true correlation among all Australian skinks were zero?

The answer depends on what is assumed about the phylogenetic relationships of the 12 species. If we assume that species are unrelated, then we can refer to conventional tables of critical values for correlation coefficients. For a one-tailed test with 12 data points (and hence 10 degrees of freedom for testing a correlation), the critical value is +0.497, so a value of +0.585 would be considered significant at P<0.025. If, instead, we want to assume that the species are related in a hierarchical fashion, then we cannot use the conventional tables. Fortunately, however, we can incorporate phylogenetic information as follows (Martins and Garland, 1991Go; Garland et al., 1993Go). We can construct different working phylogenies, model the uncorrelated evolution of these traits by a Monte Carlo computer simulation that assumes random, Brownian-motion like trait change, and calculate a correlation coefficient for each simulated data set. We can then determine the critical 5% level for the correlation coefficient for each distribution. If we assume that all species are completely independent or related by a star phylogeny (Fig. 4A), then the one-tailed probability for obtaining a correlation as large as +0.585 is 0.023 (based on this particular set of 1000 simulated data sets), so the relationship is statistically significant at P<0.05. In fact, if we do a very large number of simulations, then we will obtain exactly the same results as when referring to conventional tables.

If, however, we simulate data along our best estimate of the phylogeny of these lizards (Fig. 4C), then a correlation of +0.585 would be observed much more frequently than 5% of the time and would not be considered very unusual, hence not statistically significant (P>0.15). If a hypothetical phylogeny with different branch lengths, involving fewer deep roots, were assumed (Fig. 4B), then a value of +0.585 would have a lower probability of being observed, but in this case would still be non-significant. Thus, the assumed pattern of the relationships among the species crucially affects the statistical significance of the observations: the more the phylogeny departs from a star, the lower is the number of effectively independent observations and the more likely we are to observe an extremely large (or small) correlation just by chance. If the working topology, branch lengths, and simulation model are somewhat realistic, then we will claim significance too often if we ignore phylogeny.

Another important point is that if the simulated data of Fig. 4B or C are analyzed with IC, using the corresponding phylogenies, then the resulting distribution of correlation coefficients will be the same as in Fig. 4A (results not shown). Thus, as discussed in the Appendix, the IC method uses the specified phylogenetic information to transform the data to make them independent and identically distributed. This then allows one to refer to conventional tables of critical values for hypothesis testing.

All of the simulations shown in Fig. 4 were done under a simple Brownian motion model (Fig. 2), but the model used can have a large effect on the resulting distributions of statistics (e.g. see Garland et al., 1993Go; Diaz-Uriarte and Garland, 1996Go; Price, 1997Go; Harvey and Rambaut, 2000Go; Martins et al., 2002Go; Freckleton et al., 2003Go). Brownian motion is a very simple model of character evolution, and its analytical tractability was exploited by Felsenstein (1985Go) to develop IC. It is a good model for traits that evolve solely by random genetic drift, and may also be adequate for some types of `fluctuating' selection (i.e. when the direction of selection changes from generation to generation). As a basis for statistical methods to estimate and test character correlations, it may also be an adequate model for traits that are subject to certain types of selection (Felsenstein, 1985Go, 1988Go; Grafen, 1989Go). But most traits probably evolve in ways that are too complicated and idiosyncratic to be modeled realistically by Brownian motion (Felsenstein, 1988Go; Hansen et al., 2000Go). Fortunately, simulations can use arbitrarily complex models of character evolution, limited only by one's ability to write computer programs and imagination (e.g. Garland et al., 1993Go; Diaz-Uriarte and Garland, 1996Go, 1998Go; Price, 1997Go; Harvey and Rambaut, 2000Go; Freckleton et al., 2003Go). Of course, whether more complicated models lead to more accurate analyses depends on whether they are actually a better descriptor of past evolution by the characters under study, and that is something very difficult to know. Moreover, finding a model that fits a set of data reasonably well does not necessarily mean that it is the correct model, and other models can probably be found that would provide equally good fit (see also Blomberg et al., 2003Go). (As always, it is risky to attempt to infer process from pattern.) Given the near impossibility of knowing how traits actually evolved in the distant past, simulation studies are also used to gauge how robust analytical methods such as IC are to violation of their assumptions (e.g. Brownian motion, accurate knowledge of branch lengths), how diagnostic tests can alert one to such violations, and how well remedial measures (e.g. transformations of tip data or branch lengths) can rescue statistical performance when assumptions are violated (e.g. Martins and Garland, 1991Go; Purvis et al., 1994Go; Diaz-Uriarte and Garland, 1996Go, 1998Go; Harvey and Rambaut, 2000Go; Diniz-Filho and Torres, 2002Go; Freckleton et al., 2003Go). Still, physiologists may sometimes be able to improve the accuracy of assumed models by their knowledge of how organisms work (or could have worked), as in the case of limits to body temperature evolution discussed above. Similarly, paleontological information can be used to improve the realism of simulations (Garland et al., 1993Go).

We close this section by emphasizing that the use of computer simulations to obtain `phylogenetically correct' (PC) null distributions for testing hypotheses about comparative data is a very general tool that can be used for virtually any analysis (Martins and Garland, 1991Go; Garland et al., 1993Go), including bivariate (e.g. Ricklefs and Nealen, 1998Go) or multivariate analyses of evolutionary diversification. For example, the PHYLOGR (available at http://cran.r-project.org/) program allows one to test hypotheses about canonical correlation or principal components analysis (PCA) in relation to computer-simulated data (R. Diaz-Uriarte and T. Garland, manuscript in preparation).


    When and why to use phylogenetic information in comparative studies
 TOP
 Summary
 Introduction
 Phylogeny and modern...
 An example of how...
 When and why to...
 Examples of the utility...
 Some notes of caution
 Summary and perspectives
 Appendix
 References
 
What kinds of characters are amenable to comparative phylogenetically based analyses? Basically, any measurable trait can be studied with these methods. It can be a discrete character, such as presence or absence of a structure, or a continuously distributed trait, such as the length of a bone or the value for a rate process. The methods are not confined to analysis of structure and function of individuals, but may also include aspects of ecology, such as home range size or environmental indices (e.g. mean annual temperature). However, for the analysis of evolutionary differences it is desirable to minimize the influence of the immediate environment on phenotypic characters prior to their measurement. That is, to the extent possible, all species should be exposed to common conditions (acclimated to the same environment) for some period of time prior to measurement. Ideally, they would be bred and raised under common conditions for one or two generations (Garland and Adolph, 1991Go). That is unfortunately not possible for many species. Moreover, given that the species under study probably vary in the conditions they inhabit, which set or sets of environmental conditions should be used? And what if the ordering of species phenotypes varies among those conditions because of genotype-by-environment interactions? These questions have no easy answers, but should be borne in mind during analysis and interpretation.

What kinds of characters demand a phylogenetic analysis? Although we may generally expect that most characters will tend to `follow phylogeny', this is an empirical question. The simplest general test for whether related organisms actually do tend to resemble each other more than they resemble those that might be chosen randomly with respect to phylogenetic position uses randomization procedures (see also Abouheif, 1994; Ackerly, 2004Go; Laurin, 2004Go; Rheindt et al., 2004Go). Specifically, once phylogenetically IC have been computed, it is possible to calculate the variance of those contrasts. The lower the variance of the contrasts, the better the fit of the phylogeny (topology and branch lengths) to the character in question. To determine whether a given variance indicates the presence of statistically significant `phylogenetic signal' (i.e. more closely related species tend to resemble each other more than they resemble randomly chosen species), one can compare it with the distribution of variances for a large number of data sets that have been randomized (shuffled) across the tips of the phylogeny (Blomberg and Garland, 2002Go; Blomberg et al., 2003Go). For studies with 20 or more species (for which statistical power should be reasonably high), more than 90% of the traits examined to date (including behavioral, physiological, morphological, life history and ecological/environmental traits) do exhibit a significant phylogenetic signal (P<0.05: Blomberg et al., 2003Go; Al-kahtani et al., 2004Go; Ashton, 2004aGo,bGo; Rezende et al., 2004Go; Ross et al., 2004Go; Muñoz-Garcia and Williams, in pressGo; see also Freckleton et al., 2002Go).

The empirical finding of pervasive phylogenetic signal implies that hierarchical phylogenies – as presented and used in numerous publications – provide a better fit to the data under analysis than does a star phylogeny (Figs 3A, 4A). This sends a strong message that we should routinely consider phylogenetic information in statistical analyses of comparative data. However, this does not necessarily mean that, for any given set of data, we should simply obtain a phylogenetic tree, perform an IC, GLS or Monte Carlo simulation analysis, and automatically presume that the results will be more reliable than the comparable conventional statistical analysis. As we and others have emphasized for more than a decade, analyses using a given topology and branch lengths can perform relatively poorly if their assumptions are severely violated (e.g. see Grafen, 1989Go; Martins and Garland, 1991Go; Diaz-Uriarte and Garland, 1996Go, 1998Go; Price, 1997Go; Garland and Diaz-Uriarte, 1999Go; Harvey and Rambaut, 2000Go; Diniz-Filho and Torres, 2002Go; Martins et al., 2002Go; Freckleton et al., 2003Go; Housworth et al., 2004Go). Thus, we urge practitioners to apply robust tests for phylogenetic signal, diagnostic checks, and branch length transformations as warranted (for recent discussions and methods, see Freckleton et al., 2002Go; Blomberg et al., 2003Go), and sensitivity analyses by varying branch lengths and/or model of evolution (e.g. see Garland et al., 1993Go; Ashton, 2004bGo; Hutcheon and Garland, 2004Go; Laurin, 2004Go; Muñoz-Garcia and Williams, in pressGo).

What sorts of branch lengths should be used? Given the uncertainties regarding branch lengths (see above), many workers have reported results with multiple branch lengths to explore consistency or the lack thereof (e.g. Ross et al., 2004Go). Although it is often the case that conclusions are relatively robust (insensitive) to the branch lengths used, this is not always true, and the importance of attempting to use `optimal' branch lengths transformations can be illustrated with two empirical examples. Garland et al. (1993Go) analyzed home range areas in relation to body size for 49 species of carnivores and ungulates. Conventional analysis of covariance (ANCOVA) indicated a highly significant (P<0.001) different in size-adjusted home range areas of the two groups. Analysis via IC (or Monte Carlo simulations), however, revealed no statistically significant difference (two-tailed P=0.126 for IC). The branch lengths used for the phylogenetic analyses were estimates of divergence times, derived from various sources. They passed the diagnostic `lack of fit' tests as described in Garland et al. (1992Go). However, power to detect a difference is apparently improved by applying the transformations of branch lengths as proposed by Blomberg et al. (2003Go) to mimic particular models of character evolution. Using the branches transformed under the OU model for log body mass, the P value is reduced to 0.099, and using their Accelerating–Decelerating (ACDC) model the P value is reduced to 0.044, thus crossing the typical threshold of <0.05 to be considered statistically significant (degrees of freedom were reduced by one in both cases to reflect the additional parameter estimated in these models; see also Diaz-Uriarte and Garland, 1996Go, 1998Go; Garland and Diaz-Uriarte, 1999Go). Similarly, in a recent comparison of the generic average body sizes of `megabats' and `microbats,' Hutcheon and Garland (2004Go) found statistical significance in the IC analysis only when using branch lengths transformed under the OU or ACDC models. We suspect that such increases in power may be more likely to occur in comparisons between groups that are fairly highly phylogenetically confounded (i.e. the independent variable of interest, such as diet, is highly clumped with respect to phylogeny, as in comparisons of clades; e.g. see Garland et al., 1993Go; Vanhooydonck and Van Damme, 1999Go; Perry and Garland, 2002Go; Rezende et al., 2004Go) than in tests of correlations between two traits.

How do we choose species for study? Traditionally, animals were chosen for comparative studies for any number of reasons, including convenience (e.g. local availability or an existing literature data base), possession of an interesting biological trait (e.g. the long neck of the giraffe), occupation of an extreme environment (e.g. a hot dry desert, the Arctic), or characteristics that make it well suited to study a particular physiological process (Garland and Adolph, 1994Go; Garland and Carter, 1994Go; Bennett, 2003Go). Frequently, a particular species or group living in a particular environment has been the key that originally sparked interest in the project. It is now clear that phylogenetic information should also be considered when choosing species for study (for a simulation study on the effects of taxon sampling when testing for correlated evolution, see also Purvis and Webster, 1999Go; Ackerly, 2000Go).

To increase analytical power, it is a good idea to include other species that experience a very broad range of the environmental (`independent') variable. You might then randomly sample species from a broad taxon (e.g. mammals) or focus exclusively on a particular lineage, such as bats or rodents. From a design perspective, the latter strategy is preferable because the broader comparison will involve distant relatives that vary in many traits, potentially complicating the analysis of particular traits of interest. From a phylogenetic point of view, comparisons of distant relatives are like an experiment with multiple uncontrolled variables (Garland and Adolph, 1994Go; Garland, 2001Go). To quote Felsenstein (1985Go, p. 465), `Comparative biologists tend to suspect comparisons of distantly related species; they hope to base their comparisons on recent evolutionary events that have not been overlaid by much subsequent change'. In principle, it might be possible to control for confounding traits that differ in distant relatives by including additional independent variables in the analysis, but it is often difficult to know a priori what those traits might be, let alone actually obtain quantitative data for them. In any case, an example in which casting too broad a net seems to reduce statistical power is provided by a study of body mass evolution in birds (Garland and Ives, 2000Go, p. 354). A comparison of passerines with their sister clade indicates that the former have significantly smaller log body masses, on average, whereas a comparison of passerines with all birds (including their sister clade) does not. (It should be noted that the identity of the sister clade of passerines is controversial, and the foregoing example may well change as improved phylogenetic information becomes available.) A related topic is whether one might a priori exclude certain subclades from a comparative analysis because they are `unusual' as compared with the larger clade in general. For example, many studies of lizards (e.g. Perry and Garland, 2002Go) exclude snakes. A recent study by Bininda-Emonds and Gittleman (2000Go) suggests that this sort of a priori data exclusion may be less warranted than is often presumed.

A particularly powerful comparative design is one that has several different pairs of closely related species that differ in the variable of interest (e.g. high and low temperature) and has these species pairs relatively distantly related to each other (i.e. in different branches of the phylogeny). As noted by Garland (2001Go), a particularly favorable distribution of this sort has the power to detect significant associations even when conventional statistical methods fail to do so. However, some workers choose species in this way, but then analyze only the pairs of tip species rather than performing a full analysis of the entire phylogeny (e.g. Lavergne et al., 2004Go). If that is done, Type I error rates should be correct, and the analysis should be robust with respect to errors in branch lengths and/or model of evolution, but statistical power will likely be lost (see Ackerly, 2000Go). A more extreme analytical variant is to perform a sign test on the tip pairs (Felsenstein, 1985Go), thus not using any information on branch lengths, but this comes at the extreme loss of statistical power (Ackerly, 2000Go).

The worst, that is, the least powerful, comparative design is one in which all species on one side of the root of the tree share, say, high values for an independent variable of interest (e.g. high temperature) and those on the other side of the root share low values (e.g. low temperature) (e.g. Garland et al., 1993Go; Garland, 2001Go). Although some methods can enhance inferential power in such situations (e.g. Schondube et al., 2001Go), it is not an attractive comparative scenario.

How many species or other taxa need to be included in a comparative study? In general, the statistical power of phylogenetically based analyses, when applied with an accurate phylogeny and model of character evolution, is the same as for conventional statistical methods, so standard power calculations can be employed (e.g. see fig. 5 in Garland and Adolph, 1994Go). However, it is also true that phylogenetic analyses sometimes uncover relationships that were not apparent in conventional analyses (see below).


    Examples of the utility of incorporating a phylogenetic perspective
 TOP
 Summary
 Introduction
 Phylogeny and modern...
 An example of how...
 When and why to...
 Examples of the utility...
 Some notes of caution
 Summary and perspectives
 Appendix
 References
 
One obvious utility of phylogenetically based analytical methods is their ability to estimate ancestral values (Schluter et al., 1997Go; Martins and Lamont, 1998Go; Cunningham et al., 1998Go; Garland et al., 1999Go). Such estimations would be impossible without phylogenetic information. Using phylogenetic methods, one can ask where a particular trait arose within the evolutionary diversification of a group and whether it arose once or multiple times (e.g. Schondube et al., 2001Go; Reznick et al., 2002Go; Johnston et al., 2003Go; Espinoza et al., 2004Go; Hodges, 2004Go; Lane et al., 2004Go; Rezende et al., 2004Go; Berenbrink et al., 2005Go). Inferences about nodal values permit the estimation of evolutionary change along each branch segment of an evolutionary tree and hence analysis of features correlated with that change, including inferences regarding the order of evolution of the components of a complex trait (e.g., see Lauder, 1980, 1981Go, 1990Go; Pagel, 1999Go; Autumn et al., 2002Go; Pérez-Barbería et al., 2002Go; Oakley, 2003Go; Berenbrink et al., 2005Go).

We will now review just a few examples from the literature in which phylogenetic information has added to our understanding and interpretation of comparative data. The first two of these deal with the evolution of lower metabolic rate in endotherms as part of their adaptation to desert environments.

It has long been recognized that low metabolic rates, low body temperatures, and an ability to become torpid would be beneficial to endotherms in hot, arid environments, in order to minimize heat load and energy (and water) demands in environments of low productivity (e.g. Dawson and Bartholomew, 1968Go; Dawson and Hudson, 1970Go; Williams, 1996Go; Tieleman et al., 2003Go; Rezende et al., 2004Go). When these traits were first discovered in desert caprimulgid birds (e.g. poorwills and nighthawks), they were initially interpreted as adaptations to desert conditions (e.g. Bartholomew et al., 1962