Am. J. Bot.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via ISI Web of Science (52)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Burleigh, J. G.
Right arrow Articles by Mathews, S.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Burleigh, J. G.
Right arrow Articles by Mathews, S.
Agricola
Right arrow Articles by Burleigh, J. G.
Right arrow Articles by Mathews, S.
(American Journal of Botany. 2004;91:1599-1613.)
© 2004 Botanical Society of America, Inc.


Invited Special Papers

Phylogenetic signal in nucleotide data from seed plants: implications for resolving the seed plant tree of life1

J. Gordon Burleigh2,4 and Sarah Mathews3

2Section of Evolution and Ecology, University of California, Davis, California 95616 USA; 3Arnold Arboretum of Harvard University, Cambridge, Massachusetts 02138 USA

Received for publication January 26, 2004. Accepted for publication June 17, 2004.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 LITERATURE CITED
 
Effects of taxonomic sampling and conflicting signal on the inference of seed plant trees supported in previous molecular analyses were explored using 13 single-locus data sets. Changing the number of taxa in single-locus analyses had limited effects on log likelihood differences between the gnepine (Gnetales plus Pinaceae) and gnetifer (Gnetales plus conifers) trees. Distinguishing among these trees also was little affected by the use of different substitution parameters. The 13-locus combined data set was partitioned into nine classes based on substitution rates. Sites evolving at intermediate rates had the best likelihood and parsimony scores on gnepine trees, and those evolving at the fastest rates had the best parsimony scores on Gnetales-sister trees (Gnetales plus other seed plants). When the fastest evolving sites were excluded from parsimony analyses, well-supported gnepine trees were inferred from the combined data and from each genomic partition. When all sites were included, Gnetales-sister trees were inferred from the combined data, whereas a different tree was inferred from each genomic partition. Maximum likelihood trees from the combined data and from each genomic partition were well-supported gnepine trees. A preliminary stratigraphic test highlights the poor fit of Gnetales-sister trees to the fossil data.

Key Words: Gnetales • multilocus analyses • phylogenetic signal • rate class • seed plant phylogeny • taxonomic sampling


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 LITERATURE CITED
 
Phylogenetic relationships among the five extant lines of seed plants remain controversial despite the recent accumulation of molecular data sets to address the question (reviewed in Magallón and Sanderson, 2002 ). These five lines comprise cycads, ginkgos, conifers, Gnetales, and angiosperms. Stratigraphic evidence places the origin of cycads, ginkgos, and conifers in the Paleozoic, with Gnetales and modern conifer families appearing in the Triassic to Jurassic, and angiosperms later in the Mesozoic (Stewart and Rothwell, 1993 ; Crane, 1996 ). From the Permian through the late Jurassic many seed plant lineages went extinct, including lyginopterids, medullosans, Callistophytaceae, glossopterids, Cordaitales, and Voltziales (Stewart and Rothwell, 1993 ), and their relationships with extant groups remain poorly characterized. Additionally, during the Cretaceous and Tertiary, the diversity of all surviving seed plant lines except angiosperms decreased (Knoll, 1984 ; Crane, 1987 ). Thus, as is common in studies of deep divergences, taxonomic diversity is incompletely captured in molecular data sets. Moreover, extant lines vary markedly with respect to levels of current diversity (cf., angiosperms with ~260 000 species and ginkgos with one species) and rates of divergence. In Gnetales, low diversity (70 species in three genera, Ephedra, Gnetum, and Welwitschia) is combined with high rates of divergence. Not surprisingly, the position of Gnetales has been one of the more enigmatic questions in studies of seed plant phylogeny, with molecular data sets strongly supporting alternative hypotheses (Fig. 1).



View larger version (20K):
[in this window]
[in a new window]
 
Fig. 1. Four major hypotheses of relationships among extant seed plant lineages. AN = angiosperms; CY = cycads; GI = Ginkgo; GN = Gnetales; PI = Pinaceae; CO = non-Pinaceae conifers

 
Prior to the use of cladistic methods, competing hypotheses united Gnetales with conifers (Bailey, 1944 ; Eames, 1952 ; Takhtajan, 1969 ; Bierhorst, 1971 ; Doyle, 1978 ) or with angiosperms (Arber and Parkin, 1907 , 1908 ; Wettstein, 1907 ). Chamberlain (1935) included Gnetales in his Coniferophytes along with conifers, ginkgos, and Cordaitales but considered their placement problematic and did not rule out a relationship with angiosperms. Cronquist (1968) and Thorne (1976) rejected a relationship with angiosperms, but did not strongly advocate an alternative position for Gnetales. Nonetheless, a series of cladistic analyses of morphological data united Gnetales with angiosperms (Parenti, 1980 ; Crane, 1985 ; Doyle and Donoghue, 1986 ; Loconte and Stevenson, 1990 ; Nixon et al., 1994 ; Rothwell and Serbet, 1994 ; Doyle, 1996 , 1998b ), seeming to confirm the views of Arber and Parkin (1907 , 1908) and Wettstein (1907) . In most cladistic analyses of morphological characters that included fossil taxa, angiosperms, Gnetales, Bennettitales, and Pentoxylon formed a clade (Crane, 1985 ; Doyle and Donoghue, 1986 , 1992 ; Nixon et al., 1994 ; Rothwell and Serbet, 1994 ). The term "anthophytes" was used for this clade because the aggregations of sporophylls in each line were interpreted as flower-like structures (Doyle and Donoghue, 1987 ). Doyle (1996) later found a glossophyte clade that nested Caytonia within the anthophytes and placed glossopterids as their sister clade. Gnetales were sister to angiosperms in the trees of Crane (1985) and Rothwell and Serbet (1994) but not in the trees of Doyle (1996) or Doyle and Donoghue (1986 , 1992 ). Nixon et al. (1994) found trees with angiosperms nested within Gnetales. Nonetheless, the morphological analyses were consistent in supporting the anthophyte hypothesis (Doyle, 1998a ).

This result was challenged when early analyses of molecular data placed Gnetales as sister to all remaining seed plants (Hamby and Zimmer, 1992 ; Albert et al., 1994 ) in "Gnetales-sister" trees or as sister to all other extant gymnosperms (Fig. 1; Hasebe et al., 1992 ; Goremykin et al., 1996 ). More recently, trees with Gnetales sister to all other extant gymnosperms also were recovered in parsimony analyses of plastid rpoC1 (Samigullin et al., 1999 ), AGL6 and AGL-like genes (Winter et al., 1999 ; Becker and Theissen, 2003 ), Floricaula/LEAFY (Frohlich and Parker, 2000 ), and phytochromes (Mathews and Donoghue, 2002 ; Schmidt and Schneider-Poetsch, 2002 ). Trees that placed Gnetales with angiosperms, consistent with an anthophyte concept, were rare. Hamby and Zimmer (1992) found that neighbor joining analyses of their ribosomal RNA data united Gnetales with angiosperms and that a parsimony tree with this clade was just one step longer than the Gnetales-sister tree. Parsimony analysis of 26S ribosomal DNA (rDNA) also united Gnetales and angiosperms but with bootstrap support below 50% and a Bremer support value of one (Stefanovic et al., 1998 ). Rydin et al. (2002) also obtained anthophyte trees from unweighted parsimony analyses of a 26S rDNA and a combined 26S rDNA and 18S rDNA data set. The lack of support for the anthophyte hypothesis is consistent with evidence that it would be difficult to infer from psaA or psbB using parsimony even if it were true (e.g., Sanderson et al., 2000 ).

Many subsequent analyses of molecular data provided support for the previous suggestion that Gnetales were related to conifers (Bailey, 1944 ; Eames, 1952 ; Bierhorst, 1971 ; Doyle, 1978 ). Analyses of 18S rDNA contradicted the trees of Hamby and Zimmer (1992) , uniting Gnetales with a monophyletic conifer clade in "gnetifer" trees (Fig. 1; Chaw et al., 1997 , 2000 ; Bowe et al., 2000 ; Rydin et al., 2002 ; Soltis et al., 2002 ), and several gene trees united Pinaceae (other conifers were not sampled) with Gnetales (Goremykin et al., 1996 ; Hansen et al., 1999 ; Samigullin et al., 1999 ; Winter et al., 1999 ; Antonov et al., 2000 ; Nickerson and Drouin, 2004 ). A series of multilocus analyses further suggested a close relationship between conifers and Gnetales but indicated that conifers were paraphyletic, uniting Gnetales with Pinaceae in "gnepine" trees (Fig. 1; Qiu et al., 1999 ; Bowe et al., 2000 ; Chaw et al., 2000 ; Nickrent et al., 2000 ; Gugerli et al., 2001 ). Gnepine trees also were obtained when 19 exemplars were sampled for eight loci representing all three genomes (Soltis et al., 2002 ). In contrast, analyses of a two-genome data set representing just nuclear and plastid genes but from many more taxa yielded highly supported Gnetales-sister trees in parsimony analyses (Rydin et al., 2002 ; contra Soltis et al., 2002 , p. 1679). Parsimony analyses of all sites in a ~13.5-kb multilocus plastid data set also gave a Gnetales-sister tree (Rai et al., 2003 ). The investigation of rooted and unrooted trees by Bowe et al. (2000) partially addressed the possibility that long-branch effects, introduced by the inclusion of divergent outgroups, were influencing the topology of seed plant trees. If their unrooted trees were rooted at Gnetales, Pinaceae or all conifers would diverge next from the remaining seed plants, conflicting with the topology of Gnetales-sister trees inferred in analyses that included outgroups, in which the next split was between angiosperms and the remaining seed plants (Hamby and Zimmer, 1992 ; Albert et al., 1994 ; Rydin et al., 2002 ; Rai et al., 2003 ). This suggests that inclusion of divergent outgroups might have influenced the rooting of Gnetales-sister trees.

In addition, several analyses revealed conflict and bias within molecular data sets. Third codon positions of two plastid genes psaA and psbB supported a Gnetales-sister tree, whereas first and second positions supported a gnepine tree; bias was detected at third positions in psaA and psbB and in first and second positions of psbB (Sanderson et al., 2000 ; Magallón and Sanderson, 2002 ). Additionally, hypothesis testing of the same psaA and psbB data sets using models that account for heterogeneity in the process of evolution among codon positions supported a gnepine hypothesis (Aris-Brosou, 2003 ). Rydin et al. (2002) found that two plastid genes supported a Gnetales-sister tree if transitions were included but supported gnepine trees when transitions were excluded. Analyses of the plastid sequences in the combined data set of Soltis et al. (2002) yielded results similar to those of Sanderson et al. (2000) and Magallón and Sanderson (2002) ; maximum parsimony (MP) analyses of all codon positions or of just third codon positions gave Gnetales-sister trees, whereas maximum likelihood (ML) and MP analyses of first and second codon positions gave gnepine trees (Soltis et al., 2002 ). A series of analyses also revealed that the use of different optimality criteria gave conflicting but well-supported trees (Samigullin et al., 1999 ; Bowe et al., 2000 ; Frohlich and Parker, 2000 ; Mathews and Donoghue, 2002 ; see detailed summary in Magallón and Sanderson, 2002 ). Notably, the trees differed markedly in several cases, with MP analyses placing Gnetales as sister to all other gymnosperms and ML or Bayesian analyses placing them as sister to Pinaceae (Samigullin, 1999; Frohlich and Parker, 2000 ; Mathews and Donoghue, 2002 ; Rydin and Källersjö, 2002 ). Finally, Rydin and Källersjö (2002) found that the rbcL data sets gave different results, depending on character weighting, method of analysis, and taxonomic sampling.

In summary, results of molecular analyses have highlighted the difficulty of resolving seed plant relationships, and the three hypotheses that receive the most support in molecular analyses, the gnepine, gnetifer, and Gnetales-sister trees, are not completely consistent with all the available data. Thus, it would be productive to carefully examine the available molecular data for further insight into the causes of ambiguity. Previous studies have suggested that taxonomic sampling may be partly responsible for inconsistent results (e.g., Rydin and Källersjö, 2002 ; Soltis et al., 2002 ), and this is a plausible explanation for the disparity among published trees. For example, the analyses of Qiu et al. (1999) , Bowe et al. (2000) , Chaw et al. (2000) , Gugerli et al. (2001) , Soltis et al. (2002) , and Rai et al. (2003) included 8, 17, 19, 17, 11, and 16 gymnosperms, respectively, whereas the analyses of Rydin et al. (2002) included 69 gymnosperms. Several studies also have revealed that heterogeneity in rates of nucleotide evolution across sites could be a confounding factor (e.g., Chaw et al., 2000 ; Sanderson et al., 2000 ; Magallón and Sanderson, 2002 ; Rydin et al., 2002 ; Soltis et al., 2002 ; Aris-Brosou, 2003 ). Other studies have shown that the choice of optimality criterion has an effect (e.g., Hamby and Zimmer, 1992 ; Hasebe et al., 1992 ; Bowe et al., 2000 ; Chaw et al., 2000 ; Frohlich and Parker, 2000 ; Magallón and Sanderson, 2002 ; Mathews and Donoghue, 2002 ). We conducted a series of analyses using data from 13 loci to further explore the effects of these factors on inference of seed plant phylogeny.


    MATERIALS AND METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 LITERATURE CITED
 
Genes and taxonomic sampling
We sampled data from GenBank (http://www.ncbi.nlm.nih.gov) in a way that maximized both the taxonomic sampling among gymnosperms and the number of loci sampled. We sampled loci that, at a minimum, had sequence data from at least one taxon from each of the major clades of seed plants as well as representative conifers, including members of Araucariaceae, Cupressaceae, Pinaceae, Podocarpaceae, and Taxaceae. Only Cephalotaxaceae, Phyllocladaceae, and the monotypic Sciadopityaceae were not included. Molecular data indicate that Cephalotaxaceae and Phylloclaceae are members of Taxaceae and Podocarpaceae, respectively (Quinn et al., 2002 ). Additionally, the locus had to be alignable across all seed plants. We augmented phytochrome data in GenBank with unpublished sequences, focusing on two loci, PHYP, an unambiguous homolog of angiosperm PHYB, and PHYN, a putative homolog of angiosperm PHYA (Sharrock and Mathews, in press ). In total, we sampled 13 loci, including four nuclear loci (18S rDNA, 26S rDNA, PHYP/B, and PHYN/A), five plastid loci (atpB, matK, psaA, psbB, and rbcL) and four mitochondrial loci (atpA, coxI, mtSSU, and nad5). For each of the 13 loci, we searched GenBank for all available gymnosperm sequences. We also included Equisetum and selected ferns as outgroups (Pryer et al., 2001 ). For the angiosperms, we selected sequences from four basal lineages: Amborellaceae, Nymphaeales, Austrobaileyales, and Chloranthaceae (e.g., Mathews and Donoghue, 1999 ; Parkinson et al., 1999 ; Qiu et al., 1999 ; Doyle and Endress, 2000 ). The single-locus data sets that contain all available gymnosperm sequences are referred to as the complete taxon sampling data sets. Each complete taxon sampling data set was aligned using AVID (Bray et al., 2003 ; http://baboon.math.berkeley.edu/mavid/, and then manually adjusted. The alignments of 26S rDNA, matK, and mtSSU had large regions that could not be easily aligned, and these regions were excluded from analyses. We removed all matK outgroup sequences because they were extremely difficult to align with the seed plant sequences. The PHYP/B and PHYN/A alignments contain two indel regions of approximately 20 nucleotides each that also were excluded. The accession tables from the complete sampling data sets are archived in the Appendix (see Supplemental Data accompanying online version of this article).

To combine the single-locus data sets, we identified a set of 31 exemplar genera, including 21 gymnosperm genera, for which sequences were available from at least 10 of the 13 loci (see Appendix in Supplemental Data). In some of the complete taxon sampling data sets, certain genera were represented by multiple species. In these cases, all but one species per genus were removed, and the resulting single-locus data sets are referred to as the limited taxon sampling data sets. The limited taxon sampling data sets were combined to build multilocus combined data sets. This sometimes required merging sequences from different species to represent a single genus (see Appendix). We built combined data sets that included all loci, only nuclear loci, only plastid loci, and only mitochondrial loci. The combined data set of all loci contained 31 genera and is 18 906 nucleotides long. The nucleotide sequence sampled for a single taxon ranged from 7785 nucleotides to 18 042 nucleotides, and the median for a taxon was 12 760 nucleotides. In the combined data set, 32.5% (31.1% nuclear, 23.9% plastid, and 47.0% mitochondrial data) of the cells are coded as missing data. Though this is a large percentage of missing data, the number of complete characters, which may have a greater effect on phylogenetic accuracy (Wiens, 2003 ), is also large.

Phylogenetic analyses
We performed ML phylogenetic analyses on each of the complete taxon sampling, limited taxon sampling, and combined data sets using PAUP* (Swofford, 2002 ). In each analysis, we used a general reversible model (e.g., Tavaré, 1986 ) with rate variation among sites estimated using a discrete gamma distribution with four categories (Yang, 1994 ) and a separate category for the percentage of invariable sites. To improve the computational tractability of analyses that implement this complex model, we estimated all substitution parameters from a parsimony tree and used these estimates in the ML tree search. We found that the estimates of substitution parameters varied little among major seed plant hypotheses (not shown), consistent with the observation that substitution parameters are often similar over a wide range of trees (e.g., Yang et al., 1994 ; Sullivan et al., 1996 ). Our ML analyses used a heuristic tree search algorithm consisting of one run of random sequence addition and TBR branch swapping (Swofford et al., 1996 ). In the case of three of the largest trees (the complete taxon sampling trees from 18S rDNA, rbcL, and matK), the running time of the branch swapping was limited to 1 week. We performed 100 replicates of nonparametric bootstrapping to assess the confidence in the ML topology (Felsenstein, 1985 ). The tree search for the bootstrap replicates was identical to the initial tree search. However, the single-locus, complete taxon sampling data sets for 18S rDNA, 26S rDNA, atpB, matK, and rbcL were too large to bootstrap with the available computational resources.

We also performed MP analyss on the combined data sets. The MP analyses used a heuristic tree search with 1000 random addition sequence replicates and TBR branch swapping. To assess support, we performed 1000 bootstrap replicates, each with 10 random sequence addition replicates.

Effects of taxonomic sampling and substitution parameters on phylogenetic inference
Although the effect of taxonomic sampling on phylogenetic inference remains controversial (e.g., Rosenberg and Kumar, 2001 ; Pollock et al., 2002 ), it may influence seed plant analyses (Magallón and Sanderson, 2002 ; Rydin and Källersjö, 2002 ; Soltis et al., 2002 ). Thus, while the number of extinctions within seed plants limits our ability to explore the effects of taxonomic sampling on the inference of seed plant phylogeny, it is important to explore its effects on the inference of trees from molecular data, particularly given the small number of gymnosperms included in most molecular analyses of seed plant data. We examined the effect of taxonomic sampling on ML analyses by comparing the seed plant trees reconstructed from the complete taxon sampling data sets, which included up to 361 taxa, with trees reconstructed from the limited taxon sampling data sets, which contained at most 30 sequences. Specifically, for each locus we compared ln likelihood differences ({delta}) between gnepine and gnetifer trees to see if {delta} changed with changes in taxonomic sampling. These tests focused on how taxonomic sampling might affect the ability to distinguish between gnetifer and gnepine hypotheses. The taxonomic sampling for nad5 was sparse, and thus, we excluded it from this test.

We also examined how robust the phylogenies from the limited taxon sampling data sets were to changes in the estimated parameter values. For example, estimates of rate variation may be more accurate when taxonomic sampling is greater (Sullivan et al., 1999 ). In particular, small data sets may overestimate the percentage of invariable sites and the shape parameter ({alpha}) of the gamma distribution that describes rate variation among sites (Sullivan et al., 1999 ). We repeated the ML analyses (unconstrained and constrained to gnepine and gnetifer trees) for each of the limited sampling data sets using substitution parameters estimated from the complete taxon sampling data sets. Finally, we explored the possibility that substitution parameters estimated from combined data sets might yield different results when used for analyses of single-locus data sets. To do this, we compared our results from ML analyses using parameters estimated from the limited taxon data sets with results from analyses of these same data sets using parameter values estimated from the combined data set.

Effect of evolutionary rate on phylogenetic signal
Previous studies of seed plant relationships have noted different phylogenetic signals in the slowly evolving first and second codon position sites when compared with the rapidly evolving third codon position sites (Chaw et al., 2000 ; Sanderson et al., 2000 ; Magallón and Sanderson, 2002 ; Soltis et al., 2002 ). To determine the distribution of phylogenetic signal among sites evolving at different rates, we first partitioned the 13-locus combined data set using an estimate of the evolutionary rate of a particular site. We estimated the most likely rate class for each site (e.g., Yang, 1994 ) based on the general reversible model (Tavaré, 1986 ) with invariable sites and rate variation among sites following a discrete gamma distribution with eight rate categories using HYPHY (Muse and Pond, 2002 ). We used the MP tree to estimate the likelihood parameters including the rate classes; however, we also tried this analysis with different trees and noted very little difference in the rate class assignments for sites. This analysis partitioned the data into nine rate classes, with rate class (RC) 0 representing the invariable sites, and RC1–RC8 representing the eight discrete rate categories of the gamma distribution. RC8 represents the most rapidly evolving sites. Partitioning by rate class instead of codon position allowed us to partition the noncoding loci like 18S rDNA, 26S rDNA, and mtSSU, and it provided a way to standardize rate class estimates across all loci. This procedure also avoids the arbitrary placement of third codon position sites, which may evolve more slowly in some loci than in others, into the faster rate classes.

For the ML analysis, we examined the likelihood score for each site on the unconstrained tree (a gnepine tree) and on the optimal tree from a search constrained to the gnetifer topology. We calculated the likelihood difference for each site under the two hypotheses, and we examined the relationship between observed likelihood difference and the rate class of a site. For the MP analysis, we examined the parsimony score for each site on the MP tree (a Gnetales-sister tree) and on gnepine and gnetifer constraint trees. We identified the sites that have different parsimony scores for the different seed plant hypotheses, and we examined their distribution among the different estimated rate classes.

Analysis of stratigraphic data
Previous studies have noted that Gnetales as sister to all extant seed plants contradicts the stratigraphic record (Crane, 1996 ; Doyle, 1998a ), but there are few formal tests of this observation. Doyle (1998a) found that trees with Gnetales sister to the rest had a greater percentage of "ghost lineages" or missing fossil data than anthophyte trees. However, Doyle (1998a) did not explicitly examine the fit of stratigraphic data to trees that join Gnetales and conifers. A likelihood-based approach allows one to calculate the significance of the likelihood ratio ({delta} in Fig. 8), comparing the fit of the stratigraphic data between two trees with Monte Carlo simulation (Huelsenbeck and Rannala, 1997 , 2000 ; see also Felsenstein, 2003 ). We did not perform these tests because we could not estimate {delta} without a thorough examination of the fossil record, particularly to estimate the number of fossil observations (N). However, we did calculate likelihoods using different values of N to estimate the quantity of data that might be required to distinguish among seed plant trees using this approach. The approach is based on calculating the likelihood of observing the stratigraphic data based on a model of fossil preservation. The likelihood model assumes that fossil preservation is a Poisson process with an equal probability of fossil preservation in all lineages and across all strata. For each topology, the fossil data have the highest likelihood when the amount of evolutionary time in which there are no fossil observations is minimized. Though the model is simple, it allows information about the distribution of fossils through time to be incorporated into tests of evolutionary hypotheses. We do not have a complete data set of the stratigraphic distributions for all seed plant lineages, but we can demonstrate how the likelihood for each hypothesis varies with different numbers of fossil observations. We first mapped the times of first and last appearance of the lineages from the Doyle (1996) tree onto anthophyte, Gnetales-sister, and gnepine/gnetifer trees. The times of first appearance of angiosperms were obtained from Magallón and Sanderson (2001) , and the rough estimates of first and last appearance of the seed plant lineages were obtained from Stewart and Rothwell (1993) . Changing the dates of first or last observation of a few lineages may change the overall likelihood values, but in preliminary analyses they had little or no effect on the likelihood ratios (not shown). We excluded four angiosperm taxa (Austrobaileya, Autunia, Eupomatia, and Piperales) for which we could find no reliable appearance dates, leaving 32 lineages in each tree. Using Felsenstein's (2003 , p. 554) likelihood equation, the ln likelihood of the fossil data given a tree = –N + N(ln N) – N(ln T), where N equals the total number of fossil observations, T equals the total time length of the tree, and the average rate of fossil preservation ({lambda}) may be estimated by N/T. The value of N is equal for all trees, but T depends on the tree topology. Because we do not know N without a complete record of the stratigraphic distribution of all lineages, we estimated the likelihoods using different N values. We chose N values ranging from 64, or two fossil observations per lineage, up to 320. The likelihood is maximized for a phylogenetic tree when T is minimized. The smallest possible T for a given tree can be calculated by finding the latest possible origin for each lineage based on the fossil record of its sister lineage (Benton and Storrs, 1994 ). The fit of the stratigraphic data to the gnepine and gnetifer trees is similar, and therefore, we do not examine the gnetifer tree separately.



View larger version (18K):
[in this window]
[in a new window]
 
Fig. 8. The likelihood ratio test comparing the fit of stratigraphic data to seed plant hypotheses as the number of fossil observations increases. "{delta}" is the likelihood ratio of the stratigraphic data from the comparison of two different seed plant hypotheses. "N" is the total number of fossil observations across all lineages. The likelihood model assumes that fossil preservation follows a Poisson distribution with a single rate across all lineages and all strata. The "AN—GS" line compares the anthophyte and Gnetales-sister trees, the "AN—GP" lines compares the anthophyte and gnepine trees, and the "GP—GS" line compares the gnepine and Gnetales-sister trees

 

    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 LITERATURE CITED
 
Single-locus phylogenetic analyses
Trees from the single-locus data sets differ, but bootstrap support generally is low (Fig. 2). Overall, the highest bootstrap percentages support monophyly of the gymnosperms (Fig. 2). Support for a gnepine tree is highest in the atpA tree (100%) and lower in the matK (85%) and psaA (76%) trees (Fig. 2). Support for a gnetifer tree is highest in the 18S rDNA tree (71%; Fig. 2). The Gnetales-sister tree is not supported in ML analyses of any of the 12 loci. We also measured the likelihood ratio, or difference between the log likelihoods, between the optimal trees compatible with gnepine and gnetifer constraints ({delta} = ln LGP – ln LGF). A positive {delta} indicates that the gnepine hypothesis is more likely than the gnetifer hypothesis, and a negative {delta} indicates the gnetifer hypothesis is more likely than the gnepine hypothesis. For eight of the 12 loci, {delta} was positive in all analyses, and for another, rbcL, {delta} was positive in three of the four analyses (Fig. 2). The largest {delta} was obtained in analyses of atpA, but large values were also obtained in analyses of psaA, and PHYP/B and PHYN/A, in each case indicating that gnepine trees were more likely than gnetifer trees. Although large values were also obtained in analyses of mtSSU and the 26S rDNA complete sampling data set, the ML trees were not gnepine trees (Fig. 2). For two loci, 18S rDNA and atpB, gnetifer trees were optimal in all analyses (Fig. 2). The ML tree from the analysis of the rbcL complete taxon sampling data set is also a gnetifer tree, but the gnepine tree is nearly as likely. The gnetifer hypothesis also is more likely than the gnepine hypothesis in analyses of coxI data, but the optimal coxI ML tree is neither a gnetifer nor a gnepine tree (Fig. 2).



View larger version (55K):
[in this window]
[in a new window]
 
Fig. 2. The effect on seed-plant tree inference of sample size and estimates of substitution parameters for 12 loci. For each locus, trees inferred from the limited taxon data set and from the complete sampling data set are on the left and right, respectively. Maximum likelihood (ML) bootstrap percentages are on the branches of all limited taxon sampling data sets, and branches with bootstrap percentages less than 50 were collapsed. ML bootstraps are also on the complete taxon sampling trees for data sets that could be bootstrapped. In trees without bootstrap percentages, optimal trees from the likelihood searches are shown. Tables below the trees show the negative log likelihood values for unconstrained ML, the gnetifer (GF), and the gnepine (GP) constraint trees. {delta} represents the likelihood difference between the gnepine and gnetifer constraint trees for each analysis (= ln LGP – ln LGF). A positive {delta} means the gnepine tree has a higher likelihood than the gnetifer tree, and a negative {delta} means the gnetifer tree has a higher likelihood than the gnepine tree. The "limited" column shows the likelihood values for the limited sampling data sets with the substitution parameters estimated from the data. The "Complete P" column shows the likelihood scores for the limited sampling data set using substitution parameters estimated from the complete sampling data, and the "Combined P" column shows the likelihood score for the limited sampling data set using substitution parameters estimated from the combined 13-locus data set. The "Complete" column shows the likelihood scores from the complete sampling data sets. AN = angiosperms; CY = cycads; GI = Ginkgo; GN = Gnetales; PI = Pinaceae; CO = all other conifers. The trees are all rooted with fern or Equisetum outgroups

 
Effect of sample size and substitution parameter values
In general, taxonomic sampling appears to have little effect on the likelihood of inferring a gnepine or gnetifer tree from single-locus data sets (Fig. 2). For 11 of the 12 loci, taxonomic sampling does not affect the sign of {delta}, meaning the likelihood of a gnepine relative to a gnetifer tree does not change with taxonomic sample size (Fig. 2). The exception is rbcL (Fig. 2). Gnepine trees inferred from the limited sampling rbcL data set are slightly better than gnetifer trees, whereas the reverse is true in analyses of the complete sampling data set, with {delta} < 1 (Fig. 2). For two loci, 26S rDNA and atpA, differences in taxonomic sampling have a large effect on {delta}. The 26S data set is striking because the likelihoods of the gnepine and gnetifer trees are similar in analyses of the limited sampling data set, but the {delta} is by far the highest found in analyses of any complete sampling data set (Fig. 2). However, the likelihood tree inferred from the 26S complete sampling data set is not a gnepine tree and not consistent with any proposed seed plant hypothesis. The trend is reversed in analyses of atpA, in which adding just seven taxa, the smallest difference between any limited and complete sampling data set pair, leads to a change in {delta} of 48.51 (Fig. 2).

Estimates of substitution parameters appear to have little effect on the ML analyses of the limited sampling data sets. Changing the substitution parameters in analyses of the limited sampling data sets does affect branch length estimates, and it may even affect the tree topology; however, changing the substitution parameters has little effect on the likelihood of the gnepine relative to the gnetifer hypothesis. The sign of {delta} never changes among results from analyses of the limited sampling data set using different parameters, and the largest change in {delta} due to changing parameters is 3.22 in coxI (Fig. 2). The percentage of invariable sites and the alpha shape parameter estimated from the complete data sets were generally lower than the estimates from the limited sampling data sets (not shown). However, the greatest change in {delta} due to using parameters from the complete sampling data sets in analysis of the limited taxon sampling data set is 2.71, again in analyses of coxI (Fig. 2).

Distribution of phylogenetic signal among rate classes
Estimation of the rate class for each site placed the majority of sites in RC0, the invariable rate class, with the mitochondrial loci having the highest proportion of invariable sites (65.1%; Table 1). There were almost no sites in RC1 or RC2 and few sites in RC3 (Table 1). The majority of variable sites were in RC4–RC8, with each rate class in this interval containing between 6.4% and 9.1% of the total sites. The rate class having the highest proportion of sites varies by genome. The nuclear loci had the most sites in the fastest rate category, RC8 (14.1%), plastid genes had the highest number in RC7 (11.1%), and the mitochondrial loci had the highest number in RC4 (10.8%; Table 1).


View this table:
[in this window]
[in a new window]
 
Table 1. Rate class (RC) estimates for the nuclear, plastid, and mitochondrial genomes. RC0 refers to invariable sites, and RC1–RC8 refer to the discrete rate classes in the gamma distribution, ordered from slowest to fastest. Columns list the number of sites in each rate class, with the percentage of sites per genome in parentheses

 
Although the great majority of sites in all rate classes had similar likelihoods on the gnepine and gnetifer trees, there was a greater range of likelihood differences among sites in the faster rate classes (Fig. 3). More sites in RC4 and RC5 appear to have a higher likelihood on the gnepine tree than on the gnetifer tree, but in RC6–RC8, the number of sites with a higher likelihood on the gnepine tree is similar to the number of sites that have a higher likelihood on the gnetifer tree (Fig. 3). When we summed the likelihood differences obtained for each rate class, we found that the greatest likelihood differences in favor of the gnepine tree were at RC4 and RC5 (Fig. 3). However, the highest per site average difference ({delta}) was in RC5 followed closely by RC3 and RC4 (Fig. 3). The variance in {delta} was relatively stable among rate classes, with the highest variance in rate classes 5, 6, and 7 (Fig. 3). There was virtually no overall difference between the gnepine and gnetifer hypotheses at RC8 (Fig. 3).



View larger version (18K):
[in this window]
[in a new window]
 
Fig. 3. Distribution of sites that have higher likelihoods for gnepine or gnetifer hypotheses among rate classes. In the graph on the left, each dot represents the ln likelihood difference at a site when optimized on the gnepine and gnetifer constraint trees inferred from the combined 13-locus data set. Positive {delta} values indicate that the gnepine hypothesis has a higher likelihood than the gnetifer hypothesis, and negative values indicate that the gnetifer hypothesis has a higher likelihood than the gnepine hypothesis. The smaller graph in the top right corner sums together all the {delta} values for the sites in rate classes 3–8, and the graph on the bottom right shows the mean and variance of the {delta} values for each site in rate classes 3–8

 
In the MP analysis, sites in RC1 and RC2 were uninformative with respect to the gnepine, gnetifer, and Gnetales-sister hypotheses (Fig. 4A, B). In RC3–RC6, more sites had better parsimony scores on gnepine trees than on the Gnetales-sister (Fig. 4A) or gnetifer (Fig. 4B) trees. Conversely, in RC7 and RC8, more sites had better parsimony scores on the Gnetales-sister tree than on the gnepine tree (Fig. 4A). However, when gnepine and gnetifer trees were compared (Fig. 4B), the number of sites with a better parsimony score on the gnepine tree was similar to the number of sites with a better score on the gnetifer tree. We also noted that the number of informative sites differed between the two comparisons. There were more sites that distinguish between Gnetales-sister and gnepine trees than between gnetifer and gnepine trees (Fig. 4A, B).



View larger version (25K):
[in this window]
[in a new window]
 
Fig. 4. Distribution of parsimony informative sites that favor gnepine (GP), gnetifer (GF), or Gnetales-sister (GS) trees among rate classes estimated using likelihood. In 4A, the striped bars indicate the number of sites in which the parsimony score for the most parsimonious gnepine tree is better than the parsimony score for the most parsimonious Gnetales-sister tree. 4B similarly compares gnepine and gnetifer trees. Chi-square tests were performed for each rate class to test the null hypothesis that the sites in which the parsimony score is variable among seed plant hypotheses are equally likely to favor either hypothesis (e.g., Snedecor and Cochran, 1995 ). Stars above the bars indicate that a chi-square test rejects the null hypothesis ("*" P ≤ 0.05, "**" P ≤ 0.01, "***" P ≤ 0.001)

 
Phylogenetic analyses of combined data sets
ML analyses of all combined data sets recovered well-supported gnepine trees. The topologies of the nuclear, plastid, and mitochondrial gnepine trees (not shown) were similar to the topology of the 13-locus tree (Fig. 5). Bootstrap support for the gnepine hypothesis in trees from the different genomes ranges from 78% in the mitochondrial tree to 86% in the plastid tree and is 100% in the 13-locus tree (Fig. 5). The gymnosperms are monophyletic (98%), and the cycads are sister to the other gymnosperms (98%; Fig. 5). Ginkgo is sister to the Gnetales-conifer clade (100%; Fig. 5). Excluding the sites from the fastest rate classes had little effect on the ML analysis. For example, a ML analysis of the combined 13-locus data set with the RC8 sites excluded gave a tree similar to the 13-locus, also supporting the gnepine hypothesis (100%; not shown).



View larger version (20K):
[in this window]
[in a new window]
 
Fig. 5. Maximum likelihood tree inferred from the combined 13-locus data set. The tree is rooted with Equisetum. Bootstrap percentage (above 50) are placed on the branches (ln L =–124 544.1108)

 
In contrast, results from parsimony analyses of the combined data sets were strongly affected by the choice of data partition. Data from the different genomes gave different trees, as did exclusion of the most rapidly evolving sites. The 13-locus tree (Fig. 6) and the plastid tree (Fig. 7) place Gnetales as sister to all seed plants with 79% and 100% bootstrap support, respectively. The nuclear tree places Gnetales sister to all gymnosperms with 86% bootstrap support, whereas the mitochondrial tree unites Gnetales with Pinaceae (62%; Fig. 7). However, when sites from RC7 and RC8 were excluded from MP analyses, the 13-locus tree and all single-genome trees were gnepine trees, with bootstrap support for gnepines ranging from 83% in the mitochondrial tree to 100% in the 13-locus tree (Figs. 6, 7). Cycads are sister to all other gymnosperms in the trees inferred without RC7 and RC8 (Figs. 6, 7), as they are in the ML trees (e.g., Fig. 5).



View larger version (21K):
[in this window]
[in a new window]
 
Fig. 6. Parsimony trees inferred from the combined 13-locus data set. The tree on the left is one of two equally most parsimonious trees based on an analysis that includes all sites. Bootstrap percentages above 50 are on the branches (tree length = 21 514, CI = 0.57, RI = 0.59). The tree on the right is the parsimony tree inferred from the combined data set that excludes the 3378 sites from RC7 and RC8 (tree length = 7347, CI = 0.76, RI = 0.75). All trees are rooted with Equisetum

 


View larger version (45K):
[in this window]
[in a new window]
 
Fig. 7. Parsimony trees inferred from the combined nuclear, plastid, and mitochondrial data sets. The left column has the parsimony trees based on analyses of the nuclear (top, tree length = 8248, CI = 0.53, RI = 0.54), plastid (middle, tree length = 10 168, CI = 0.55, RI = 0.61), and mitochondrial (bottom, tree length = 2952, CI = 0.74, RI = 0.75) loci including all sites. The right column has parsimony trees based on analyses of nuclear (top, tree length = 1989, CI = 0.77, RI = 0.74), plastid (middle, tree length = 3684, CI = 0.71, RI = 0.71), and mitochondrial (bottom, tree length = 1664, CI = 0.88, RI = 0.86) loci excluding the sites from the two fastest rate classes. All trees are rooted with Equisetum

 
Stratigraphy
The total length of the evolutionary trees (T) ranged from 7564 million years in the anthophyte tree to 7795 million years in the Gnetales-sister tree. The length of the gnetifer and gnepine trees was 7595 million years. Therefore, the Gnetales-sister tree has the most fossil gaps, or time without fossil observations, and also the fossil data would have the lowest likelihood on the Gnetales-sister tree. Calculating the likelihood of the stratigraphic data requires knowing the number of fossil observations (N; Huelsenbeck and Rannala, 1997 , 2000 ; Felsenstein, 2003 ). Lacking these data, we assumed different numbers of total fossil observations to estimate how the likelihood differences would change with various amounts of data (Fig. 8). The likelihoods of the stratigraphic data given the anthophyte and gnepine trees were close, and increasing the number of fossil observations did little to separate the likelihoods (Fig. 8). However, even assuming that there were only 64 fossil observations, or two fossils per lineage, the likelihood difference between the anthophyte or gnepine and Gnetales-sister hypothesis is nearly two. If the number of fossil observations was increased to 320, the difference in log likelihoods increased to almost 10 (Fig. 8). The 320 fossil observations (or 10 per lineage) would represent an overall fossil preservation rate ({lambda}) of approximately one every 24 million years.


    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 LITERATURE CITED
 
Taxonomic sample size and estimates of substitution parameters
Previous studies have suggested that increased taxonomic sampling may help to resolve estimates of seed plant relationships (e.g., Rydin and Källersjö, 2002 ; Soltis et al., 2002 ). We used differences in log likelihoods estimated from limited and complete taxon sampling data sets to assess whether increasing taxonomic sampling resulted in improved likelihoods, focusing on the likelihoods of gnepine and gnetifer trees. For most data sets, differences in log likelihoods between trees increased when taxa were added (Fig. 2). However, the differences were small and thus do not indicate a strong effect of taxonomic sampling on the power to choose between these trees. The notable departures were 26S rDNA and mtSSU, both of which gave unusual phylogenetic results in analyses of the complete taxon sampling data sets. Additionally, in the case of atpA, the smallest data set included, the difference in log likelihoods dramatically decreased when just seven taxa were added. This result appears anomalous, but it is difficult to interpret without additional sequences. Taxonomic sampling also had almost no effect on the sign of {delta}, or whether the gnepine or gnetifer hypothesis had a higher likelihood. In only the case of rbcL did the sign change from positive to negative, indicating a switch from support for a gnepine tree to support for a gnetifer tree with the addition of taxa. However, there is little overall change in {delta} with increased taxonomic sampling in rbcL, and rbcL by itself appears to have little power to distinguish between gnepine and gnetifer trees no matter the taxon sampling. Rydin and Källersjö (2002) noted that taxonomic sampling affected the outcomes from equally weighted parsimony analyses of an rbcL data set from seed plants but not from weighted parsimony or Bayesian analyses using a general time reversible model. Magallón and Sanderson (2002) also noted the limited effects of adding taxa, pointing out that key branches are missing as a result of extinctions.

Taxonomic sampling may also affect the estimates of substitution parameters (Sullivan et al., 1999 ). We explored the possibility that these indirect effects of taxonomic sampling on phylogenetic inference might influence the power to distinguish between gnepine and gnetifer trees and found no evidence that this was the case. Differences in log likelihoods among analyses of the limited sampling data sets were similar, regardless of the parameter estimates used. Together, these observations indicate that the benefits of increased sampling of extant gymnosperms may be limited. Still, our analyses do not determine if sampling the extant gymnosperms is adequate to accurately resolve seed plant phylogeny. Extinctions of major groups of seed plants have left a sparse distribution of lineages available for molecular analyses.

Conflicting phylogenetic signal
Several previous studies have revealed conflicting phylogenetic signals within single loci (e.g., Chaw et al., 2000 ; Sanderson et al., 2000 ; Magallón and Sanderson, 2002 ; Rydin et al., 2002 ; Soltis et al., 2002 ). To further explore these effects on the power to distinguish among trees that have been supported in multilocus analyses, we examined the phylogenetic signal in each site after partitioning all sites into nine evolutionary rate classes. Comparisons of parsimony scores on the gnepine and Gnetales-sister trees (Fig. 4a) indicate that across all loci, more sites evolving at intermediate rates favor a gnepine tree, while more sites evolving at rapid rates favor a Gnetales-sister tree. Comparisons of parsimony scores on gnepine and gnetifer trees (Fig. 4b) indicate that there are fewer sites that distinguish among these trees than in the comparison of gnepine and Gnetales-sister trees.

Comparisons of site likelihoods on the gnepine and gnetifer trees suggest that sites evolving at intermediate rates favor the gnepine hypothesis, whereas phylogenetic signal in the rapidly evolving sites appears to be more ambiguous. Overall, the distribution of phylogenetic signal among rate classes is comparable to the pattern from simulated data reported by Yang (1998) , who addressed the question of optimal evolutionary rate for phylogenetic inference. The slowest sites may have little phylogenetic information, whereas intermediate sites have the most, and the amount of information in the fastest sites may decrease slightly due to heterogeneity in the signal. In our analyses, the decrease in information at the most rapidly evolving sites is more pronounced than was noted by Yang (1998) , and both the gnepine and gnetifer hypotheses are supported by many sites. Yang (1998) also noted that the utility of sites with different rates will vary depending on the tree shape and patterns of selection, factors that we did not explore but that are likely to be important in analyses of seed plant data. As noted for the parsimony scores, of nearly 19 000 sites in our combined data matrix, relatively few sites have greatly different likelihoods when compared on gnetifer and gnepine trees (e.g., 72 sites have |{delta}| > 1). However, the 100% bootstrap score in the ML analysis of the 13-locus data set indicates that character sampling error is not affecting the phylogenetic inference (Fig. 5). Together, the results from exploration of the distribution of phylogenetic signal among rate classes help to explain the inconsistency in results from analyses of single-locus data sets. Considering also that Gnetales-sister trees are less consistent with stratigraphic evidence than other hypotheses (Fig. 8; Doyle, 1998a ), our results indicate that the most prominent signal at the faster evolving sites is misleading in parsimony analysis.

Phylogenetic analyses of combined data
The 13-locus combined data set included a minimum of four loci from each genome, and thus is the most extensive three-genome character set so far used to address seed plant relationships. Previous analyses provided ambiguous evidence regarding the first split within gymnosperms (Bowe et al., 2000 ; Chaw et al., 2000 ; Nickrent et al., 2000 ; Gugerli et al., 2001 ; Soltis et al., 2002 ; Rai et al., 2003 ), whereas all of the deep branches in our 13-locus ML tree are supported by bootstrap percentages ≥98% (Fig. 5). Similarly, they are supported by bootstrap percentages of 100% in our 13-locus MP tree without RC7 and RC8 (Fig. 6). These trees indicate that Ginkgo is the sister of conifers and Gnetales and that cycads are the sister clade of all other gymnosperms (Figs. 5, 6). As in trees from other analyses, gymnosperms and angiosperms are monophyletic sister clades.

The ML tree and the MP tree from analyses without RC7 and RC8 unite Pinaceae and Gnetales with bootstrap support of 98% and 100%, respectively (Figs. 5, 6). There is no support in these trees for the suggestion that Gnetales might be embedded within Pinaceae (Soltis et al., 2002 ). The monophyly of Pinaceae is supported by bootstrap percentages of 100%. Previous multilocus analyses that sampled all three genomes also inferred well-supported gnepine trees in both parsimony and likelihood analyses (Qiu et al., 1999 ; Bowe et al., 2000 ; Chaw et al., 2000 ; Nickrent et al., 2000 ; Gugerli et al., 2001 ; Soltis et al., 2002 ); our 13-locus parsimony tree inferred from all sites, a Gnetales-sister tree (Fig. 6), is an exception. Multilocus analyses that sampled just the nuclear and plastid genomes (Rydin et al., 2002 ), or just the plastid genome (Rai et al., 2003 ), inferred well-supported Gnetales-sister trees in MP searches. However, our results do not support the suggestion that different trees might result from variable signal among genomes (e.g., Donoghue and Doyle, 2000 ; Rydin et al., 2002 ). All single-genome ML trees had at least 78% support for the gnepine hypothesis, and when the most rapidly evolving sites (RC7 and RC8) were excluded from parsimony analyses, all MP genome trees had at least 77% bootstrap support for the gnepine hypothesis (Fig. 7). Rather, as noted before, our results indicate that the Gnetales-sister signal is restricted to sites in the fastest two of nine evolutionary rate classes (Fig. 4A), indicating that the Gnetales-sister result may result from bias in the most rapidly evolving sites to which parsimony is particularly sensitive (Felsenstein, 1978 ).

Despite this convergence on the gnepine tree in molecular analyses, the result should be examined in light of evidence of possible bias or error in seed plant data sets. Sanderson et al. (2000) provided evidence of bias in first plus second codon position sites and/or in third codon position sites in two plastid data sets from seed plants and of erroneous parsimony reconstructions from these data. The bias at third codon positions was more pronounced than at first and second codon positions. The combination of very short branches at the base of the seed plant tree and very long branches leading to Gnetales and outgroups was implicated as a source of bias, and it appears that third codon positions evolve under very different processes of evolution than first and second (Sanderson et al., 2000 ). There may be additional factors, like model misspecification, that also could result in an erroneous phylogenetic inference. The gnepine and gnetifer topologies are very similar, and potentially even a small amount of error or bias could influence phylogenetic results. Thus, as we used stratigraphic evidence to evaluate the Gnetales-sister hypothesis, the implications of other lines of evidence should be given careful consideration.

Other sources of evidence
Despite the cladistic analyses of morphological data that supported anthophyte trees (Crane, 1985 ; Doyle and Donoghue, 1986 , 1992 ; Nixon et al., 1994 ; Rothwell and Serbet, 1994 ), the clade of Ginkgo, Gnetales, and conifers is consistent with some morphological and anatomical evidence (Figs. 4, 5). Gnetales and conifers share a number of morphological similarities, including linear leaves, reduced sporophylls (Doyle, 1994), and circular bordered pits with tori in the protoxylem, interspersed with annular thickenings (Bailey, 1944 ; Carlquist, 1996 ). The similarity in the wood anatomical characters is striking and shared only with Ginkgo (Bailey, 1944 ; Bierhorst, 1971 ). Ginkgo, Gnetales, and conifers also share with extinct Cordaitales a common simple strobilar structure consisting of an axillary short shoot bearing both sterile scale leaves and simplified sporophylls. In Gnetales and Cordaitales but not Ginkgo, both male and female simple strobili are aggregated into compound strobili. In most conifers, female strobili are compound, whereas male strobili are simple, and this has been viewed as a feature separating conifers from Gnetales. However, in some Podocarpaceae (Wilde, 1944 ) and in the Paleozoic walchian conifer, Thucydia mahoningensis (Hernandez-Castillo et al., 2001 ), both male and female strobili are compound. These observations indicate that male and female compound strobili are potential synapomorphies of conifers and Gnetales. The two groups also share details of embryogeny, the presence of binucleate sperm cells, post-fertilization development of the female gametophyte, and a similar pattern of double fertilization (Friedman and Floyd, 2001 ). The extent to which embryogeny, fertilization patterns, and microstructural characters would support or refute the gnepine hypothesis remains to be determined in the absence of a careful survey of the distribution of the relevant character states in seed plants, and where possible, in their outgroups, but further investigation along these lines may yield data relevant to estimates of seed plant phylogeny.

The implications of gnepine trees for the evolution of other characters seem more apparent. If the gnepine tree were correct, it would suggest that a number of conifer features have evolved in parallel or were lost from Gnetales, including resin canals, tiered proembryos, and the ovulate cone scale (e.g., Chamberlain, 1935 ; Crane, 1985 ; Hart, 1987 ; Donoghue and Doyle, 2000 ). The ovules of modern conifers are borne on a woody or fleshy scale that is considered to be homologous with the lax female short shoot of Cordaitales and thus derived by a series of steps in which some appendages were lost and others congenitally fused (Wilde, 1944 ; Florin, 1951 ). If reduction of the fertile short shoot occurred once, nesting Gnetales within conifers would require the reversion of the cone scale to an axis with sporophylls subtended by bracts. The diversity of ovulate cone structure suggests that such a reversion would require a series of ontogenetic changes. Thus, it has been viewed as unlikely, and the cone scale, conversely, as a good conifer synapomorphy. However, if the female short shoot were modified independently in separate lines of conifers (Florin, 1951 ; Doyle, 1998b ), a sister group relationship between Pinaceae and Gnetales would be less contradictory.

The causes underlying the ontogenetic diversity of ovulate cones is an important question that remains to be addressed. After an extensive survey, Tomlinson and Takaso (2002) have suggested that development is so diverse that (1) it might be taken as evidence that different types of female strobili have evolved independently or (2) that we cannot assume part-for-part homology without invoking heterochrony and heterotopy to an extent that markedly alters the model of reduction that Florin (1951) envisioned. In their view, there is strong selective pressure to protect ovules and/or seeds during development, achieved in diverse ways within seed plants, gymnosperms, and conifers themselves (Tomlinson and Takaso, 2002 ). If the gnepine hypothesis were correct, there seem to be two plausible alternatives. Reduction of the female short shoot may have occurred in a single lineage that is ancestral to modern conifers, followed by dissimilar elaboration of the reduced short shoot in different conifer lines, involving spatial heterogeneit