|
|
||||||||
Systematics |
2School of Life Sciences, Arizona State University, Tempe, Arizona 85287-4501 USA; 3Department of Plant Sciences, Montana State University, Bozeman, Montana 59717 USA; 4Section of Evolution and Ecology, University of California, Davis, California 95616 USA
Received for publication December 18, 2003. Accepted for publication June 10, 2004.
| ABSTRACT |
|---|
|
|
|---|
Key Words: caesalpinioid legumes Leguminosae matK mimosoid legumes papilionoid legumes phylogeny
| INTRODUCTION |
|---|
|
|
|---|
Reconstructing the phylogenetic relationships of the Leguminosae is essential for understanding the origin and diversification of this ecologically and economically important family of angiosperms. Comprehensive phylogenetic analyses of Leguminosae began with the plastid gene rbcL (Doyle, 1995
; Käss and Wink, 1995
, 1996
; Doyle et al., 1997
) following the early, widespread use of this gene for phylogenetic studies of land plant relationships (e.g., Chase et al., 1993
). Among the conclusions that emerged, the monophyly of the Fabales (sensu Angiosperm Phylogeny Group, 2003
) and the sister relationship of legumes to Polygalaceae, Surianaceae, and the rosaceous genus Quillaja Molina were very strongly supported (Doyle et al., 2000
). Second, the monophyly of Leguminosae is consistently resolved although not as strongly as for the Fabales (Doyle et al., 2000
; Kajita et al., 2001
). Third, while monophyly of mimosoid legumes (subfamily Mimosoideae) is well supported by the rbcL data (Doyle et al., 2000
), a more extensive sampling of the subfamily suggested certain mimosoid genera, Dinizia Ducke and Piptadeniastrum Brenan, were unresolved with respect to related caesalpinioid outgroups (Luckow et al., 2000
). Fourth, the subfamily Caesalpinioideae (caesalpinioids) is consistently resolved as paraphyletic with respect to mimosoids and papilionoids (e.g., Polhill et al., 1981
; Doyle et al., 2000
; Kajita et al., 2001
), although several well-supported subclades have been detected in recent studies of this subfamily; for example, the tribe Cercideae, resolved as the sister clade to the rest of the family (Doyle et al., 2000
), the tribe Detarieae sensu lato (s.l.), distributed principally in tropical Africa and including approximately half of the genera in the Caesalpinioideae (Bruneau et al., 2001
; Herendeen et al., 2003a
), and the "Umtiza" clade (Herendeen et al., 2003b
). Lastly, the traditionally recognized subfamily Papilionoideae (sensu Polhill, 1981a
, 1994
) is consistently resolved as monophyletic, albeit with only modest support (e.g., Doyle et al., 1997
; Kajita et al., 2001
).
The Papilionoideae has received the most attention, if only because it is the largest and most widespread of the three legume subfamilies with an estimated 476 genera and 13 860 species (Lewis et al., in press). Papilionoids traditionally have been diagnosed by traits that now are considered synapomorphies of the subfamily. These include wood with predominantly paratracheal axial parenchyma that is usually storied; vessels with alternate vestured pits and simple perforation plates; absence of bipinnate leaves; unidirectional initiation of sepals, petals, and stamens; clawed petals; and a seed testa with a hilar valve and no pleurogram (Polhill, 1981a
; Tucker, 1987a
, 2002
; Tucker and Douglas, 1994
; Chappill, 1995
; Gasson, 2000). These many distinctions have sometimes resulted in papilionoids being ranked at the familial level (e.g., Hutchinson, 1964
; Takhtajan, 1969
). Moreover, support for the monophyly of Papilionoideae has not changed with family-wide molecular phylogenetic analyses involving the plastid rbcL locus (Käss and Wink, 1995
, 1996
, 1997
; Doyle et al., 1997
, 2000
; Kajita et al., 2001
) or trnL intron (Pennington et al., 2001
).
Despite insights gained into the higher-level relationships of the family from studies of the rbcL locus, and to a lesser extent the trnL-F region, many issues in legume phylogeny remain unresolved (reviewed in Wojciechowski, 2003
). This is particularly true for the relationships within the caesalpinioid and mimosoid subgroups and among some of the major papilionoid clades, the genistoids, dalbergioids, millettioids-phaseoloids, and Hologalegina (e.g., Crisp et al., 2000
; Hu et al., 2000
; Wojciechowski et al., 2000
; Lavin et al., 2001
). More variable nucleotide sequences are needed to improve the resolution of and support for the major clades within legumes. The most promising is the plastid gene matK, which has been shown by several recent studies on different papilionoid subgroups to provide excellent resolution among closely related genera (Hu et al., 2000
; Lavin et al., 2001
, 2003
; Steele and Wojciechowski, 2003
). Here we draw on these recent studies and a large number of new matK sequences as part of a more extensive phylogenetic analysis of the family, with particular emphasis on the major clades of the Papilionoideae.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Sequences from 140 taxa are formally reported here for the first time, complete with voucher specimen and database accession information, although a few of them have been used in part for phylogenetic analyses presented previously (Wojciechowski et al., 2000
). Papers by Hu et al. (2000)
, Lavin et al. (2001
, 2003
), Luckow et al. (2003)
, Miller et al. (2003)
, McMahon and Hufford (2004)
, Steele and Wojciechowski (2003)
, Riley-Hulting et al. (2004)
, and Thulin et al. (in press)
provide sampling information for approximately 190 matK sequences from subgroups of the taxa included here and should be consulted for more details. The sources of plant material and GenBank accession numbers for matK sequences from all taxa included in this paper are provided in the Appendix (see Supplemental Data accompanying the online version of this article).
DNA sequence data
The data presented here were gathered in our laboratories using similar methods. Genomic DNAs were isolated from field-collected, greenhouse-grown plants, silica-dried and herbarium material using the procedure of Doyle and Doyle (1987)
or using DNeasy Plant Minikits (Qiagen, Valencia, California, USA). Polymerase chain reaction (PCR) amplifications were performed using Taq and Platinum Taq DNA polymerases (Life Technologies, Gaithersburg, Maryland, USA) as described previously (Wojciechowski et al., 1999
; Lavin et al., 2000
). For most of the newly sequenced taxa, double-stranded copies of the matK gene and the flanking 3' trnK intron region were amplified using primers trnK685F and trnK2R*; typical reaction conditions were 2 min at 95°C for denaturation, followed by 35 cycles of 30 s at 95°C, 3060 s at 5557° C for annealing, 2 min 30 s at 72°C for primer extension, then followed by a final 7 min incubation at 72°C. Amplification products were purified and then sequenced using these same primers and others listed in Table 1. DNA sequencing was performed on Applied Biosystems 377 and 3100 sequencers (Applied Biosystems, Foster City, California, USA) at the University of California (DBS Sequencing Facility, Davis, California, USA), Iowa State University (DNA Sequencing Facility, Ames, Iowa, USA), Northwoods DNA (Becida, Minnesota, USA), and Arizona State University (DNA Laboratory, Tempe, Arizona, USA). Sequencer output files were assembled into contigs and edited using the program Sequencher 4.1 (GeneCodes, Ann Arbor, Michigan, USA) before alignment.
|
The matK sequences were initially assembled into a data matrix by first translating a small set of representative DNA sequences (25 taxa) to their corresponding amino acid sequences, which were then aligned using ClustalX (Thompson et al., 1997
) using standard pairwise and multiple alignment parameters settings (default gap penalty parameters and Gonnet weight matrix). The amino acid alignment was then used as a template to align the corresponding DNA sequences with insertions and deletions (indels) at equivalent positions. The remaining DNA sequences were added to this primary alignment as they became available, and the data matrix was manually readjusted as necessary to allow for additional indels. Indels were numerous but none were ambiguous with respect to their placement in the final aligned data matrix (see below). All sequences were essentially complete except for 20 taxa that had 50 or more nucleotides missing at one of the ends. All new sequences have been deposited in GenBank, and the final data matrix has been deposited in TreeBASE, study accession S1968 (http://www.treebase.org/).
Phylogenetic analyses
Phylogenetic analyses utilized maximum parsimony (MP) and Bayesian approaches. All parsimony analyses were performed using PAUP* (version 4.0b10; Swofford, 2002
). Multiple tree searches were conducted using heuristic search options that included SIMPLE, CLOSEST, or RANDOM addition sequences (1000 replicates) holding five trees per replicate, and tree bisection-reconnection (TBR) branch swapping, with retention of multiple parsimonious trees (MAXTREES = 500010 000). A nucleotide substitution model was selected using AIC implemented in Modeltest (version 3.06; Posada and Crandall, 1998
). Bayesian analyses were performed using MrBayes version 3.0 (Huelsenbeck and Ronquist, 2001
). Multiple Metropolis-coupled Markov chain Monte Carlo analyses were run with random or user-defined starting points for each run. Parameters for the Akaike information criteria (AIC)-selected GTR +
+ I model were estimated using the default value of four Markov chains and the "temperature" parameter set to 0.2. Markov chains of 2 000 000 generations each were sampled every 10010 000 generations, which was sufficient to distinguish the burn-in from stationarity phase. Log likelihood values for sampled trees stabilized after approximately 200 000 generations. Clade support was assessed using both nonparametric bootstrap resampling (Felsenstein, 1985
) and Bayesian posterior probabilities (Huelsenbeck et al., 2002
). Nonparametric bootstrap proportions were estimated from 100 to 500 bootstrap replicates incorporating heuristic parsimony searches using addition sequence and branch-swapping options as in our standard parsimony analyses. Bayesian posterior probabilities were estimated as the proportion of trees sampled after "burn-in" that contained each of the observed bipartitions. Although posterior probabilities may be over-credible as measures of clade support (Suzuki et al., 2002
) and currently are a subject of debate (Wilcox et al., 2002
; Alfaro et al., 2003
), we have observed that high Bayesian posterior probabilities often support nodes that are otherwise also detected by parsimony strict consensus.
Thirty-seven indels of 14 amino acids each were identified in the complete data set, corresponding to 219 nucleotide positions. These sites, and a 25-nucleotide region surrounding a primer-binding site (positions 541565) that was missing in about 80 taxa, were excluded from all analyses (244 total characters). Each of the 37 indels was treated as a separate character, and states were scored according to the presence or absence (1 or 0) of a sequence within an indel region (1711 total characters). Sensitivity analyses (cf. Whiting et al., 1997
) involving the inclusion or exclusion of these recoded indel characters were performed to determine the effect on tree topology and clade support.
| RESULTS |
|---|
|
|
|---|
Within Fabales, pairwise distances (calculated across all sites as uncorrected p values in PAUP*) in matK sequences ranged from a maximum of nearly 17% among outgroups and caesalpinioids, to nearly 7% among mimosoids, 12% among caesalpinioids, and just over 19% among papilionoids. Within monophyletic genera for which we have more than three species sampled, pairwise distances varied between 0.81.9% in Astragalus and 0.94.3% in Sesbania. Within major papilionoid clades, pairwise distances varied from 0.1% to almost 11% within the genistoid clade, 0.312% in Loteae + Robinieae; 0.0% (in Lens) to nearly 11% in the IRLC; 0.213% in dalbergioids, and 1.417.9% in the millettioids.
Phylogenetic reconstruction
Multiple heuristic searches of 1042 parsimony informative nucleotide characters, excluding indels, consistently converged on a large number of equally most parsimonious trees (maximum set saved = 10 000) with a minimum length of 8397 steps, a consistency index (CI) of 0.288 excluding uninformative characters, and a retention index (RI) of 0.791. The strict consensus tree of 5000 representative equally most parsimonious (MP) trees is highly resolved and presented in Figs. 15. The semi-strict consensus of the same set of most parsimonious trees resolves only three nodes that are not present in the strict consensus, while in the 50% majority-rule tree a total of only seven nodes were not fully resolved. Branching order and support values for the major clades of legumes resolved by these matK data were very similar in the maximum parsimony and Bayesian analyses (Figs. 16). To illustrate the heterogeneity in estimated branch lengths, a phylogram representation of a typical Bayesian tree (sampled post burn-in) is shown in Fig. 6.
|
|
|
|
|
|
| DISCUSSION |
|---|
|
|
|---|
Consistent with recent results using the rbcL gene (Kajita et al., 2001
) and trnL intron (Bruneau et al., 2001
; Herendeen et al., 2003a
) sequences, analyses of matK sequences support the monophyly of the family Leguminosae (Fig. 1) and the paraphyly of subfamily Caesalpinioideae. Even with our limited sampling of caesalpinioid genera, at least four well-supported clades emerge from this analysis that correspond to major caesalpinioid clades detected by analyses of trnL intron sequences alone (Bruneau et al., 2001
), trnL sequences combined with morphological characters (Herendeen et al., 2003a
, b
), or rbcL sequences (Kajita et al., 2001
). The well-supported clade represented by Colophospermum, Hymenaea, and Berlinia (Fig. 1) corresponds to the "Detarieae s.l." clade of Herendeen et al. (2003a
; see their fig. 3). Our finding of a sister relationship of this clade to that comprising Bauhinia and Cercis (the tribe Cercideae clade) is unexpected yet interesting; it may be an artifact of poor sampling among these caesalpinioid taxa but to our knowledge has not been observed in any previous study. The genera Petalostylis and Poeppigia form a well-supported clade that corresponds to the "Dialiinae s.l." clade of Herendeen et al. (2003a)
. The sister relationship of this latter clade to all remaining Leguminosae (i.e., papilionoids, mimosoids, and closely related caesalpinioids) is also detected by these earlier analyses. A well-supported clade including the remaining caesalpinioids, all mimosoids and papilionoids corresponds to "clade A" of Bruneau et al. (2001
; see their fig. 6), while the Umtiza + Caesalpinia + Mimosoideae clade of Herendeen et al. (2003a
; see their fig. 2), is well supported with matK sequences (Fig. 1). We refer to this clade as the Caesalpinioid crown. The Umtiza subclade (Herendeen et al., 2003b
) is sister to the rest of the Caesalpinioid crown clade and is here represented by Arcoa, Ceratonia, Gymnocladus, and Gleditsia. This subclade is not supported by parsimony bootstrap (<50%) in our analyses, however. Regardless, the constituents and relationships of the Umtiza subclade within the Caesalpinioid crown clade detected in this study are in agreement with and further substantiate the findings of Herendeen et al. (2003a
, b
).
Although sampling of taxa from Mimosoideae was limited, our results generally agree well with those of Luckow et al. (2003)
, especially with respect to the monophyly of at least the vast majority of the genera traditionally assigned to the Mimosoideae and paraphyly of constituent tribes. With the exception of Dinizia, which appears more closely related to certain caesalpinioids than to mimosoids on the basis of morphological and molecular evidence, as concluded by Luckow et al. (2003)
, the rest of the taxa sampled from this subfamily are resolved as monophyletic with high support in our analyses (Fig. 1). Furthermore, our results clearly show Piptadeniastrum nested within the mimosoid clade, confirming recent results by Luckow et al. (2003)
. The mimosoid clade generally shows poorly resolved relationships or at least short internal branch lengths compared to other clades of Leguminosae (Fig. 6). This suggests either a slow down in the rate of substitution or a relatively recent diversification of most of the extant members of mimosoids. These alternative hypotheses are being addressed elsewhere (Lavin et al., in press
).
In contrast to the caesalpinioids and mimosoids, our results have significant implications with regard to papilionoid phylogenetics. In all our analyses, papilionoids are strongly supported as monophyletic (Fig. 1) compared to previous rbcL studies where papilionoids were resolved as monophyletic but with relatively low statistical support (e.g., Kajita et al., 2001
; 57% bootstrap and 62% parsimony jackknife). Similar to the findings of other studies involving broad sampling of caesalpinioid legumes (e.g., Herendeen et al., 2003a
), papilionoids are not resolved as sister to an isolated caesalpinioid lineage, as are the mimosoids, but rather are nested among the major caesalpinioid clades as an early branch in the legume phylogeny (Fig. 1). In marked contrast to the most recent rbcL analysis in which most major clades within papilionoids were weakly resolved (fig. 3 of Kajita et al., 2001
), the matK strict consensus is very highly resolved (Figs. 25). Furthermore, both bootstrap proportions and Bayesian posterior probabilities for the major subclades often exceed 95%. The results presented here provide some of the best evidence to date in support of relationships among the major papilionoid subclades, which heretofore have been largely unresolved by cladistic analyses of DNA sequences data.
Consistent with the results of Doyle et al. (1997)
and Pennington et al. (2001)
, the matK phylogeny resolves certain representatives of Swartzieae and Sophoreae as the sister group to the rest the subfamily. The clade that forms the sister group to all remaining papilionoids, here delimited by the MRCA of Swartzia simplex and Myrospermum sousanum (Fig. 2), is unexpected in that it now includes representatives of a number of disparate lineages such as Angylocalyx and Dipterygeae (Dipteryx and Pterodon) that had been poorly resolved or supported in previous studies (e.g., Pennington et al., 2001
). One of two subclades of this clade includes Swartzia and recent segregate Bobgunnia, Ateleia, and Cyathostegia, and corresponds to the "swartzioid" clade of Ireland et al. (2000)
and Pennington et al. (2001)
. The other contains Amburana, Angylocalyx, Dipterygeae, Dussia, Myrocarpus, and Myrospermum. The resolution of this larger clade of morphologically eclectic genera as sister to the rest of Papilionoideae suggests that the swartzioid clade of Pennington et al. (2001)
could be expanded to encompass the majority of papilionoid genera that lack the 50-kb inversion in the plastid DNA genome.
With respect to the rest of the papilionoid subgroups, our sampling is much more extensive. The following seven well-supported clades resolved in this study are thus validated not only by extensive sampling, but also by the resolution of these subclades in other recent studies. These seven are the Cladrastis clade, the genistoid s.l., the dalbergioid s.l., the mirbelioids, the millettioids, the robinioids, and the inverted-repeat-lacking clades, the last two of which comprise Hologalegina. Even if resolved by previous studies, relationships among these major papilionoid subclades have been heretofore resolved at best with only weak support (e.g., Hu et al., 2000
; Kajita et al., 2001
; Lavin et al., 2001
; Pennington et al., 2001
).
The Cladrastis clade
The genera Cladrastis and Styphnolobium traditionally have been classified in Sophoreae s.s. (Polhill, 1981b
) whereas Pickeringia Nuttall has been classified in tribe Thermopsideae (Turner, 1981
). These three genera form a well-supported clade in all our analyses. While a sister group relationship of Cladrastis and Styphnolobium has been observed in previous molecular studies (e.g., Doyle et al., 1997
; Pennington et al., 2001
) and is notable biogeographically because both genera exhibit East AsianNorth American disjunctions, this study is the first to suggest a close relationship of these two genera with Pickeringia. Pickeringia is a monotypic genus restricted to the sclerophyllous chaparral vegetation of the California Floristic Province of western North America (Raven and Axelrod, 1995
). The strongly supported position of this genus in the Cladrastis clade confirms Polhill's (1981b)
initial prediction and Sousa and Rudd's (1993)
subsequent conclusion of a close relationship between these three genera based on floral (bracts at base of inflorescence) and chromosomal similarities (n = 14; Goldblatt, 1981
; Palomino et al., 1993
). This clade is also supported by results from cladistic analyses of nuclear ribosomal DNA internal transcribed spacers (nrDNA ITS) sequence data (M. F. Wojciechowski, unpublished data). This placement of Pickeringia reveals that Thermopsideae sensu Yakovlev (Turner, 1981
) is not monophyletic, contrary to molecular evidence presented previously (Crisp et al., 2000
). Furthermore, the absence of quinolizidine alkaloids in Pickeringia of the type that is characteristic of other Thermopsideae (Turner, 1981
) is consistent with the matK results. The presence of quinolizidine alkaloids, a prominent group of secondary metabolites once considered to be widely distributed in papilionoid legumes (Kinghorn and Balandrin, 1984
), now appear to be of systematic significance only for the "genistoids" (see next paragraph). In a recent analysis (Kite and Pennington, 2003
), the failure to detect similar alkaloids in extracts of Cladrastis and Styphnolobium is also in accordance with the phylogenetic position of these taxa based on trnL sequence data (Pennington et al., 2001
) and the result presented here. The disjunct distribution of the Cladrastis clade in warm temperate to tropical regions of the Northern Hemisphere is common to many other legume groups (e.g., Gleditsia, Gymnocladus, Desmodium, Lespedeza, etc.; Schrire et al., in press
).
The genistoid s.l. clade
The genistoids include the many genera traditionally classified in the tribes Genisteae, Thermopsideae, Euchresteae, Crotalarieae, Liparieae, Podalyrieae, and Sophoreae s.s. (Käss and Wink, 1997
; Crisp et al., 2000
). The concept of a "genistoid alliance" was circumscribed by Polhill (1981a
, 1994
) who brought together for the first time this group of putatively related, predominantly Southern Hemisphere tribes that have been considered relatively isolated among quinolizidine-alkaloid-accumulating papilionoid legumes. The alliance as recognized by Polhill comprises four separate lineages. One includes the predominantly Northern Hemisphere Genisteae sensu stricto (s.s.), Euchresteae, and Thermopsideae together with certain Sophoreae (Sophora group). A second involves the mainly southern African Crotalarieae, Liparieae, and Podalyrieae, and now segregate tribe Hypocalypteae. A third comprises the endemic Australasian Bossiaeeae and Mirbelieae. The fourth includes the NeotropicalAustralian Brongniartieae (including the Templetonia group). Early molecular phylogenetic analyses by Käss and Wink (1996
, 1997
) suggested that most species of the alliance formed a monophyletic group with some certain members of Sophoreae (i.e., some but not all species of Maackia Rupr. & Maxim. and Sophora L.) near the base of papilionoids. The analysis of Doyle et al. (1997)
suggested the genistoids were polyphyletic and formed three clades, the largest of which approximates the genistoid clade of Käss and Wink. The monogeneric Euchresteae (Euchresta) has been shown by subsequent analyses (Kajita et al., 2001
) to be nested within a Sophora s.s. clade, while Liparieae has been formally included within Podalyrieae (Schutte and van Wyk, 1998
), a placement verified by analyses of rbcL and nrDNA ITS sequences (Käss and Wink, 1997
; Kajita et al., 2001
; van der Bank et al., 2002
).
Crisp et al. (2000)
confirmed the polyphyly of the genistoids sensu Polhill (1981a)
, but suggested that this name be restricted to a well-supported "core genistoids" group, from Africa and Eurasia, that comprises the majority of the tribes that made up Polhill's genistoid alliance. This clade is strongly supported by the matK data (Fig. 2; corresponds to the clade delimited by the MRCA of Bolusanthus speciosus and Spartium junceum). In addition, the matK phylogeny corroborates results of other studies in resolving a core genistoid clade nested within a larger genistoid clade. A more inclusive group, referred here to as the genistoid s.l. clade and well supported by matK sequence analysis (Fig. 2), includes the Brongniartieae (sensu Crisp and Weston, 1987
; Thompson et al., 2001
), Poecilanthe and Cyclolobium of Millettieae (Hu et al., 2000
, 2002
), and a number of largely woody Neotropical genera of Sophoreae such as Acosmium Schott, Bolusanthus Harms, Bowdichia Kunth, Cadia Forssk., Diplotropis Bentham, and most likely Ormosia Jackson (Kajita et al., 2001
), Dicraeopetalum Harms, Clathrotropis Harms, and Platycelyphium Harms (Pennington et al., 2001
), several of which have not been sampled for matK sequences. The monophyly of the genistoid s.l. clade as defined here is also supported by the taxonomic distribution of quinolizidine alkaloids (e.g., Kinghorn and Balandrin, 1984
; van Wyk, 2003
). All taxa known to accumulate these alkaloids, with the exception of Calia (Kite and Pennington, 2003
) and Ormosia (Kinghorn and Balandrin, 1984
), are members of the genistoid s.l. clade as defined here. While the relationship of these particular taxa to this clade is not definitively resolved by our analyses (Figs. 2, 6), Pennington et al. (2001)
did find weak support at least for the inclusion of Osmosia within an "expanded" genistoids. Further resolution and sampling of these taxa as well as Holocalyx, Uribea, and the vataireoids, may yet show quinolizidine alkaloids to be a non-molecular synapomorphy for an expanded genistoid clade.
The dalbergioid s.l. clade
The dalbergioid legumes, a mostly pantropical group of papilionoids, was originally circumscribed by a combined data approach to include 44 genera and ca. 1100 species from the tribes Aeschynomeneae, Adesmieae, subtribe Bryinae of Desmodieae, and tribe Dalbergieae except Andira, Hymenolobium, Vatairea, and Vataireopsis (Lavin et al., 2001
). In addition, this clade is diagnosed apomorphically by the presence of the aeschynomenoid root nodule (Sprent, 2001
). Although the position of the dalbergioid clade within the Papilionoideae was not well resolved or supported in previous studies using rbcL and trnL (Kajita et al., 2001
; Pennington et al., 2001
), there was preliminary evidence that its sister group included the predominantly North American temperate tribe Amorpheae (Lavin et al., 2001
). Our results, like those of McMahon and Hufford (2004)
, consistently show Amorpheae as the sole sister clade to the dalbergioid clade even if parsimony bootstrap support for this relationship is moderate (Fig. 3). Thus, the dalbergioid clade (sensu Lavin et al., 2001
) is now expanded to encompass this tribe and is herein referred to collectively as the dalbergioid s.l. clade. The similarity in base chromosome number among dalbergioids and genera of Amorpheae, where x = 10 is apparently ancestral with derived cases of aneuploidy (e.g., x = 9 and x = 8; Goldblatt, 1981
), supports this decision. Furthermore, the glandular punctate leaves and indehiscent, single-seeded pods (derived from a two-plus-ovulate ovary) of Amorpheae, once thought to indicate a relationship with the genera of Psoraleeae (e.g., Stirton, 1981
), are found variously within dalbergioids. The sister relationship of the primarily tropical American Andira plus Hymenolobium to the dalbergioid s.l. clade is very weakly supported but resolved in both the parsimony strict consensus (Fig. 3) and Bayesian analyses (Fig. 6). This relationship, though consistent with results of Lavin et al. (2001
; see their fig. 5) needs to be further investigated with additional sampling of Andira and Hymenolobium species.
The mirbelioid clade
The endemic Australasian tribes Bossiaeeae and Mirbelieae comprise a fifth clade of c. 31 genera and 750 species within the papilionoids, although matK sampling is still quite limited from this group. The analyses of Doyle et al. (1997)
and Crisp et al. (2000)
provided the first molecular evidence, albeit not well supported by bootstrap analysis, that validated Crisp and Weston's (1987)
hypothesis of a monophyletic Mirbelieae-Bossiaeeae group. Recent trnL intron and nrDNA ITS sequence analyses (Crisp and Cook, 2003
) suggest Bossiaeeae is nested within a paraphyletic Mirbelieae. Although the monophyly of MirbelieaeBossiaeeae is well-supported by both bootstrap and Bayesian analyses of matK sequences (Fig. 4), neither of these tribes is resolved as monophyletic, consistent with the results of Crisp et al. (2000)
and Crisp and Cook (2003)
.
The matK analyses provide the first unequivocal evidence for a sister group relationship between Mirbelieae-Bossiaeeae and the tribe Hypocalypteae (Schutte and van Wyk, 1998
). Although this relationship receives only modest support in bootstrap and Bayesian analyses, it is consistently resolved (Fig. 4). A sister group relationship of Hypocalypteae, rather than nested within the Australasian Mirbelieae or Bossiaeeae, is also more consistent with their respective geographic distributions. The taxonomic position of Hypocalyptus Thunberg, a genus of three species geographically confined to the Cape region of South Africa, has been uncertain since Bentham (1837)
, but is historically considered linked to various tribes of the genistoid alliance. A cladistic analysis of morphological and biochemical characters (Schutte and van Wyk, 1998
) resulted in the tribal ranking of Hypocalypteae and the determination that it was more closely related phylogenetically to Millettieae. Crisp et al. (2000)
however, resolved a sister relationship of Hypocalypteae to Indigofereae using combined rbcL and nrDNA ITS sequences, although bootstrap support was lacking. Neither of these alternative positions fundamentally conflicts with the position detected using matK because neither finds any support from parsimony bootstrap analyses (Bayesian analyses were not performed in these two studies). Regardless, Hypocalypteae and Mirbelieae-Bossiaeeae collectively, or perhaps individually, occupy a sister position to the rest of the papilionoids that accumulate nonprotein amino acids in seed (i.e., Indigofereae plus the millettioids and the robinioids plus the IRLC; Fig. 4).
The millettioid clade
This clade includes all genera of the tribes Millettieae, Abreae, Phaseoleae, and Psoraleeae, plus Desmodieae subtribes Desmodiinae and Lespedezinae (Lavin et al., 1998
; Hu et al., 2000
, 2002
; Kajita et al., 2001
), with Indigofereae as its moderately supported sister group (Fig. 4). The predominantly tropical, woody Old World tribe Millettieae has been considered to be a transitional link from the "less advanced" elements of Dalbergieae and Sophoreae to putatively "more advanced" Old World tribes like Phaseoleae and Galegeae (Geesink, 1981
, 1984
; Polhill, 1981a
). These authors used the term "advanced" to indicate a high degree of fusion of stamens and keel petals, as well as the accumulation of nonprotein amino acids in seeds rather than alkaloids. Geesink (1984)
went so far as to consider Millettieae the paraphyletic "stem" group from which all other "advanced" papilionoids branched. Early rbcL analyses (Doyle et al., 1997
) suggested the polyphyly of both Millettieae and Phaseoleae, and this was subsequently confirmed using nuclear phytochrome gene sequences (Lavin et al., 1998
), trnK/matK sequences (Hu et al., 2000
), and a combined analysis of rbcL, matK, and nrDNA ITS data (Hu, 2000
). Regardless, the emerging pattern of relationships derived from these studies and from this matK analysis is that most of the constituent genera of Millettieae and Phaseoleae clade fall out in two very well-supported subclades (Fig. 4). The first, previously referred to as "core Millettieae" (Hu et al., 2000
), comprises the majority of Millettieae and includes the large genera Millettia Wight & Arn. (c. 150 spp.), Lonchocarpus Kunth (c. 130 spp.), Derris Lour. (5060 spp.) and Tephrosia Pers. (c. 350 spp.), while the majority of Phaseoleae dominates the second. Although shown here (Fig. 4) as sister lineages to core Millettieae, relationships of the monogeneric Abreae and certain Phaseoleae such as Galactia and presumably the other genera of subtribes Galactinae and Diocleinae (see Lewis et al., in press), are not yet resolved with certainty with respect to the core Millettieae clade. Additionally, certain genera classified in Millettieae, including Xeroderris and Platycyamus, have unresolved or weakly supported relationships with respect to the two main millettioid clades.
The large Phaseoleae clade resolved by matK includes tribes Desmodieae (except subtribe Bryinae) and Psoraleeae, in agreement with other studies (e.g., Kajita et al., 2001
; Hu et al., 2002
). Lackey's (1981)
subtribal classification of Phaseoleae, the largest tribe of legumes in number of genera, is not congruent with the monophyletic subclades detected in this and other analyses (e.g., Kajita et al., 2001
). The delimitation of the subtribe Phaseolinae is restricted to the descendants of the MRCA of Wajira and Phaseolus (Fig. 4), and excludes other genera once included, such as Psophocarpus and Otoptera, which are now known to be more closely related to genera in Glycininae (Thulin et al., in press
; A. Delgado-Salinas, M. Lavin, M. Thulin, and N. Weeden, unpublished data). In spite of the findings of Lee and Hymowitz (2001)
, Glycininae is also not monophyletic and includes certain genera of subtribe Phaseolinae and probably all genera of tribe Psoraleeae (Kajita et al., 2001
; A. Delgado-Salinas, M. Lavin, M. Thulin, and N. Weeden, unpublished data).
In this and most previous analyses (e.g., Kajita et al., 2001
), tribe Indigofereae emerges as the moderately supported sister group to the millettioid clade (Fig. 4). The genera of Indigofereae have an inflorescence of the simple racemose type (each node bearing a single flower), whereas the tribes Abreae, Desmodieae, Millettieae, Phaseoleae, and Psoraleeae, which form the millettioid clade, share an unusual type of inflorescence, the pseudoraceme (see Tucker, 1987a
, b
). Indeed, the pseudoracemose inflorescence is found only and in all members of the millettioid clade, rendering it the most readily morphologically distinguished of the newly circumscribed large subclades of Papilionoideae. In addition, typical chromosome numbers in Indigofereae (n = 8, 7, 6; Goldblatt, 1981
) are different from that which predominate in the tribes comprising the millettioids (e.g., Millettieae, Phaseoleae; n = 11, 12; Goldblatt, 1981
). Whether the millettioid clade should be expanded to include the Indigofereae is as yet undecided, especially given its sister relationship is only moderately supported, but the lack of any nonmolecular evidence to support such a relationship argues against this.
The robinioid clade
Tribes Loteae (including Coronilleae; sensu Polhill, 1994
) and Robinieae (sensu Lavin and Sousa, 1995
) comprise an expanded robinioid clade, which is distributed primarily in the Northern Hemisphere of the New World, Europe, and Africa. Within this clade, a monophyletic Sesbania L. (Robinieae) is weakly supported as sister to Loteae and these collectively form the sister group to the remaining members of Robinieae (Fig. 5). This position of Sesbania thus renders Robinieae paraphyletic, a finding first suggested by a preliminary phylogenetic analysis of matK sequences (Wojciechowski et al., 2000
) and confirmed with a recent study utilizing exhaustive sampling for matK, trnL intron, and nrDNA ITS sequences (Lavin et al., 2003
). Allan et al. (2003)
have provided molecular evidence for the monophyly of the largely EurasianNorth American Loteae and the paraphyly of the large genus Lotus. The robinioid clade is here expanded from that described by Lavin et al. (2003)
to encompass Sesbania and Loteae. Multiple lines of molecular evidence strongly support the monophyly of this collective group. For example, surveys for the presence of the inverted repeat in plastid DNA genomes among papilionoid legumes first suggested Loteae was distinct from the other temperate herbaceous tribes (Lavin et al., 1990
) and its sister group relationship to Sesbania was only more recently determined (Wojciechowski et al., 2000
).
The IRLC
The inverted-repeat-lacking clade or IRLC (Wojciechowski et al., 1999
, 2000
) includes most members of Polhill's (1981a)
temperate herbaceous group. This group comprises all members of tribes Carmichaelieae, Cicereae, Hedysareae, Trifolieae, Vicieae, and Galegeae, but not Loteae (and Coronilleae), as well as at least three genera of Millettieae, Afgekia Craib, Callerya Endlicher, and Wisteria Nutt. The IRLC was essentially the first clade of legumes to be distinguished on the basis of a molecular synapomorphy, loss of one copy of the 25-kilobase inverted repeat in the plastid genome (Lavin et al., 1990
; Liston, 1995
), in addition to a number of morphological features shared by members of this group including a predominantly herbaceous habit, epulvinate compound leaves, and base chromosome numbers of n = 7 or n = 8 (Polhill, 1981a
). The monophyly of the IRLC has been consistently detected in all subsequent cladistic analyses of molecular sequence data (Sanderson and Wojciechowski, 1996
; Doyle et al., 1997
; Käss and Wink, 1997
; Hu et al., 2000
, 2002
; Wojciechowski et al., 2000
; Kajita et al., 2001
). Within the IRLC, the genera Afgekia (not included in this analysis), Callerya, and Wisteria, formerly classified in the tribe Millettieae, along with Glycyrrhiza L. of Galegeae, form a paraphyletic grade with respect to the remaining IRLC (Fig. 5). The "vicioid" subclade of the IRLC includes many of the particularly important agricultural genera such as Cicer L., Lathyrus L., Lens Mill., Medicago L., Melilotus Mill., Pisum L., Trifolium L., and Vicia L. (Wojciechowski et al., 2000
; Steele and Wojciechowski, 2003
). The vicioids are morphologically the most distinctive subclade of the IRLC, apomorphically characterized by craspedidromous leaflets and consistently well supported as monophyletic with Parochetus (Trifolieae) as the sister to all other vicioid taxa (Fig. 5). Other major clades within the IRLC include the Astragalean clade (Wojciechowski et al., 1999
, 2000
), defined as all descendants of the MRCA of Astragalus americanus and Clianthus puniceus (Fig. 5), and the hedysarioid clade (Sanderson and Wojciechowski, 1996
; Wojciechowski et al., 2000
), defined as all descendants of the MRCA of Hedysarum boreale and Caragana arborescens (Fig. 5).
Hologalegina
The robinioids and the IRLC comprise the largest of the well-marked papilionoid subclades, Hologalegina (Wojciechowski et al., 2000
). This clade includes over 4800 species that make up the vast majority of legumes presently distributed in temperate regions of the world. The robinioids-IRLC dichotomy had been detected in studies with rbcL (Doyle et al., 1997
; Kajita et al., 2001
) but resolution was weakly supported. Furthermore, the placement of Bolusanthus (Sophoreae) as the sister group to Robinieae in those studies stands in marked disagreement with the results presented here and in most other studies that consistently place Bolusanthus within the genistoids (Lavin et al., 1998
; Hu et al., 2000
, 2002