|
|
||||||||
Systematics |
2Department of Biological Chemistry, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California 90095-1737 USA; 3Cell Biology and Molecular Genetics/Plant Biology, 2106 H. J. Patterson Hall, Building 073, University of Maryland, College Park, Maryland 20742-5815 USA; 4Department of Botany, Academy of Natural Sciences, 1900 Benjamin Franklin Parkway, Philadelphia, Pennsylvania 19103 USA
Received for publication July 16, 2002. Accepted for publication September 5, 2002.
| ABSTRACT |
|---|
|
|
|---|
2.5 kilobases [kb]) and matK ORF (
1.5 kb) are comparable in size to the intron and ORF of land plants, in which they are similarly found inserted in the trnK exon. Domain X, a sequence of conserved amino acid residues within matK, occurs in the Characeae. Phylogenetic analysis using maximum likelihood (GTR + I + gamma likelihood model) and parsimony (branch and bound search) yielded one tree with high bootstrap support for all branches. The matK tree was congruent with the rbcL tree for the same taxa. The number and proportion of informative sites was higher in matK (501, 31% of matK sequence) compared to rbcL (122, 10%). Characeae branch lengths were on average more than five times longer for matK compared to rbcL and provided better resolution within the Characeae. These findings along with recent genomic analyses demonstrate that the intron and matK invaded the chloroplast genome of green algae prior to the evolution of land plants.
Key Words: Characeae charophyte green algae matK maturase rbcL trnK
| INTRODUCTION |
|---|
|
|
|---|
Historically, the Characeae and related fossil taxa have been called "charophytes" (Peck, 1953
; Grambast, 1974
), although Karol et al. (2001)
used the term more broadly, including land plants. Several other green algal groups are also closely related to and monophyletic with the Characeae-land plant clade, and we refer to these algae collectively as charophyte green algae. The Characeae are relatively large (up to a meter or more in height) and morphologically complex (Wood and Imahori, 1965
; Feist and Grambast-Fessard, 1991
). Six extant genera of Characeae are recognized: Chara, Lamprothamnium, Nitellopsis, Lychnothamnus, Nitella, and Tolypella. Molecular systematic studies of the Characeae have generally employed one of two genes: the plastid-encoded large subunit of Rubisco (rbcL) or the nuclear-encoded small subunit of ribosomal DNA (SSU rDNA) (Chapman et al., 1998
). Although the two genes differ in the degree of sequence divergence, with that of rDNA typically being much lower than rbcL, both have been used to independently infer the relationships of the Characeae to land plants (Graham, 1993
; Wilcox et al., 1993
; Kranz et al., 1995
, 1997
; McCourt, 1995
; Kranz and Huss, 1996
; Chapman et al., 1998
; Cimino et al., 2000
; Karol et al., 2001
) and to decipher relationships among the six extant genera (McCourt et al., 1996a
, b
, 1999
; Meiers et al., 1997
, 1999
). Despite the utility of these two genes at elucidating phylogenetic relationships among and between the charophyte green algae, the two genes yield conflicting signals regarding the relationships among genera within the Characeae (McCourt et al., 1996a
, b
, 1999
; Meiers et al., 1997
, 1999
). Such ambiguity has led to the search for other genes or genomic characters (e.g., introns and gene arrangements) that might be used to address these questions.
In this study we report on the occurrence of a protein-coding gene, matK, in characean green algae. Although this gene has been shown to reside within a group II intron of the plastid-encoded transfer RNA gene for lysine (trnK) in all embryophytes studied (Ems et al., 1995
), the intron and matK are absent from most other green algae (Lemieux et al., 2000
; Turmel et al., 2002
). Phylogenetic studies of angiosperms have found matK to be more divergent in sequence than either rbcL or the SSU rDNA cistron, which fortuitously predisposes matK for use as a plausible gene for resolving relationships among genera and species of angiosperms (Mohr et al., 1993
; Johnson and Soltis, 1994
, 1995
; Steele and Vilgalys, 1994
; Liere and Link, 1995
; Ooi et al., 1995
; Gadek et al., 1996
; Soltis et al., 1996
; Kelchner, 2002
). We hypothesized that matK, if present in the Characeae, would be useful for lower-level phylogenetic analysis.
The present study is the first to report that the intron containing matK occurs in the Characeae. Moreover, matK sequences in the Characeae contain sufficient phylogenetic signal for reconstructing relationships among genera and species within this algal family and are congruent with phylogeny for the group inferred from rbcL data.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
Sequence analysis
Sequences of the amplified fragments were translated to amino acids in each of three reading frames to locate start and stop codons of the matK ORF. Sequence contigs were assembled using Sequencher 3.1.1 (GeneCodes, Ann Arbor, Michigan, USA) and alignments adjusted by eye. Alignment of highly divergent regions flanking the group II intron were not possible across all genera. Maximum likelihood (ML) and maximum parsimony (MP) analyses of the matK-encoding region were performed using PAUP* (version 4.0b8 PPC, Swofford, 1998
). Branch and bound MP searches were performed to find the globally most parsimonious trees. Estimates of maximum-likelihood parameters based upon the optimal MP tree were calculated using a series of models in PAUP*, and pairwise likelihood ratio tests were used to determine the best model invoking the fewest estimated parameters (Swofford et al., 1996
). This model, with fixed parameters, was used in a branch and bound search to derive an optimal ML tree.
Both ML and MP bootstrapping were performed with 1000 replicates for each method. For ML, the best model (GTR + I + gamma), with parameters fixed as described earlier, was used. For MP, the heuristic search procedure using tree bisection-reconnection (TBR), save multiple trees (MULTREES), accelerated transformation (ACCTRA), and steepest descent options was used. Parsimony branch lengths were generated using these options to compare relative rates of matK and rbcL divergence.
Maximum likelihood and MP analyses of rbcL sequences from the same DNA samples (from McCourt et al., 1999
and new sequences noted in Table 1) were also performed. Sequences of matK and rbcL for each species were combined in a concatenated data set for the two genes, tested for homogeneity of signal (Farris et al., 1995
as implemented in Swofford, 1998
), and analyzed as described earlier, except that an ML bootstrap was not performed for the rbcL data set.
| RESULTS |
|---|
|
|
|---|
2.42.8 kb in the 10 characean taxa (Table 1), a length approximating that of the
2.5 kb reported for the corresponding region in angiosperms (Steele and Vilgalys, 1994
matK in the Characeae
The matK ORF within the Characeae trnK intron was identified as a stretch of
1.5 kb uninterrupted by stop codons. This ORF was sequenced for all taxa listed in Table 1 from the six characean genera. The matK region of these algae was approximately the same size as matK in angiosperms (Johnson and Soltis, 1994
, 1995
; Steele and Vilgalys, 1994
; Soltis et al., 1996
).
Phylogeny of the Characeae based on matK
The inferred phylogenetic relationships of the characean taxa based on matK sequences are shown in Fig. 1. A branch-and-bound ML search recovered a single optimal tree (ln = 7721.244 39), which was identical in topology to the branch-and-bound MP tree (tree length = 1335; consistency index [CI] = 0.8569; 0.7887 excluding uninformative characters) found. Using this taxon set, approximately one-third of the matK sequence (501 base pairs [bp] out of 1626 or 31% of total length) was parsimony informative.
|
Combining the matK and rbcL sequences revealed no heterogeneity of signal and the MP tree for the combined data set was identical to the tree in Fig. 1.
| DISCUSSION |
|---|
|
|
|---|
Turmel et al. (2002)
also examined the small and large subunits of chloroplast rDNA in charophyte green algae and land plants and found a similar basal phylogenetic placement of Mesostigma, although an alternative placement of this genus within the charophyte green algae plus land plant lineage could not be rejected based on a comparison of log likelihoods. A recent analysis of four gene sequences from nuclear, mitochondrial, and plastid genomes supports the inclusion of Mesostigma with charophyte green algae and land plants (Karol et al., 2001
). If Mesostigma and Chlorokybus are early diverging descendants of the line leading to the Charales and land plants, then the trnK group II intron and matK ORF were absent when this group diverged from other green algae.
Currently under investigation is whether other algae in the more basal orders of charophyte green algae, such as Coleochaetales, Zygnematales, and Klebsormidiales, possess the trnK-matK region. Turmel et al. (2002)
sequenced the entire plastid genome of Chaetosphaeridium globosum, a member of the Coleochaetales, sister group to the Charales/land-plant clade, and found that this species contains the matK ORF. C. Lemieux and M. Turmel (Université Laval, Quebec, Canada, personal communication) also found the matK ORF in the plastid genome of the zygnematalean genus Staurastrum, and we have amplified the gene from Coleochaete (K. G. Karol and R. M. McCourt, unpublished data). Identifying the group II intron and matK in the Coleocaetales, Charales, and other green algae related to land plants supports the hypothesis that these features of the plastid genome constitute a derived character for charophyte green algae that diverged after Mesostigma. Moreover, these studies show that the group II intron and matK ORF invaded the plastid genome at some time prior to the divergence of land plant ancestors from their aquatic algal relatives and thus before the transition from an aquatic to a terrestrial habitat.
The protein encoded by matK (MatK) has maturase, reverse transcriptase (RT), and DNA endonuclease activities in eubacteria and fungal mitochondria (Tsudzuki et al., 1992
; Mohr et al., 1993
; Ems et al., 1995
; Liere and Link, 1995
; Michel and Ferat, 1995
; Matsuura et al., 1997
). The latter two functions likely contribute to the mobility of the intron (Ferat et al., 1994
; Michel and Ferat, 1995
; Matsuura et al., 1997
). However, in land plant MatK, the RT and endonuclease regions of the protein are truncated, a deletion that may interfere with the reverse transcriptase activity of MatK, which would consequently account for the relatively stable location of group II introns in trnK from lower to higher plants (i.e., bryophytes to angiosperms) (Michel and Ferat, 1995
). The data reported here suggest that matK and its intron invaded the chloroplast genome, then were truncated and achieved this stable position long before the divergence of embryophytes.
The nucleotide sequence composition of the
1.5-kb characean matK ORF is more variable in genera and species of Characeae than in land plants. This variability is not surprising given the antiquity of characean genera and species relative to land plant species (Feist and Grambast-Fessard, 1991
). Characean matK nucleotide sequences were not readily alignable to those of angiosperms except for small regions involving approximately three dozen amino acids. This portion of domain X near the 3' end of the matK ORF, although clearly homologous to domain X in land plants, contained insufficient signal to obtain a resolved tree for Characeae and land plants. However, a BLAST (basic local alignment search tool) search of the translated amino acid sequences matched most closely to numerous land plant MatK polypeptide sequences in GenBank. Previous studies of group II intron-encoded proteins have shown that the only highly conserved region among all identified or proposed maturases is domain X, which consists of approximately 100 amino acids and has been proposed to function in RNA binding (Mohr et al., 1993
; Vogel et al., 1997
). Approximately three dozen derived amino acid residues for characean matK within domain X were easily aligned to those inferred from the cDNA sequence of matK for barley (an angiosperm, Hordeum vulgare) and a liverwort (Marchantia polymorpha) (Fig. 2). For the characean taxa, domain X begins near position 1400 in the matK sequence (there is some variability from indels in various taxa). The amino acid sequence similarity is highest for the first 39 amino acid residues (
50%), after which the similarity between the characean and two land plant sequences is low (<10%) (Fig. 2).
|
Sequence data for matK have been used extensively in phylogenetic studies of families, genera, and species of plants (Johnson and Soltis, 1994
, 1995
; Soltis et al., 1996
). In embryophytes, matK has greater levels of divergence than rbcL. This pattern holds true for the Characeae as well. Parsimony branch lengths for matK are on average more than five times longer than respective rbcL branches, which suggests that the gene will prove useful for studies within the Characeae (and in other charophyte green algae in which it may be found).
The congruence of tree topology for rbcL and matK support the conclusions of McCourt et al. (1996a)
and Karol et al. (2001)
regarding genus-level relationships within the Characeae. The matK data also support the suggestion of McCourt et al. (1999)
that C. connivens and C. globularis form a closely related, perhaps conspecific cluster of taxa. The matK tree, while unrooted, is congruent with the monophyly of the tribes Chareae and Nitelleae recognized by conventional taxonomy (Wood and Imahori, 1965
). McCourt et al. (1999b)
noted longer rbcL branch lengths in the Nitelleae compared to those within the Chareae, a pattern that also holds for the matK tree (Fig. 1). This finding for both genes suggests the branch asymmetry within the Characeae is not restricted to rbcL and may represent a different evolutionary rate or time of origin in the two main clades of the Characeae.
The data show that matK data provide better resolution within the Characeae than rbcL, but rooting the tree remains a problem. The matK ORF and its intron may provide other types of data to address fundamental issues of "deep green" research (see website http://ucjeps.berkeley.edu/bryolab/greenplantpage.html). Additional sampling of the trnK region in other charophyte green algae, combined with studies of the secondary structure of the intron, may provide additional characters or help align sequences and provide better estimates of the root of the tree and of the phylogeny of charophyte algae and land plants.
| FOOTNOTES |
|---|
| LITERATURE CITED |
|---|
|
|
|---|
Bold H. C. M. J. Wynne 1985 Introduction to the algae, 2nd ed. Prentice-Hall, Englewood Cliffs, New Jersey, USA
Chapman R. L. M. A. Buchheim C. F. Delwiche T. Friedl V. A. R. Huss K. Karol L. A. Lewis J. Manhart R. M. McCourt J. L. Olsen D. Waters 1998 Molecular systematics of the green algae. In D. E. Soltis, P. S. Soltis, and J. J. Doyle [eds.], Molecular systematics of plants II, 508540. Kluwer Academic, Boston, Massachusetts, USA
Cimino M. T. K. G. Karol C. F. Delwiche 2000 An artifact in the small subunit rDNA sequence of Chaetosphaeridium globosum (Charophyceae, Streptophyta). Journal of Phycology 36: 440-442[CrossRef][ISI]
Doyle J. J. J. L. Doyle 1987 A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochemical Bulletin 19: 11-15
Ems S. C. C. W. Morden C. K. Dixon K. H. Wolfe C. W. Depamphilis J. Palmer 1995 Transcription, splicing and editing of plastid RNAs in the nonphotosynthetic plant Epiphagus virginiana. Plant Molecular Biology 29: 721-733[CrossRef][ISI][Medline]
Farris J. S. J. Källersjo A. G. Kluge C. Bult 1995 Testing significance of incongruence. Cladistics 10: 315-319[CrossRef][ISI]
Feist M. N. Grambast-Fessard 1991 The genus concept in Charophyta: evidence from Palaeozoic to Recent. In R. Riding [ed.], Calcareous algae and stromatolites, 189203. Springer-Verlag, New York, New York, USA
Ferat J.-L. M. L. Gouar F. Michel J.-L. Ferat 1994 Multiple group II self-splicing introns in mobile DNA from Escherichia coli. C. R. Academie Sciences Paris, Sciences de la vie 317: 141-148
Fritsch F. E. 1935 The structure and reproduction of the algae, vol.1. Cambridge University Press, London, UK
Gadek P. P. G. Wilson C. J. Quinn 1996 Phylogenetic reconstruction in Myrtaceae using matK, with particular reference to the position of Psiloxylon and Heteropyxis. Australian Systematic Botany 9: 283-290[CrossRef][ISI]
Graham L. E. 1993 The origin of land plants. John Wiley & Sons, New York, New York, USA
Grambast L. 1974 Phylogeny of the Charophyta. Taxon 23: 463-481
Graur D. W.-H. Li 2000 Fundamentals of molecular evolution, 2nd ed. Sinauer, Sunderland, Massachusetts, USA
Johnson L. A. D. E. Soltis 1994 MatK DNA sequences and phylogenetic reconstruction in Saxifragaceae s.str. Systematic Botany 19: 143-156
Johnson L. A. D. E. Soltis 1995 Phylogenetic inference in Saxifragaceae sensu stricto and Gilia (Polemoniaceae) using matK sequences. Annals of the Missouri Botanical Garden 82: 149-175[CrossRef][ISI]
Karol K. G. R. M. McCourt M. T. Cimino C. F. Delwiche 2001 The closest living relatives of land plants. Science 294: 2351-2353
Kelchner S. A. 2002 Group II introns as phylogenetic tools: structure, function, and evolutionary constraints. American Journal of Botany 89: 1651-1669
Kranz H. D. V. A. R. Huss 1996 Molecular evolution of pteridophytes and their relationship to seed plants: evidence from complete 18S rRNA gene sequences. Plant Systematics and Evolution 202: 1-11[CrossRef][ISI]
Kranz H. D. R. McCourt K. Karol V. A. R. Huss 1997 Phylogenetic relationships of the charophytes as inferred from biological markers. Phycologia 36: (supplement) 54.
Kranz H. D. D. Miks M. L. Siegler I. Capesius C. W. Sensen V. A. R. Huss 1995 The origin of land plants: phylogenetic relationships among charophytes, bryophytes, and vascular plants inferred from complete small-subunit ribosomal RNA gene sequences. Journal of Molecular Evolution 41: 74-84[ISI][Medline]
Lemieux C. C. Otis M. Turmel 2000 Ancestral chloroplast genome in Mesostigma viride reveals an early branch of green plant evolution. Nature 403: 649-652[CrossRef][Medline]
Liere K. G. A. Link 1995 RNA-binding activity of the matK protein encoded by the chloroplast trnK intron from mustard (Sinapis alba L). Nucleic Acids Research 1923: 917-921
Martin-Closas C. C. Diéguez 1998 Charophytes from the lower Cretaceous of the Iberian ranges (Spain). Palaeontology 41: 1133-1152[ISI]
Matsuura M. R. Saldanha H. Ma H. Want J. Yang G. Mohr S. Cavanagh G. M. Dunny M. Belfort A. M. Lambowitz 1997 A bacterial group II intron encoding reverse transcriptase, maturase, and DNA endonuclease activities: biochemical demonstration of maturase activity and insertion of new genetic information within the intron. Genes and Development 11: 2910-2924
McCourt R. M. 1995 Green algal phylogeny. Trends in Ecology and Evolution 10: 159-163
McCourt R. M. K. G. Karol M. T. Casanova M. Feist 1999 Monophyly of genera and species of Characeae based on rbcL sequences, with special reference to Australian and European Lychnothamnus barbatus (Characeae: Charophyceae). Australian Journal of Botany 47: 361-369[CrossRef]
McCourt R. M. K. G. Karol M. Guerlisquine M. Feist 1996a Phylogeny of extant genera in the family Characeae (division Charophyta) based on rbcL sequences and morphology. American Journal of Botany 83: 125-131[CrossRef][ISI]
McCourt R. M. S. Meiers K. G. Karol R. L. Chapman 1996b Molecular systematics of the Charales. In D. B. Chaudhary and S. B. Agrawal [eds.], Cytology, genetics and molecular biology of algae, 323336. SPB Publishing, Amsterdam, Netherlands
Meiers S. T. V. W. Proctor R. L. Chapman 1999 Phylogeny and biogeography of Chara (Charophyta) inferred from 18S rDNA sequences. Australian Journal of Botany 47: 347-360[CrossRef][ISI]
Meiers S. T. W. L. Rootes V. W. Proctor R. L. Chapman 1997 Phylogeny of the Characeae (Charophyta) inferred from organismal and molecular characters. Archiv für Protistenkunde 148: 308-317[ISI]
Michel F. J.-L. Ferat 1995 Structure and activities of group II introns. Annual Review of Biochemistry 64: 435-461[CrossRef][ISI][Medline]
Mohr G. P. S. Perlman A. M. Lambowitz 1993 Evolutionary relationships among group II intron-encoded proteins and identification of a conserved domain that may be related to maturase function. Nucleic Acids Research 21: 4991-4997
Ooi K. A. Y. Endo J. Yokoyama N. Murakami 1995 Useful primer designs to amplify DNA fragments of the plastid gene matK from angiosperm plants. Journal of Japanese Botany 70: 328-331
Peck R. E. 1953 Fossil charophytes. Botanical Review 19: 209-227
Pickett-Heaps J. D. 1975 Green algae: structure, reproduction and evolution in selected genera. Sinauer, Sunderland, Massachusetts, USA
Soltis D. E. R. K. Kuzoff E. Conti R. Gornall K. Ferguson 1996 matK and rbcL gene sequence data indicate that Saxifraga (Saxifragaceae) is polyphyletic. American Journal of Botany 83: 371-382[CrossRef][ISI]
Soulie-Märsche I. 1989 Etude comparee de gyrogonites de charophytes actuelles et fossiles et phylogenie des genres actuels. Thése d'Etat soutenue á l'Université des Sciences et Techniques du Languedoc, Montpellier, France
Steele K. P. R. Vilgalys 1994 Phylogenetic analyses of Polemoniaceae using nucleotide sequences of the plastid gene matK. Systematic Botany 19: 126-142
Swofford D. L. 1998 PAUP*: phylogenetic analysis using parsimony (* and other methods), version 4. Sinauer, Sunderland, Massachusetts, USA
Swofford D. L. G. J. Olsen P. J. Waddell D. M. Hillis 1996 Phylogenetic inference. In D. M. Hillis, C. Moritz, and B. K. Mable [eds.], Molecular systematics, 2nd ed., 407514. Sinauer, Sunderland, Massachusetts, USA
Taylor T. N. W. Remy H. Hass 1992 Parasitism in a 400-million-year-old green alga. Nature 357: 493-494[CrossRef]
Tsudzuki J. K. Nakashima T. Tsudzuki J. Hiratsuka M. Shibata T. Wakasugi M. Sugiura 1992 Chloroplast DNA of black pine retains a residual inverted repeat lacking rRNA genes: nucleotide sequences of trnQ, trnK, psbA, trnI and trnH and the absence of rps16. Molecular and General Genetics 232: 206-214
Turmel M. C. Otis J.-C. de Cambiaire J.-F. Pombert C. Lemieux 2002 The chloroplast genome sequence of Chlorokybus atmophyticus: evidence that charophycean green algae from an early-diverging lineage adapted to terrestrial life. Botanical Society of America Abstracts website: http://www.botany2002.org/sympos8/abstracts/3.shtml
Turmel M. C. Otis C. Lemieux 1999 The complete chloroplast DNA sequence of the green alga Nephroselmis olivacea: insights into the architecture of ancestral chloroplast genomes. Proceedings of the National Academy of Sciences of the USA 96: 10248-10253
Vogel J. T. Huebschmann T. Boerner W. R. Hess 1997 Splicing and intron-internal RNA editing of trnKmatK transcripts in barley plastids: support for matK as an essential splice factor. Journal of Molecular Biology 270: 179-187[CrossRef][ISI][Medline]
Wilcox L. P. A. Fuerst G. L. Floyd 1993 Phylogenetic relationships of four charophycean green algae inferred from complete nuclear-encoded small subunit rRNA gene sequences. American Journal of Botany 80: 1028-1033[CrossRef][ISI]
Wood R. D. I. Imahori 1965 Monograph of the Characeae. First part of a revision of the Characeae. Verlag von J. Cramer, Weinheim, Germany
This article has been cited by other articles:
![]() |
G. Hausner, R. Olson, D. Simon, I. Johnson, E. R. Sanders, K. G. Karol, R. M. McCourt, and S. Zimmerly Origin and Evolution of the Chloroplast trnK (matK) Intron: A Model for Evolution of Group II Intron RNA Structures Mol. Biol. Evol., February 1, 2006; 23(2): 380 - 391. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |