Am. J. Bot.
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


  Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Facebook   Add to Reddit   Add to Technorati   Add to Twitter
What's this?
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Web of Science (57)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Kelchner, S. A.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Kelchner, S. A.
Agricola
Right arrow Articles by Kelchner, S. A.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Facebook   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?
(American Journal of Botany. 2002;89:1651-1669.)
© 2002 Botanical Society of America, Inc.


Systematics and Phytogeography

Group II introns as phylogenetic tools: structure, function, and evolutionary constraints1

Scot A. Kelchner

Centre for Plant Biodiversity Research, Commonwealth Scientific and Industrial Research Organisation, Division of Plant Industry, Canberra, ACT 2601 Australia; School of Botany and Zoology, The Australian National University, Canberra, ACT 2601 Australia

Received for publication January 3, 2002. Accepted for publication May 3, 2002.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 GROUP II INTRON STRUCTURE...
 MITOCHONDRIAL GROUP II INTRONS
 CHLOROPLAST INTRON LOSS IN...
 MUTATION PATTERNS IN GROUP...
 TECHNIQUES
 ALIGNMENT AND ANALYSIS
 GROUP II INTRONS: A...
 LITERATURE CITED
 
Group II introns comprise the majority of noncoding DNA in many plant chloroplast genomes and include the commonly sequenced regions trnK/matK, the rps16 intron, and the rpl16 intron. As demand increases for nucleotide characters at lower taxonomic levels, chloroplast introns may come to provide the bulk of plastome sequence data for assessment of evolutionary relationships in infrageneric, intergeneric, and interfamilial studies. Group II introns have many attractive properties for the molecular systematist: they are confined to organellar genomes in eukaryotes and the majority are single-copy; they share a well-defined and empirically tested secondary and tertiary structure; and many are easily amplified due to highly conserved sequence in flanking exons. However, structure-linked mutation patterns in group II intron sequences are more complex than generally supposed and have important implications for aligning nucleotides, assessing mutational biases in the data, and selecting appropriate models of character evolution for phylogenetic analysis. This paper presents a summary of group II intron function and structure, reviews the link between that structure and specific mutational constraints in group II intron sequences, and discusses strategies for accommodating the resulting complex mutational patterns in subsequent phylogenetic analyses.

Key Words: chloroplast noncoding DNA • group II introns • molecular evolution • phylogenetic analysis • RNA structure • rpl16 intron


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 GROUP II INTRON STRUCTURE...
 MITOCHONDRIAL GROUP II INTRONS
 CHLOROPLAST INTRON LOSS IN...
 MUTATION PATTERNS IN GROUP...
 TECHNIQUES
 ALIGNMENT AND ANALYSIS
 GROUP II INTRONS: A...
 LITERATURE CITED
 
Group II (G2) introns comprise the majority of noncoding DNA in most chloroplast genomes (Jurica and Stoddard, 1999 ). Some are among the fastest evolving regions known in the plastome (Wolfe, Li, and Sharp, 1987 ; Downie, Katz-Downie, and Cho, 1996 ; Small et al., 1998 ; Downie, Katz-Downie, and Watson, 2000 ). As molecular systematists focus more attention on relationships at lower taxonomic levels in plants, chloroplast group II introns are increasingly providing a rich source of sequence characters for infrageneric and intrafamilial phylogeny estimation (Table 1).


View this table:
[in this window]
[in a new window]
 
Table 1. Phylogenetic analyses of land plants using chloroplast group II intron sequence data. Taxon level comparisons between studies do not necessarily reflect equivalent evolutionary time scales

 
There has been much speculation among molecular systematists as to the nature of group II introns and their possible evolutionary constraints (e.g., Clegg et al., 1994 ; Downie, Katz-Downie, and Cho, 1996 ; Jordan, Courtney, and Neigel, 1996 ; Kelchner and Clark, 1997 ; Downie et al., 1998b ). For many years, introns have been considered "junk" DNA and were widely presumed to evolve under minimal selective constraints in a fashion consistent with the neutral theory of sequence evolution (sensu Kimura, 1968 , 1983 ). The legacy of this expectation persists in molecular systematics, possibly due to the absence of readily attainable information to the contrary. There have been many recent advances in our understanding of group II intron structure and function, however, and this paper aims to provide a synopsis of that material in the context of its importance to phylogenetic methodology.

One of the most exciting features of group II introns as phylogenetic tools is the uniformity of their structure and function. It should be clear in the following discussion that functional requirements induce structural constraints on group II introns and that these constraints may contribute to heterogeneous mutation patterns across G2 intron sequences. Understanding the connection between structure, function, and evolutionary constraints in a G2 intron is therefore fundamental to improving all levels of phylogenetic analysis based on G2 intron sequence data.

The information presented here is intended to assist molecular systematists in the use of G2 intron sequences for phylogeny estimation in higher plants. This review does not cover the special case of the trnL intron, the sole group I intron in the chloroplast genome; it is expected, however, that most of the methodological approaches described here for G2 intron analysis will apply to similarly structured RNA molecules, including group I introns.


    GROUP II INTRON STRUCTURE AND FUNCTION
 TOP
 ABSTRACT
 INTRODUCTION
 GROUP II INTRON STRUCTURE...
 MITOCHONDRIAL GROUP II INTRONS
 CHLOROPLAST INTRON LOSS IN...
 MUTATION PATTERNS IN GROUP...
 TECHNIQUES
 ALIGNMENT AND ANALYSIS
 GROUP II INTRONS: A...
 LITERATURE CITED
 
Introns are classified on the basis of their conserved RNA folding patterns according to the nomenclature of Michel and Dujon (1983) and Michel, Umesono, and Ozeki (1989) . Each structural class, referred to as group I, group II, and group III, is characterized by a diagnostic secondary structure configuration. Group II introns include two subclasses, IIA and IIB, each consisting of two forms (IIA1 and IIA2, IIB1 and IIB2; Michel, Umesono, and Ozeki, 1989 ).

Whereas group I introns are found in all genomes of prokaryotic and eukaryotic organisms, group II introns are restricted to plant and fungi organelles and certain prokaryotes of cyanobacterial and proteobacterial lineages. Mitochondrial genomes of plants and fungi have their own set of group II introns that appear to differ historically from those found in chloroplast genomes (Michel and Ferat, 1995 ; Toor, Hausner, and Zimmerly, 2001 ). It has been demonstrated that each organelle may have maintained its unique group II intron assembly by vertical descent since the incorporation of the organelle into the eukaryotic cell (Toor, Hausner, and Zimmerly, 2001 ), and there is no contradictory evidence as yet to reject this conclusion. The situation is markedly different in organellar group I introns, several of which seem to have repeatedly and independently invaded mitochondrial genomes (Turmel et al., 1995 ; Cho et al., 1998 ; Cho and Palmer, 1999 ; Goddard and Burt, 1999 ; Holst-Jensen et al., 1999 ; Palmer et al., 2000 ). The sole group I intron in plastomes, that of the trnL gene, may predate endosymbiosis and is not thought to derive from post-endosymbiotic invasion of the genome (Besendahl et al., 2000 ).

Function
The primary function of a group II intron, whether in a chloroplast, mitochondrion, or prokaryote genome, is to self-direct its extrication from gene transcripts prior to translation of the mRNA into a protein. This process requires two rounds of autocatalytic chemical reactions, termed "splicing reactions." Splicing refers to the capacity of the intron to break the ribonucleic acid chain of the pre-mRNA transcript at the exon boundaries and reconnect the strand after the intron's removal. Splicing completely excises the intron from the disrupted gene transcript, allowing the pre-mRNA to continue its maturation pathway and subsequent translation. Failure to properly or efficiently remove the intron from the transcript prevents further transcript processing and translation; the protein is not synthesized, and presumably both the organism and the intron are strongly selected against.

These reactions define the primary functional phase of an intron and occur while the intron and host gene are a pre-mRNA transcript. Thus, in terms of evolutionary constraints, changes in the intron DNA sequence should be considered in terms of its RNA counterpart. This detail has significant implications when using G2 introns for comparative sequence analysis and phylogeny construction.

The two stages of intron-directed cis-splicing reactions are as follows (Jacquier, 1996 ; Podar, Perlman, and Padgett, 1998 ; Holländer and Kück, 1999 ; Jurica and Stoddard, 1999 ; Costa, Michel, and Westhof, 2000 ). The first stage consists of the folding of a pre-mRNA intron transcript into its secondary and tertiary structural formation, called a "ribozyme" (a term designating a catalytic RNA). This folding brings a single adenine in domain VI (Fig. 1) against the now neighboring intron/exon boundaries. The proximity of the adenine to the G1 nucleotide (the first nucleotide of the 5' end of the intron) triggers a nucleophilic attack, and a transesterification reaction cleaves the ribonucleic acid at the 5' intron-exon boundaries to form a structure known as a "lariat" (Schmelzer and Müller, 1987 ; Jacquier and Jacquesson-Breuleux, 1991 ). The second stage of splicing involves another transesterification reaction involving the 3' intron/exon boundary, rejoining the exon pre-mRNA fragments together and releasing the intron, still in lariat form.



View larger version (23K):
[in this window]
[in a new window]
 
Fig. 1. Stylized structure of group IIB introns—the most frequent class of G2 introns in the plastome—showing secondary structure and tertiary interactions based on Michel, Umesono, and Ozeki (1989) , Jacquier (1996) , and Toor, Hausner, and Zimmerly (2001) . Nomenclature of structural features follows Michel, Umesono, and Ozeki (1989) . Core structural elements consistent across all group II introns are drawn here in bold; light gray outlines indicate structures that may vary in length or terminal loop configurations. Roman numerals I–VI mark the six domains of group II introns. Lowercase letters (a, b, c, and d) mark subdomain helices of domain I. Lower case roman numerals (i–iv) indicate stem segments within each subdomain. IBS1, IBS2, and IBS3 are tertiary interaction sites in the 5' and 3' exons that correspond to intron domain I elements EBS1, EBS2, and EBS3, respectively. Other known tertiary interactions are indicated by Greek symbols next to the interactive sites. Conserved nucleotides are indicated as a single character state (e.g., A, G, or length-restricted sites N) or as purines (R) and pyrimidines (Y). The oversized "A" in domain VI (also marked with an asterisk) is the initiation nucleotide for group II intron splicing reactions. Nucleotide positions represented as dots signify partial 5' and 3' exon sequences. Group IIA introns differ slightly in structure in the domain I d helix—EBS2 is the terminal loop of a small helix just upstream of d2, and there is an additional small helix at the EBS3 region (see Toor, Hausner, and Zimmerly, 2001 ). The rpl16 intron is missing subdomains a and b in domain I and subsequently the {alpha} tertiary interaction with the d3 stem bulge

 
Although autocatalytic splicing occurs in vitro under conditions unusual to the cell (e.g., Peebles et al., 1986 ; Schmelzer and Schweyen, 1986 ), efficient in vivo splicing almost always requires the assistance of a catalytic enzyme (Michel and Ferat, 1995 ; Jacquier, 1996 ; Tanner, 1999 ). This enzyme, a reverse-transcriptase-like protein referred to as a "maturase," joins the pre-mRNA transcript prior to the splicing reactions to form a ribonucleoprotein complex (RNP complex; see Zimmerly et al., 1995a ; Cousineau et al., 2000 ). The RNP complex consists of at least the ribozyme form of the intron and a maturase. In many prokaryotic G2 introns, the maturase of the RNP complex is intron-specific and is coded for by the intron itself. When present, the maturase open reading frame (ORF) in both prokaryotes and eukaryotes resides in domain IV (Mohr, Perlman, and Lambowitz, 1993 ), which may also serve as a binding site for the maturase in the RNP complex (Wank et al., 1999 ). This gives rise to the curious circumstance in the trnK sequence region of a gene (matK) within an intron (trnK intron) within another gene (trnK).

Toor, Hausner, and Zimmerly (2001) surveyed 39 maturase ORFs from prokaryotic and organellar G2 introns and identified six phylogenetic lineages that correspond closely with individual G2 intron structural categories. For example, all maturases from class IIA1 mitochondrial introns belong to a single maturase lineage. Furthermore, their survey of 142 known group II introns revealed that most prokaryote G2 introns have ORFs, about half of lower eukaryote G2 introns maintain ORFs, but only two of the 42 higher eukaryote organellar G2 introns still have ORFs coding for a functional maturase (matK in the chloroplast, matR in the mitochondrion). Their data suggest a model of G2 intron evolution in which nearly all chloroplast and mitochondrial G2 introns have lost their functional maturases, and those G2 introns still maintaining them have each coevolved with their maturase since ancient times. These findings concur with those of an earlier study by Mohr, Perlman, and Lambowitz (1993) .

Chloroplast introns have managed to persist despite the loss of their unique maturases. They still seem to require an RNP complex for efficient splicing in vivo (e.g., Holländer and Kück, 1999 ), but it is thought that the maturase matK can successfully form RNP complexes with any of the G2 introns in the chloroplast (Mohr, Perlman, and Lambowitz, 1993 ; Ems et al., 1995 ; Vogel, Börner, and Hess, 1999 ). This arrangement may have freed the remaining G2 introns in the plastome from having to maintain their own ORFs, a situation that could in part be responsible for the high levels of sequence variation noted in many chloroplast intron domain IVs (e.g., Learn et al., 1992 ; Downie et al., 1998b ; Downie, Katz-Downie, and Watson, 2000 ). Not surprisingly, there is evidence that many chloroplast G2 introns contain degenerate maturase ORFs in their domain IV helix (Toor, Hausner, and Zimmerly, 2001 ).

Maturases are also reverse transcriptases and have several reverse transcriptase (RT) domains in their coding sequence. This enables some G2 introns to move about in their host genomes (Lambowitz and Belfort, 1993 ; Mueller et al., 1993 ; Zimmerly et al., 1995b ; Eickbush, 1999 ; Jurica and Stoddard, 1999 ). Mobile G2 introns are mostly known from prokaryotes and yeast, and several examples of G2 intron transpositioning have been demonstrated within the yeast mitochondrial genome (Mueller et al., 1993 ; Lazowska, Meunier, and Macadre, 1994 ; Moran et al., 1995 ; Yang et al., 1998 ; Sellem, Begel, and Sainsard-Chanet, 2000 ). Besides having complete RT domains, the typical maturase of a mobile group II intron contains a "zinc finger" that aids in targeting by sequence recognition (a process known as "homing"; Jurica and Stoddard, 1999 ; Mohr et al., 2000 ). It is this feature of some G2 introns that has made them candidates for medical applications, such as gene therapy in humans (Tanner, 1999 ; Guo et al., 2000 ; Mohr et al., 2000 ).

Mobility in chloroplast group II introns has not been detected. The lack of a zinc finger and complete reverse transcriptase domains in matK (Mohr, Perlman, and Lambowitz, 1993 ; Young and dePamphilis, 2000 ) is consistent with the expectation that matK RNP complexes do not possess homing capabilities.

A recent discovery has suggested that splicing of at least some chloroplast G2 introns in maize may also involve two nuclear-coded gene products, CRS1 and CRS2 (Jenkins, Kulhanek, and Barkan, 1997 ; Vogel, Börner, and Hess, 1999 ; Till et al., 2001 ). It is suspected that CRS1 is a required cofactor in atpF splicing reactions and that CRS2 may be a cofactor in group IIB intron RNPs, the most frequent class of introns in the plastome.

There is sound in vivo and in vitro experimental evidence that group II intron function is largely consistent in both the mitochondrion and the chloroplast (Herdenberger, Holländer, and Kuck, 1994 ; Holländer and Kück, 1998 , 1999 ). One in vivo system developed by Herdenberger, Holländer, and Kuck (1994) for point mutation studies of group II introns introduces the mitochondrial rI1 group II intron from the green algae Scenedesmus obliquus into the chloroplast gene tscA of Chlamydomonas reinhardtii. The inserted intron efficiently completes its splicing reactions in the chloroplast, enabling proper translation of the tscA protein. Such studies highlight the probable universality of mechanisms involved in G2 intron splicing reactions.

Function is inextricably linked to structure in G2 introns. We can therefore infer that site-specific mutations that terminate function in one G2 intron will likely have the same effect in other G2 introns if such mutations occur in homologous structural positions. As discussed in the final section of this paper, this concept may have important applications when using G2 introns for molecular phylogenetics.

Structure
Jacquier (1996) estimated that a G2 intron would need no less than 600 nucleotides to maintain all structural features involved in proper splicing. The standard group II intron folding model was created by identifying conserved secondary structures of folded RNA intron sequences among a wide variety of organisms and genomes (Michel and Dujon, 1983 ; Michel, Umesono, and Ozeki, 1989 ; Michel and Ferat, 1995 ). The Michel, Umesono, and Ozeki (1989) model still remains the best estimate of G2 intron structure and has largely been validated by in vitro and in vivo experimental investigations, including point mutation studies (Peebles et al., 1995 ; Abramovitz, Friedman, and Pyle, 1996 ; Holländer and Kück, 1999 ), chemical footprinting (Konforti, Liu, and Pyle, 1998 ; Costa, Michel, and Westhof, 2000 ), and NAIM analysis (Boudvillain and Pyle, 1998 ; Boudvillain, de Lencastre, and Pyle, 2000 ). Updated detailed models of the four group II intron structural categories can be found in Toor, Hausner, and Zimmerly (2001) .

Although group II introns from organisms as diverse as cyanobacteria, Euglena, higher plants, and fungi share little in the way of nucleotide sequence similarity, sequences from each organism can be folded into the same core secondary structure. The Michel, Umesono, and Ozeki (1989) model has six main domains, multiple subdomains, and a nomenclatural system (Fig. 1). The general structure consists of six major structural helices that radiate from a "central wheel" of single-stranded RNA segments. Domain I (D1; Table 2) is the most complex, with many structurally important subhelices, and it typically comprises more than half of the total intron sequence. This domain interacts strongly at the tertiary level with domains V and VI (Boudvillain and Pyle, 1998 ; Konforti, Liu, and Pyle, 1998 ) and with external binding sites in the flanking exons. Domains II (D2) and III (D3) are considerably shorter and vary in length in plants (Learn et al., 1992 ). These domains seem to contribute relatively little to tertiary structure and splicing efficiency (Kwakman et al., 1989 ; Koch et al., 1992 ; Konforti, Liu, and Pyle, 1998 ). Domain IV (D4) can be quite large in chloroplast G2 introns, and in the trnK intron it is the site of the maturase ORF, matK. Domain V (D5) is the most highly restricted in terms of length and sequence variation (Michel, Umesono, and Ozeki, 1989 ; Learn et al., 1992 ) and is almost always 34 nucleotides in length in plants. Domain V is not known to possess any binding sites with the mRNA substrate, and its high degree of conservation is most likely due to its fundamental role in ribozyme folding (Peebles et al., 1995 ; Jacquier, 1996 ; Konforti, Liu, and Pyle, 1998 ). Domain VI (D6) has tertiary interactions with domains I and V (Koch et al., 1992 ; Dib-Hajj et al., 1993 ; Podar and Perlman, 1999 ) and may vary in length, usually in its terminal loop.


View this table:
[in this window]
[in a new window]
 
Table 2. Function and interactions of the six group II intron domains

 
Tertiary interactive sites include both stem and nonpairing nucleotides that interact internally with nucleotides in other regions of the ribozyme or externally with the flanking exons (the external and internal binding sites, EBS and IBS, respectively). Helix D1d (Domain I, subdomain d) contains the three known external binding sites, EBS1, EBS2, and EBS3. These segments of the ribozyme interact directly with exon internal binding site sequences IBS1, IBS2, and IBS3, respectively (Michel, Umesono, and Ozeki, 1989 ; Costa, Michel, and Westhof, 2000 ).

There are several intraribozymic interactive sites in a group II intron sequence that assist in giving the mature ribozyme its functional tertiary structure. Those proposed or identified by experimental evidence are indicated by Greek symbols in Fig. 1 (see Table 3 for details). Some of these interactions are essential for splicing, an example being {gamma}{gamma}'. Holländer and Kück (1999) were able to demonstrate in vivo that splicing of an intron in the chloroplast depends on the ability of {gamma}{gamma}' to form a Watson-Crick (canonical) pairing. The {zeta}{zeta}' interaction conserves the sequence identity of the terminal helix in domain V and the bulge in D1d1 (Peebles et al., 1995 ). Other interactions, such as {eta}{eta}', may not be as essential, for the absence of this interaction does not significantly diminish splicing efficiency in those introns investigated (Koch et al., 1992 ).


View this table:
[in this window]
[in a new window]
 
Table 3. Tertiary interaction sites in group II introns determined by experimental investigations

 
In terms of functional importance of secondary structures, domains I and V are by far the most crucial for catalytic requirements of the ribozyme, followed by domain VI (Dib-Hajj et al., 1993 ; Pyle, 1996 ; Boudvillain and Pyle, 1998 ; Konforti, Liu, and Pyle, 1998 ; Costa and Michel, 1999 ). Domains II, III, and IV are of lesser importance (Koch et al., 1992 ), although each probably participates either in a particular step of splicing or in enhancing the overall catalytic reaction (Costa et al., 1997 ; Boudvillain and Pyle, 1998 ). All domain boundaries (the primary stem of domain helices) are highly conserved in G2 introns (Michel, Umesono, and Ozeki, 1989 ; Saldanha et al., 1993 ).


    MITOCHONDRIAL GROUP II INTRONS
 TOP
 ABSTRACT
 INTRODUCTION
 GROUP II INTRON STRUCTURE...
 MITOCHONDRIAL GROUP II INTRONS
 CHLOROPLAST INTRON LOSS IN...
 MUTATION PATTERNS IN GROUP...
 TECHNIQUES
 ALIGNMENT AND ANALYSIS
 GROUP II INTRONS: A...
 LITERATURE CITED
 
There are nearly an equivalent number of group II introns in plant mitochondrial genomes (mtDNA) as in plant chloroplast genomes. Like plastome introns, mitochondrial G2 introns are thought to be derived by vertical descent (Palmer et al., 2000 ). Only a few studies to date have assessed the phylogenetic utility of mitochondrial group II introns in plants. The rps3 intron has shown minor variation at the interspecific level in Betula (Laroche and Bousquet, 1999 ), and the nad1b/c intron has recently been surveyed for phylogenetic characters in higher plants (Demesure, Sodzi, and Petit, 1995 ; Bakker et al., 2000 ; Freudenstein and Chase, 2001 ).

Two factors of mtDNA evolution may be partly responsible for the apparent difficulty in developing mitochondrial group II introns as phylogenetic tools. The first factor is the slow rate of synonymous substitution in mitochondrial DNA, which is estimated to be almost five times slower than that of chloroplast DNA (Wolfe, Li, and Sharp, 1987 ; Schuster and Brennicke, 1994 ). The second factor is the frequency of recombination in plant mtDNA that can sometimes relocate exon and intron elements of a disrupted gene to separate regions of the genome. The result can be a fragmented intron, of which one or more domains are scattered through the genome (Chapdelaine and Bonen, 1991 ; Wissinger, Schuster, and Brennicke, 1991 ; Knoop and Brennicke, 1993 ; Malek and Knoop, 1998 ; Sainsard-Chanet, Begel, and d'Aubenton-Carafa, 1998 ). One such recombination event gave rise to a "tripartite" intron in Oenothera berteriana (Knoop, Altwasser, and Brennicke, 1997 ), in which three separate fragments of a group II intron must now be brought together as post-transcriptional pre-mRNAs to form a functional splicing ribozyme. The reaction is referred to as trans-splicing (see Bonen, 1993 ; Knoop and Brennicke, 1993 ; Doetsch et al., 2001 ) and is thought to be the general mechanism for the splicing of all fragmented G2 introns in the mitochondrial genome. A targeted mitochondrial intron that is fragmented could make polymerase chain reaction (PCR) amplification difficult or impractical.


    CHLOROPLAST INTRON LOSS IN ANGIOSPERMS
 TOP
 ABSTRACT
 INTRODUCTION
 GROUP II INTRON STRUCTURE...
 MITOCHONDRIAL GROUP II INTRONS
 CHLOROPLAST INTRON LOSS IN...
 MUTATION PATTERNS IN GROUP...
 TECHNIQUES
 ALIGNMENT AND ANALYSIS
 GROUP II INTRONS: A...
 LITERATURE CITED
 
A few group II introns have been lost in certain chloroplast genomes. Independent cases of intron loss with host gene persistence are rare in higher plants (Downie et al., 1991 ; Palmer and Delwiche, 1998 ), although there are documented cases of multiple independent intron losses in closely related taxa (e.g., Bauhinia, Lai et al., 1997 ; Medicago, Downie et al., 1998a ). Intron absence from a plastome can merely reflect the loss of the host gene and not a specific intron loss event (Doyle, Doyle, and Palmer, 1995 ). In cases when the host gene is still present but the intron is not, the loss of the intron is presumed to have occurred by recombination between a post-splicing cDNA and the original disrupted gene sequence (Palmer and Delwiche, 1998 ).

Several higher plant lineages have been extensively surveyed for the presence of chloroplast group II introns. The rpl2 intron has the most widespread reported losses, with absences in members of at least 17 angiosperm families: Aizoaceae, Amaranthaceae, Basellaceae, Cactaceae, Caryophyllaceae, Chenopodiaceae, Convolvulaceae, Cuscutaceae, Didiereaceae, Droseraceae, Fabaceae, Geraniaceae, Menyanthaceae, Nyctaginaceae, Phytolaccaceae, Portulacaceae, and Saxifragaceae (Downie et al., 1991 ; Doyle, Doyle, and Palmer, 1995 ; Lai et al., 1997 ). The rpl16 intron is absent in certain members of the Geraniaceae, Goodeniaceae, and Plumbaginaceae (Downie and Palmer, 1994 ; Campagna and Downie, 1998 ). The rpoC1 intron is missing in members of the Aizoaceae, Cactaceae, Fabaceae, Goodeniaceae, Passifloriaceae, and the Poaceae (Downie, Llanas, and Katz-Downie, 1996 ; Katayama and Ogihara, 1996 ; Downie et al., 1998a ). The rps12 intron has been lost in at least three members of Anemone (Hoot and Palmer, 1994 ). The absence of the rps16 intron in Epifagus (Wolfe, Morden, and Palmer, 1992 ) and several genera in Fabaceae (Downie and Palmer, 1992 ; Doyle, Doyle, and Palmer, 1995 ) is due to loss of the rps16 gene itself in these plastomes.


    MUTATION PATTERNS IN GROUP II INTRONS
 TOP
 ABSTRACT
 INTRODUCTION
 GROUP II INTRON STRUCTURE...
 MITOCHONDRIAL GROUP II INTRONS
 CHLOROPLAST INTRON LOSS IN...
 MUTATION PATTERNS IN GROUP...
 TECHNIQUES
 ALIGNMENT AND ANALYSIS
 GROUP II INTRONS: A...
 LITERATURE CITED
 
Molecular function and structural requirements are widely expected to be a source of complex mutation patterns in nucleic acids (e.g., Learn et al., 1992 ; Downie, Katz-Downie, and Cho, 1996 ; Hickson et al., 1996 ; Kellogg and Juliano, 1997 ; Ballard et al., 1998 ; Soltis and Soltis, 1998 ; Hershkovitz, Zimmer, and Hahn, 1999 ). All biological molecules have physical structure, and selection preserving that structure for functional purposes may translate to heterogeneous mutation patterns in its underlying DNA sequence. As such, it is expected there is a direct link between molecular function, molecular structure, and mutational dynamics in nucleic acids.

For example, conservation of amino acids in a protein can lead to codon-specific substitution patterns in protein-coding sequences due to the flexibility of the genetic code (e.g., Reeves, 1992 ; Olmstead, Reeves, and Yen, 1998 ; Berg, 1999 ; McClellan, 2000 ). Conservation of the active site in the RuBisCo protein complex restricts the number of mutable sites in the rbcL sequence of plant chloroplast genomes (Kellogg and Juliano, 1997 ). Conformational requirements of ribosomal RNA can influence the degree, distribution, and nature of nucleotide change in rDNA (e.g., Hickson et al., 1996 ; Soltis and Soltis, 1998 ; Hershkovitz, Zimmer, and Hahn, 1999 ). Finally, group II intron function in higher plants may create mutation rate variation among ribozyme structures experiencing differing functional constraints (Learn et al., 1992 ; Clegg et al., 1994 ; Downie et al., 1998b ).

Heterogeneous mutation patterns in the rpl16 intron in Myoporaceae
As part of an ongoing phylogenetic analysis of Myoporaceae (a probable lineage of Scrophulariaceae sensu Olmstead et al. [2001] ), the chloroplast rpl16 intron was sequenced by the author for 46 taxa representing nearly 30 morphologically defined lineages. The rpl16 intron is among the fastest evolving sequence regions in the plastome (Wolfe, Li, and Sharp, 1987 ; Small et al., 1998 ; Downie, Katz-Downie, and Watson, 2000 ) and is often used for inter- and infrageneric phylogeny estimation in plants (Table 1). Mutation patterns were assessed across the entire sequence as well as on a partition by partition basis. The results were presented at a recent conference (Kelchner et al., 2000 ) and will be summarized here to illustrate the heterogeneous manner of mutation accumulation in a chloroplast group II intron.

The mutation data was compiled by a direct tally of mutations across a matrix of aligned sequences. In general, a tally of mutations is not an optimal approach to mutation pattern assessment for at least three reasons: first, a tally cannot accommodate superimposed mutation events; second, it does not take into account any influence that historical relationship may have on the distribution of character state variation; and third, it does not adequately test the possibility that observed heterogeneity in mutation patterns may still be the result of a largely stochastic mutation process under a uniform model. However, in this study sequence variation is very low, what variation exists is largely autapomorphic, and there is no available independently derived phylogeny for the taxa. A statistical test of the difference between observed and expected mutation patterns is not readily applicable, for without a model of mutation and a phylogeny to map change upon, it is difficult to determine expected values for the manner and distribution of mutations in these sequences.

The probable recency of the family's origin and the abundance of autapomorphic change in the rpl16 intron sequences provide an interesting opportunity for estimating mutational "tendencies" in a group II intron. The majority of observed rpl16 intron mutations across Myoporaceae are autapomorphic (110 substitutions), and potentially informative character state transformations are relatively few in number (38 substitutions). Tallying mutations across such a sequence matrix should minimize the potential influence of hierarchical structure on observed mutation patterns. From the very low p distance values between sequences (0.00% to 1.73%) we would expect that superimposed substitutions are limited in number (but certainly not impossible; see Kelchner and Clark, 1997 ).

Sequence alignment followed the criterion of Kelchner (2000) , which integrates structural and mutation class arguments for character homology with the conventional sequence similarity approach. Secondary structures for each sequence in its RNA form were determined using the domain-by-domain folding method (see below, Techniques: Inferring G2 intron secondary structures). Three data partitions were considered: partitioning by each of six G2 intron domains (D1–D6); partitioning by four structural categories of stem, loop, bulge, and single-strand interhelix sequence (in the manner of Vawter and Brown, 1993 ); and a partition consisting of the entire intron sequence.

All rpl16 intron nucleotide characters in Myoporaceae were readily assigned to domain partitions because of the distinctiveness of domain boundary sequences (Michel, Umesono, and Ozeki, 1989 ). The classification of nucleotides into four structural categories was more difficult. Multiple minimum free energy foldings exist for terminal loops in helices D3 and D4. In these cases, nucleotides not decisively placed in a structural category were classified as "ambiguous" and removed from the analysis. Difficulties notwithstanding, 822 of the 953 nucleotides in the aligned matrix (86.25%) could be assigned to a structural class.

Rate heterogeneity ({alpha} parameter) for the unpartitioned data set was estimated under the HKY85 + {Gamma} likelihood model. The tree and parameter estimation analysis took nearly three weeks on a G3 computer using PAUP*4 beta 4 (Swofford, 1998 ). Substitution class frequencies, base composition, and distribution of variable and potentially informative characters were calculated for all partitions.

If selective constraints are consistent across all nucleotide sites in a G2 intron sequence, then each subset (partition) of an intron sequence should reflect the mutation pattern of the entire sequence as a whole. Any strong deviation of mutation patterns among partitions or between a partition and the entire sequence should indicate the presence of heterogeneous mutation processes in the data. Furthermore, if the group II intron sequences were under minimal selective constraints and evolving in a neutral fashion, we would expect to find nearly equal base composition (i.e., 25% frequency of each nucleotide), a more or less equal distribution of substitutions among sites, and about twice as many transversions as transitions. Each character partition, if sufficiently large in sample size, would also be expected to show these patterns of mutation.

The results of the mutation pattern assessment for the rpl16 intron in Myoporaceae are presented in Fig. 2. Overall, there does not seem to be a consistency in mutation pattern between all partitions, suggesting that an heterogeneous mutation process underlies this data. Several of the following points are particularly interesting in the context of phylogenetic analysis.



View larger version (67K):
[in this window]
[in a new window]
 
Fig. 2. Mutation patterns in character partitions of 46 rpl16 intron sequences in Myoporaceae. Partitions were created for all domains (domain I–VI; D1–D6) and four structural categories (S, stems; L, loops; B, bulges; I, interhelical sequence). All nucleotides were allocated to one of six domain categories; 822 of 953 nucleotides (86%) could be unambiguously assigned to stem, loop, bulge, or interhelical sequence partitions. (A) Relative number of variable and potentially informative character changes by domain; domain V (D5) showed no observable mutations. (B) Relative number of variable and potentially informative character state changes by structural category; "All" indicates the value when assessed across the entire sequence matrix. (C) Frequency of substitution classes across the entire sequence matrix. (D) Transition bias in each partition, expressed as the transition : transversion (Ti : Tv) ratio. (E) Base composition by structural category. (F) Base composition by domain

 
In the Myoporaceae rpl16 intron matrix, the percentage of potentially informative characters for phylogenetic analysis seems evenly distributed between domain and structural classes (Fig. 2A, B). This is despite partition differences in the number of variable characters, which are particularly high in both domain II and the bulge nucleotides. Some authors (e.g., Dixon and Hillis, 1993 ; Miyamoto et al., 1994 ) have considered an a priori weighting scheme for characters occurring in stem structures with the expectation that stem nucleotides have lower rates of mutation due to their role in secondary structure formation. However, there is a nearly equal distribution of informative characters between all partitions of the Myoporaceae data, even though the number of variable characters in the stem partition is lowest among the structural categories. This suggests that at the level of divergence in Myoporaceae rpl16 intron sequences, no particular domain or structural category would be expected to contribute unequally to a phylogenetic analysis.

There is also evidence of variation in substitution types by partition, as well as unequal frequency of substitution classes across the entire sequence. The most common substitution class in the matrix is A/G, which is ten times more frequent than C/G substitutions (Fig. 2C). Transitions are more frequent than transversions when averaged across the matrix, but there is variation in the degree of the transition : transversion ratio between partitions. The stem category of structural partitions shows a very high transition rate, nearly four times higher than the transition rate across the entire matrix (Fig. 2D). This is perhaps the most dramatic example in the study of a structural partition deviating from an averaged sequence value for a mutation category.

In terms of base composition, loops, bulges, and interhelical sequences are particularly rich in A, loops having almost twice the A content of stems (Fig. 2E). In the domain partitions, domains I–IV and domain VI are all high in A/T content, although domain IV has nearly twice the frequency of T as domain V (Fig. 2F). Base composition in domain V approaches equivalency for all nucleotide states, perhaps due to strong functional constraints resulting in a relative increase in G/C content.

Although this was not a statistical test of mutation dynamics, the variation of mutation patterns between subsets of a group II intron sequence is consistent with the findings of other researchers (e.g., Learn et al., 1992 ; Downie, Katz-Downie, and Cho, 1996 ; Downie, Katz-Downie, and Watson, 2000 ) and suggests that heterogeneous modes of mutation may be a general feature of group II introns. The presence of heterogeneous processes in G2 intron sequence data has important implications for their use in phylogenetic analysis (see below, Alignment and analysis).

Parameter values in likelihood analysis try to account for such mutational biases, but are usually estimated in a likelihood framework using the entire sequence. Figure 2, however, illustrates that parameter values derived from the entire sequence may differ from those estimates derived within specific partitions. For example, a transition : transversion ratio for the entire rpl16 intron data in Myoporaceae is 1.37 : 1, but for nucleotides in the stem category this ratio is more than 5 : 1. Applying the total sequence average of 1.37 to an analysis of Myoporaceae rpl16 intron sequences would be treating 57% of the categorized nucleotides (those in RNA stem positions) under an improper value for transition rate.

Constraints on sequence evolution
Site-specific limitations on character state transformation
Several sites in G2 intron sequences experience a restriction in potential character states due to specific functional requirements of a nucleotide or secondary structure. Many of these sites have been tested experimentally by point-mutation studies to assess their influence on splicing reaction efficiency. Figure 1 indicates the many sites that are thought to be restricted solely to purines (R) or pyrimidines (Y). Most of these nucleotides are involved in tertiary interactions with other regions of the ribozyme. Other nucleotides are highly conserved in all group II introns and are examples of the "invariable" character in phylogenetic analysis (see Lockhart et al., 1996 ; Steel, Huson, and Lockhart, 2000 ). Two specific cases include the A in the D6 bulge (marked in bold with an asterisk, Fig. 1) and the primary 5' intron nucleotide G (also referred to as the "G1" nucleotide; Holländer and Kück, 1999 ). Both nucleotides are involved in the transesterification reactions that cleave the pre-mRNA substrate, and any change in character state for either site will prevent splicing.

Mikheeva et al. (2000) recently investigated the conservation of the sequential GA nucleotide couplet just upstream of domain III (Fig. 1). The GA couplet is present at this structural location in almost all group II introns, which suggests these nucleotides possess a functional importance. Mikheeva et al. (2000) found that the dinucleotide contributes to the second step in intron splicing reactions and occupies an important spatial position in the ribozyme tertiary structure. Alteration or deletion of the GA led to highly reduced splicing efficiency.

The {gamma}{gamma}' interaction must pair canonically (G-C, A-U) or significant reduction in splicing efficiency results (Holländer and Kück, 1999 ). We may therefore infer that any change in one {gamma} nucleotide must be accompanied by a simultaneous and compatible mutation in its {gamma} partner to retain Watson-Crick pairing of these two sites. Site mutation studies of the D5 loop sequence, GAAA, greatly reduced splicing efficiency (Chanfreau and Jacquier, 1994 ), probably by limiting its {zeta}{zeta}' tertiary interaction with domain I (Costa and Michel, 1995 ; Jacquier, 1996 ).

Group II intron loops consistently demonstrate very high A and T content in comparison to the relatively G/C rich stem nucleotides, a feature reminiscent of loop sequence in group I introns and mitochondrial and nuclear RNAs (e.g., Ballard et al., 1998 ; Gutell et al., 2000 ). Additional character state restrictions that may be present in RNA stems are discussed below (High transition rates in stems).

G-U "wobble" base pairing
The G-U, or "wobble," base pair is a non-Watson-Crick association that is fundamental to nearly every class of RNA, including introns (Varani and McClain, 2000 ). Group II intron folding models invoke a great number of G-U pairings, emphasizing why it is essential to fold intron DNA sequences as RNA transcripts. In their review of G-U pairing and its importance in biological systems, Varani and McClain (2000) list several properties of the pairing that may be invaluable for catalytic RNA. Among these properties are the conformational flexibility that permits "sharp turns" in RNA structures, the positioning of metal ions at active sites, increased electronegativity in the major groove of paired nucleic acid strands to create a recognition site by induced fit or chemical identity, and provision of a thermodynamically viable alternative to canonical base pairing. In group II introns, some G-U pairing can be highly conserved, two examples being the functionally important G-U pair in domain V (Peebles et al., 1995 ; Abramovitz, Friedman, and Pyle, 1996 ; Konforti, Liu, and Pyle, 1998 ; see Fig. 1) and the G-U pairs often surrounding the branch site (the A bulge) in domain VI (Chu et al., 1998 ).

Though less energetically stable, C-A pairing may also be prevalent in group II intron structures. Varani and McClain (2000) suggest that C-A bonding can provide many of the structural features of the G-U pairing. More importantly, perhaps, for mutation dynamics in G2 introns, C-A pairs may be of minimal hindrance in the formation of key stem structures and therefore may not be strongly selected against in certain structural positions.

High transition rates in stems
Stem nucleotide substitutions may be more likely to persist in group II intron sequences if they are transitions. This can be understood in terms of selective constraints conserving stem structure. If stem formation in a ribozyme must be maintained for a functional reason, substitutions occurring in stem nucleotides should only persist if they do not significantly reduce the likelihood of proper stem formation. In nuclear RNA, it has been reasoned that mutation leading to nonpairing nucleotides within a stem must eventually result in compensatory mutation to maintain stem structure (Wheeler and Honeycutt, 1988 ; Dixon and Hillis, 1993 ; Muse, 1995 ; Springer, Hollar, and Burk, 1995 ; Hickson et al., 1996 ). It is proposed that such compensatory mutations happen in a step-wise process over time (Rousset, Pelandakis, and Solignoc, 1991 ; Kraus et al., 1992 ; Gatesy et al., 1994 ), which requires the mispairing to persist until a compensatory mutation can take place.

Consider a case, however, in which an RNA structure may be so highly constrained that any mispairing would result in an altered structure and loss of function. For example, the domain V helix of group II introns is nearly invariable in its stem and loop lengths and is an essential pillar in the tertiary folding of the intron ribozyme (Fig. 3). Point mutation studies of this domain reveal a link between precise structure formation and splicing efficiency of the intron (Peebles et al., 1995 ; Höllander and Kück, 1999 ). In such a structure, selection may only tolerate those mutations that maintain immediate pairing after the substitution—in other words, it would not be possible in such cases to achieve compensatory mutation by a stepwise process.



View larger version (19K):
[in this window]
[in a new window]
 
Fig. 3. The unequal effect of transversion and transition substitutions on the energy of structure formation in a highly constrained group II intron helix, D5ii. (A) A transversion of C to G (circled nucleotides) requires a compensatory change in a second nucleotide site to restore proper structural configuration. Compensatory mutation generally requires the helix to persist as an altered, suboptimal structure until pairing can be restored with a second substitution event. In this case, there is a tenfold increase in energy required to fold the sequence ({Delta}G becomes nearly zero, at –0.6 kcal/mol) after the primary transversion event. (B) A transition at the same site in RNA can maintain relatively favorable folding conditions (in terms of {Delta}G) due to acceptable noncanonical pairings in RNA (G-U and C-A). In functionally constrained formations such as those of group II introns, alteration of the structure may block ribozyme function. Therefore, transitions would be more likely to persist in these stem structures, perhaps contributing to the high frequency of transition mutations observed in group II intron stem nucleotides

 
The need to preserve an RNA stem in a nearly exact form would allow only four possible mutations to persist, each resulting in an acceptable (G-U, C-A) non-canonical pairing for RNA: G to A, C to T, A to G, and T to C. Notably, each of these substitutions are transitions. Mutations may still occur relatively frequently in functionally constrained secondary structures, but we would expect only transitions to persist in those stems that are under the most intense degree of selection.

If correct, this reasoning provides at least one possible explanation for the low number or total absence of observed compensatory mutations in the some group II intron studies (e.g., Laroche and Bousquet, 1999 ; this paper) and the high rate of transitions in intron stems. It also suggests that any RNA sequence demonstrating exceptionally high transition rates may be under intense structural conservation as a stem structure.

Positional rate heterogeneity
As a consequence of variable functional constraints on different structural features, we would expect a certain heterogeneity in mutation rates per site in group II intron sequence data. The phenomenon is referred to as "positional" (Steel and Penny, 2000 ) or "among-site" rate heterogeneity (Kuhner and Felsenstein, 1994 ; Yang, 1994 , 1996 ). Some sites are immutable in G2 introns (discussed above); others, such as nonpairing nucleotides in RNA, may experience particularly high levels of substitution (e.g., Hickson et al., 1996 ; Downie, Katz-Downie, and Watson, 2000 ; and Fig. 2B).

Under a neutral evolution hypothesis, site mutation probabilities are expected to follow a normal distribution. When positional rate heterogeneity is present, site mutation probabilities more closely follow a gamma distribution. The {alpha} parameter of likelihood/distance models describes the shape of a gamma distribution function for site-mutation probabilities. A low {alpha} value (near zero) indicates a highly skewed distribution of mutation rates and strong positional rate heterogeneity; a higher value describes a more equal mutation probability per site (Yang, 1994 ; Swofford et al., 1996 ).

We might expect a certain level of substitution rate heterogeneity between sites in group II intron sequences due to structural constraints that may occasion heterogeneous mutation processes. Interestingly, for the 46 rpl16 intron sequences in Myoporaceae, the {alpha} parameter estimate under an HKY85 + {Gamma} model was "infinite." An infinite value for the {alpha} parameter signifies that site mutation probability is equivalent for all sites—in other words, the estimation method has not detected significant positional rate heterogeneity, as assessed by a full-sequence estimate of the {alpha} parameter under the HKY85 + {Gamma} model in Myoporaceae.

Does this mean a group II intron may have no significant positional rate differences in its sequence? The reality of site mutation rates in a G2 intron may be more complex than it first appears. The author ran the same likelihood analysis (with {alpha} parameter estimated under the given model) on the aligned sequences in Table 4Go for the conserved regions of rpl16 intron sequences for 21 higher plants. This time, the {alpha} parameter estimate was 0.367, indicating significant positional rate heterogeneity in these partitioned sequences from structurally conserved regions. Therefore, at this high taxonomic level across angiosperms, a partition of conserved characters in a group II intron shows a skew in site mutation probabilities that is not detectable in a lower-level analysis of complete intron sequences.


View this table:
[in this window]
[in a new window]
 
Table 4. Matrix of 185 structure-forming nucleotides of the chloroplast rpl16 intron in 22 species of higher plants. Secondary structure nomenclature above each sequence corresponds with the general group II intron model (Fig. 1). Domains I–VI are designated as D1–D6. The 5' and 3' modifiers refer to relative sequence position in a folded single-strand RNA stem (e.g., di5' and di3' together make up the di stem in domain I). The number of nucleotides separating each structural element is included in brackets between labeled structural features. The {psi} symbol indicates the position of the d3 bulge in domain I, which is lacking the {alpha} tertiary interaction in the rpl16 intron and can be highly variable in length. Question marks indicate missing data, usually due to the proximity of the F71 primer to the 5' intron boundary

 

View this table:
[in this window]
[in a new window]
 
Table 4. Continued

 
Apparently, we have more to learn about the nature and possible temporal aspect of positional rate heterogeneity in group II introns (e.g., Sullivan, Holsinger, and Simon, 1996 ). It may be that the infinite parameter value here is a function of the fairly even distribution of informative sites between structural and domain categories in rpl16 intron sequences of Myoporaceae. It may also be a result of assessing the gamma parameter across the entire sequence and not by individual partitions. At present, it may be worth the practitioner's time to carefully assess the value of the {alpha} parameter in terms of both the entire sequence and each constituent partition (structural and domain) before incorporating an {alpha} parameter estimate in a likelihood analysis of G2 intron sequences.

Linked mutations in G2 introns
Nonindependence of nucleotide characters is plainly as much an issue in group II intron sequences as it is in any sequence underlying a structured molecule (Kjer, 1995 ; Huelsenbeck and Neilsen, 1999 ; Kelchner, 2000 ; Tufféry and Darlu, 2000 ; Felsenstein, 2001 ). Nucleotide sites in conserved intron secondary structures evolve in conjunction with their pairing nucleotides in an RNA stem. In the 46 Myoporaceae taxa of this study, 57% of all categorized nucleotides in each rpl16 intron sequence occurred in stem formations, illustrating the extent of nonindependent characters in a G2 intron sequence.

Huelsenbeck and Neilsen (1999) discuss a form of character nonindependence that involves correlated mutation events through time—for example, one mutation may increase the likelihood of additional mutations that are linked with the primary event. Temporally correlated mutations that may occur in group II introns include length variation due to increased slipped-strand mispairing activity in a region of numerous adjacent sequence repeat units (Kelchner, 2000 ). Another example, described by Kelchner and Wendel (1996) , are the multiple minute inversion events linked with the formation of a hairpin just downstream of helix D1d2 in the rpl16 intron. Both situations may give rise to accelerated rates of mutation due to the presence of a "mutational trigger" (Kelchner and Clark, 1997 )—a specific sequence pattern that increases the likelihood of subsequent mutation events linked to that sequence pattern (see also Graham et al., 2000 ).

Nonindependence of characters in a sequence data set can be somewhat alleviated by the application of complex models of character evolution during phylogeny estimation and evaluation of clade support. Compensating for nonindependent characters in G2 intron sequence analysis is discussed in the section ALIGNMENT AND ANALYSIS.


    TECHNIQUES
 TOP
 ABSTRACT
 INTRODUCTION
 GROUP II INTRON STRUCTURE...
 MITOCHONDRIAL GROUP II INTRONS
 CHLOROPLAST INTRON LOSS IN...
 MUTATION PATTERNS IN GROUP...
 TECHNIQUES
 ALIGNMENT AND ANALYSIS
 GROUP II INTRONS: A...
 LITERATURE CITED
 
PCR amplification and sequencing
Polymerase chain reaction amplification of organellar group II introns is usually straightforward. In chloroplast genomes, many G2 introns are usually 600–1200 residues in length, the main exception being trnK, which still carries its functional maturase ORF, matK. The size of these sequence regions make them easy to amplify in standard PCR reactions. Because G2 introns are generally employed at moderate or low taxonomic levels, primers are placed in the surrounding exons of the host gene, which generally demonstrate a much higher level of sequence conservation than the intron as a whole.

Highly conserved structural features of a G2 intron may serve as excellent sites for internal primers, such as the 3' primary stem sequence of domain III and the adjacent 5' primary stem sequence of domain IV. Although this is a case of how intron structure can be exploited for more efficient PCR reactions, some phenomena related to intron structure may negatively affect amplification of double-stranded intronic DNA. Lengthy stem structures can form in single-stranded DNA template during the PCR reaction, particularly if the DNA version of the RNA stem is composed solely of canonical pairings (G-C, A-T). If such a stem has a strong (exceptionally negative) {Delta}G value, this structure in the PCR template could make amplification and sequencing difficult.

One may address this difficulty in a similar manner as countering secondary structure-based problems in ITS and other rDNA. Baldwin et al. (1995) suggest using high-temperature PCR and sequencing reactions in such cases to assist in disassociating secondary structures in the template. Dimethyl sulfoxide (DMSO) can also be helpful in limiting structure formation (Winship, 1989 ), as well as formamide (Zhang, Reading, and Deisseroth, 1992 ) and mild detergents (Bechmann, Luke, and Hunsmann, 1990 ).

Another potential problem in group II intron sequencing is the occasional presence of long mononucleotide repeats, usually of A or T. If particularly lengthy, such regions can appear polymorphic in sequence reads, perhaps due to variation in repeat length in somatic cells of a plant. The usual solution is to sequence the complementary strand from the opposite end of the intron up to the mononucleotide repeat and then relinquish the assessment of repeat length homology in this region, which is impossible to accurately determine and may be polymorphic in a single taxon.

Inferring G2 intron secondary structures
Secondary structures for group II introns are created from the RNA equivalent of a DNA sequence (i.e., T becomes U). However, attempting to fold the entire sequence at once with folding programs like GCG's Stemloop (Genetics Computer Group, Madison, Wisconsin, USA) or Zucker's MFOLD (Zuker, 1989 ; http://bioinfo.math.rpi.edu/~zukerm/nph-home.cgi) seldom returns a folding pattern in complete agreement to the accepted and experimentally tested Michel, Umesono, and Ozeki (1989) structural model. This may be due to the fact that many pairings in G2 introns occur between elements that are widely separated in the sequence itself, an example being the 5' sequence for stem D1i and its 3' complement, which can be as much as 400 nucleotides further along the sequence.

Learn et al. (1992) could not recover a consistent minimum free energy (MFE) folding pattern for domain I in the trnV intron in several taxa, but did retrieve the standard model for domains II–VI. The authors suggest this may be due to a potential interaction of domain I with the maturase when forming the RNP complex, although there is no evidence yet that such an interaction exists.

A strategy that largely circumvents such difficulties is to fold each domain individually, a process referred to here as "domain-by-domain" folding. Each domain is first folded within the limit of its established domain boundary sequences (the nucleotides composing the primary stem of the domain), then merged into a single folded structure for the complete intron ribozyme. The domain-by-domain method is intended to produce folding patterns consistent with Michel, Umesono, and Ozeki (1989) and other recent G2 intron structure analyses (e.g., Boudvillain and Pyle, 1998 ; Boudvillain, de Lencastre, and Pyle, 2000 ; Costa, Michel, and Westhof, 2000 ; Toor, Hausner, and Zimmerly, 2001 ) and was used by the author to specify structural categories in the study of Myoporaceae mutation patterns in this paper.

Domain-by-domain folding is dependent on the proper identification of domain boundaries and certain structural elements in the sequence. In their appendix, Michel, Umesono, and Ozeki (1989) identify these boundaries for many chloroplast G2 introns from a few representative species of higher plants. To assist in structure identification in the rpl16 intron in land plants, the domain boundary sequences and other structural elements for this intron are listed in Table 4 for Pinus thumbergia, Marchantia polymorpha, and 20 angiosperm species.

Folding of domain I and domain IV within their domain boundaries may still return multiple structural arrangements. For domain I, the next step is to fold within subdomains (e.g., the D1C helix, the D1d helix, etc.). Structural variation in domain IV is very common and is usually the result of loss, gain, or rearrangement of sequence. The lack of conserved structural motifs in domain IV is probably due to the absence of strong functional constraints on this domain's structure.


    ALIGNMENT AND ANALYSIS
 TOP
 ABSTRACT
 INTRODUCTION
 GROUP II INTRON STRUCTURE...
 MITOCHONDRIAL GROUP II INTRONS
 CHLOROPLAST INTRON LOSS IN...
 MUTATION PATTERNS IN GROUP...
 TECHNIQUES
 ALIGNMENT AND ANALYSIS
 GROUP II INTRONS: A...
 LITERATURE CITED
 
Alignment and gap coding
Algorithmic alignment programs such as CLUSTAL that proceed by maximizing character state similarity across taxa achieve little in the way of correcting for linked, multinucleotide mutations because they implicitly assume each nucleotide character evolves independently of all others in the matrix and that all mutations involve single nucleotide events (Ritland and Eckenwalder, 1992 ; Swofford et al., 1996 ; Page and Holmes, 1998 ; Lutzoni et al., 2000 ). These assumptions are invalid in most nucleic acids, including group II introns. Gap weighting during CLUSTAL alignment may assist in recovering linked and multibase mutations, but a priori assignment of gap weights that accurately reflect the manner of sequence evolution in group II introns will be difficult and controversial. In the case of structured sequence data, we may instead incorporate evidence of molecular structure (e.g., Kjer, 1995 ; Hickson et al., 1996 ) and mutational mechanisms (e.g., Kelchner and Clark, 1997 ; Downie, Katz-Downie, and Watson, 2000 ) into the alignment process. Both are valid, biologically relevant information that can be used to supplement our conventional sequence similarity criterion when estimating character homology among nucleotides (reviewed in Kelchner, 2000 ).

Structure-based alignment has already found widespread application in rDNA comparative sequence analyses, predominately in animal and fungal systematics (e.g., Wesson, Porter, and Collins, 1992 ; Kjer, 1995 ; Hickson et al., 1996 ; Morrison and Ellis, 1997 ; Ballard et al., 1998 ; Maddison, Baker, and Ober, 1999 ; Hickson, Simon, and Perry, 2000 ; Lutzoni et al., 2000 ; Lydeard et al., 2000 ). Because the majority of length mutation in group II introns occurs in nonpairing elements of the ribozyme structure (e.g., Downie, Katz-Downie, and Cho, 1996 ; Kelchner and Clark, 1997 ; Downie, Katz-Downie, and Watson, 2000 ; this study), gap positioning can be aided by identifying the conserved structures that surround these regions of length variation.

Summaries of structure-related sequence elements in specific introns (e.g., Table 4) can be used to establish stem structural elements in a G2 intron sequence prior to alignment. By aligning sequence elements corresponding to homologous structures, local boundaries around labile regions of length variation are automatically determined. It is then considerably easier to position indels, particularly when mutation mechanisms are considered as possible explanations for length variation in these regions (Kelchner and Clark, 1997 ; Kelchner, 2000 ).

Once gaps are positioned and indel homology is proposed, length mutations in G2 intron sequences can provide a rich source of characters for phylogeny estimation, particularly at low taxonomic levels (Downie, Katz-Downie, and Cho, 1996 ; Kelchner and Clark, 1997 ; Oxelman, Liden, and Berglund, 1997 ). A variety of methods and discussions on coding gap characters are now available (Graham et al., 2000 ; Lutzoni et al., 2000 ; Simmons and Ochoterena, 2000 ) to assist in scoring such microstructural changes for subsequent phylogenetic analysis.

Phylogenetic analysis
Nearly all published research using group II intron sequences for phylogeny estimation have employed equal-weight parsimony analysis for primary topology selection. However, equal-weight parsimony is susceptible to bias, particularly when branch lengths are not equivalent on a tree (Hendy and Penny, 1989 ; Goldman, 1990 ), when superimposed mutations have occurred (Felsenstein, 1978 , 1983 ; Swofford et al., 1996 ), and when positional rate heterogeneity is a feature of the data (Steel and Penny, 2000 ; Sullivan and Swofford, 2001 ). In other words, equal-weight parsimony is expected to be inaccurate for phylogeny estimation when faced with the kind of complex mutational patterns that regularly occur in group II intron sequence data.

This observation certainly does not rule out the parsimony method as a criterion for phylogeny estimation (e.g., Olmstead, Reeves, and Yen, 1998 ; Sanderson and Kim, 2000 )—it merely cautions that equal-weight parsimony as a model of character evolution is unlikely to represent the history of mutational change in group II intron sequences. Instead, the presence of mutational biases in the data suggests a weighted parsimony or maximum likelihood approach to phylogeny estimation (Lockhart et al., 1994 ; Yang, Goldman, and Friday, 1994 ; Swofford et al., 1996 ; Steel and Penny, 2000 ; Conant and Lewis, 2001 ).

It may be that some group II intron data sets contain enough hierarchical signal to be robust to variation in character mutation models. In such cases, each conventional likelihood or parsimony model would select the same topology of relationships between taxa. A simple strategy can indicate whether this is the case with an intron data set (Fig. 4). Most standard likelihood models are nested within a continuum of models that differ in complexity (here, the term complexity is correlated with the number of parameters in a model). The Jukes-Cantor model and GTR+I+{Gamma} model are commonly used extremes of this continuum ("simple" and "complex," respectively; see Posada and Crandall, 2001 ). By running a separate phylogenetic analysis under each of these two models, one can ascertain whether the signal in a G2 intron data set is robust to systematic error derived from the application of an inaccurate model of character evolution for the data. If so, the topology selected by the Jukes-Cantor model will be the same as that selected by GTR+I+{Gamma}. If the two trees differ, then it is evident the validity of any phylogenetic conclusions will depend on the model selected and how closely it fits the mutational history of the data. As stated above, for group II intron sequences it is unlikely this model will be equal-weight parsimony or Jukes-Cantor. All other conventional models require the incorporation of likelihood parameters or character weighting to compensate for the presence of mutational biases.



View larger version (22K):
[in this window]
[in a new window]
 
Fig. 4. An analysis strategy for tree selection using group II intron sequence data. Like coding, rDNA, and transcribed spacer sequences, group II intron sequence may evolve under heterogeneous mutation processes. One strategy for investigating and accommodating multiple mutation dynamics in tree selection is proposed here. Data in which an hierarchical signal will overcome violations in model assumptions should produce identical topologies under any character evolution model used in tree construction. However, measures of clade support (bootstrap, jackknife, and decay) will be influenced by the model used to build the tree (hence the asterisk for equal-weight parsimony). If topologies differ depending on the model employed, then character partitions should be examined to better identify the heterogeneous processes of mutation evident in the sequence. Weighted parsimony or maximum likelihood would serve as more accurate approaches to discerning phylogenetic relationships among sequences in these cases

 
How best to ensure the accuracy of parameter values and weighting schemes will remain a topic of debate and research for many years to come (Lewis, 1998 ; Page and Holmes, 1998 ; Sanderson and Kim, 2000 ; Posada and Crandall, 2001 ). It is known that incorrect values for particular parameters may in some cases lead to the consistent recovery of inaccurate topologies (e.g., Yang, 1996 ; Sullivan and Swofford, 2001 ). Consequently, the accuracy of parameter values for any data set is highly desirable. Unfortunately, the current method of parameter value estimation under the likelihood criterion has the potential to increase systematic error in phylogenetic analyses of group II intron sequences.

The standard approach is to specify a model under the likelihood criterion and then allow all parameter values to be estimated during the analysis. The values are thus derived from a set of characters that are expected to evolve within the confines of that particular model. But this approach could lead to error if different segments of the sequence are experiencing significantly different substitution dynamics, as they appear to be in structure-based partitions of the rpl16 intron in Myoporaceae. In such circumstances, it would be inaccurate to assess the model's parameter values across the entire sequence, as is commonly done in conventional likelihood analyses. Consider, for example, the excessive transition rate in stem nucleotides of this study (Fig. 2D), which is four times higher than that estimated across the entire data set. In this case, unless nucleotide characters are partitioned by structural categories, the overall ti : tv value as estimated by a uniform mutational model will be inaccurate for nearly all characters in the sequence matrix.

This observation points to one of the great advantages of group II introns as phylogenetic tools: there exists a sound structural model within which character partitions can be investigated to better determine the occurrence and distribution of mutational biases in the data. A variety of character subsets can be explored, from domain and structure partitions to smaller-scale groupings of specific nucleotides under known functional constraints. As demonstrated in this paper, such partitioning experiments can elucidate an otherwise hidden complexity in mutation dynamics and thus have the potential to more accurately detail an heterogeneous model of character evolution for nucleotides of G2 intron sequences. Wilgenbusch and deQueiroz (2000) used just such a strategy for their mitochondrial 12S rDNA sequences in sand lizards. The authors grouped RNA sequence characters into pairing and nonpairing nucleotides and treated each partition as if they evolved under different mutational constraints in an heterogeneous likelihood analysis. The parameters were estimated for each partition individually in deference to detectable and statistically significant variation in nucleotide substitution patterns between RNA stem and non-stem nucleotides.

Coded indel characters often carry a good deal of phylogenetic information that cannot at present be easily incorporated into standard likelihood analysis of nucleotide characters; this may be the primary reason why most researchers choose parsimony methods to estimate phylogeny from group II intron sequences. There might be a way, however, to use likelihood correction factors for substitutions (i.e., parameters) within a parsimony framework, thus allowing the addition of coded indel characters to a weighted parsimony analysis that captures the essence of heterogeneous substitution patterns in the data. Parsimony and maximum likelihood can select the same set of topologies under certain conditions (Hendy and Penny, 1989 ; Goldman, 1990 ; Tuffley and Steel, 1997 ), and one may be able to take advantage of this correlation by converting parameter estimates from likelihood analysis into scaled weight matrices for character classes in a parsimony tree search strategy (Cook, Gullan, and Trueman, in press ; S. A. Kelchner, unpublished data).

The conversion will not be precise, but if it is at all reasonable the advantages are threefold: first, weight schemes for parsimony analysis can be determined by likelihood in a more rigorous manner than conventional a priori character weighting; second, creating an heterogeneous model based on weight classes allows the inclusion of gap characters into the analysis (currently not applicable to most distance and likelihood analyses); and third, weighted parsimony analyses often take considerably less time than parameter-rich likelihood models. The method, however, has yet to be tested with either simulations or empirical data to determine how closely parsimony weight matrix conversions of heterogeneous likelihood models represent the same hierarchical signal in sequence data.

Finally, the model or models eventually selected for phylogenetic analysis will directly influence support measure values (decay, bootstrap, or jackknife) for the resultant tree. Even if identical trees eventuate under simple and complex models following the strategy detailed in Fig. 4, branch lengths may differ significantly in the trees depending on the model of character evolution employed. The degree of support on each branch is a function of the weights or probabilities for individual nucleotide substitutions (see Buckley, Simon, and Chambers, 2001 ); therefore, estimates of support for clades in a tree based on group II intron sequence data may differ considerably depending on the mutational model underlying the topology (e.g., Felsenstein, 1985 ; Bremer, 1994 ; Sanderson, 1995 ; Swofford et al., 1996 ; Oxelman, Backlund, and Bremer, 1999 ; Mort et al., 2000 ). This has significant bearing on any phylogenetic conclusions made from the data and highlights the importance of establishing appropriate and defendable models of character evolution for group II intron data sets.


    GROUP II INTRONS: A BOON FOR PHYLOGENY ESTIMATION
 TOP
 ABSTRACT
 INTRODUCTION
 GROUP II INTRON STRUCTURE...
 MITOCHONDRIAL GROUP II INTRONS
 CHLOROPLAST INTRON LOSS IN...
 MUTATION PATTERNS IN GROUP...
 TECHNIQUES
 ALIGNMENT AND ANALYSIS
 GROUP II INTRONS: A...
 LITERATURE CITED
 
Chloroplast group II introns are excellent phylogenetic tools. The primary benefit of plastome G2 introns is their probable vertical descent and single copy status. Those not located in the inverted repeat regions (IRs) of the chloroplast genome are almost always single copy, not just in the plastome but in the organism. And unlike some prokaryotic and lower eukaryotic group II introns, chloroplast G2 introns appear to have lost their ability to move, which may be due to the degradation of their open reading frames and the now corrupted RT domains of matK. Absence of mobility increases the likelihood that chloroplast G2 introns have evolved by vertical descent within their current host genes, which could make them accurate indicators of both host gene and plastome phylogenetic history.

On a technical level, group II introns are easily amplified and of a handy length for sequence analysis. Some, like the rpl16 intron, have amplification primers that work in a diverse array of plant lineages. Internal primers can be fashioned to correspond with conserved domain boundaries within the ribozyme structure.

All sequence regions used in phylogeny estimation produce structured molecules (proteins, RNA, ribozymic introns) or contain elements with functional purpose (intergenic spacers, transcribed spacers). In most cases the functional (and consequently structural) constraints are poorly understood, making detailed study of mutational processes in these regions difficult. Group II introns have the advantage that both their structure and function have been carefully investigated and largely confirmed in vitro and in vivo, and several site-specific constraints have been discovered and mapped. The core structural models of Michel, Umesono, and Ozeki (1989) apply to all chloroplast group II introns and can be used to identify and accommodate heterogeneous mutation patterns in sequence data, giving the researcher an opportunity to create better, partition-specific models of character evolution for phylogenetic analysis.

Variable selective constraints on conserved structural elements can produce group II intron sequences that contain hierarchical signal at remarkably divergent taxonomic levels. This point is nicely illustrated by their success in phylogeny estimation at both the infrageneric and intraordinal levels (e.g., Kelchner and Clark, 1997 ; Renner, 1999 ; Applequist and Wallace, 2000 ; Wanntorp et al., 2001 ). Specific regions of high sequence mutation, such as domain IV, could be targeted for low-level systematics or genetic markers by placing primers in neighboring, highly conserved sequence, such as the domain II and III boundary and domain V. Deep taxonomic questions could be addressed by focusing on only those sequence elements corresponding with crucial, functionally constrained structures like domains I and V.

Unlike sequence data from different process regions, such as ndhF or the trnL-F spacer, one may expect group II intron sequences to evolve under nearly the same selective constraints. This suggests that sequences from multiple G2 introns can be readily combined. The combination can even occur at the homologous structure level; that is, sequence data from identical structures of two or more chloroplast group II introns could be treated under the same model of character evolution. This unusual situation does not exist for any other set of sequence regions in the plastome; hence, it may well be that chloroplast group II introns are the most combinable form of sequence data in plants.


    FOOTNOTES
 
1 The author thanks Bernard Pfeil for assistance in compiling Table 4 , Conny Asmussen for providing two unpublished rpl16 intron sequences for Arecaceae, and Lyn Cook and John Trueman for discussion of analysis methodologies. The manuscript was significantly improved by the thoughtful comments and criticisms of Curt Brubaker, Mark Simmons, and two anonymous reviewers. Back


    LITERATURE CITED
 TOP
 ABSTRACT
 INTRODUCTION
 GROUP II INTRON STRUCTURE...
 MITOCHONDRIAL GROUP II INTRONS
 CHLOROPLAST INTRON LOSS IN...
 MUTATION PATTERNS IN GROUP...
 TECHNIQUES
 ALIGNMENT AND ANALYSIS
 GROUP II INTRONS: A...
 LITERATURE CITED
 
Abramovitz D. L. R. A. Friedman A. M. Pyle 1996 Catalytic role of 2'-hydroxyl groups within a group II intron active site. Science 271: 1410-1413[Abstract]

Anderson C. L. J. H. E. Rova L. Andersson 2001 Molecular phylogeny of the tribe Anthospermeae (Rubiaceae): systematic and biogeographic implications. Australian Systematic Botany 14: 231-244

Andersson L. S. Andersson 2000 A molecular phylogeny of Tropaeolaceae and its systematic implications. Taxon 49: 721-736[CrossRef][Web of Science]

Andersson L. M. W. Chase 2001 Phylogeny and classification of Marantaceae. Botanical Journal of the Linnean Society 135: 275-287[CrossRef]

Andersson L. J. H. E. Rova 1999 The rps16 intron and the phylogeny of the Rubioideae (Rubiaceae). Plant Systematics and Evolution 214: 161-186

Applequist W. L. R. S. Wallace 2000 Phylogeny of the Madagascan endemic family Didiereaceae. Plant Systematics and Evolution 221: 157-166

Asmussen C. B. 1999 Toward a chloroplast DNA phylogeny of the tribe Geonomeae (Palmae). Memoirs of the New York Botanical Garden 83: 121-129

Asmussen C. B. M. W. Chase 2001 Coding and noncoding plastid DNA in palm systematics. American Journal of Botany 88: 1103-1117[Abstract/Free Full Text]

Asmussen C. B. A. Liston 1998 Chloroplast DNA characters, phylogeny, and classification of Lathyrus (Fabaceae). American Journal of Botany 85: 387-401[Abstract]

Baker W. J. J. Dransfield 2000 Phylogeny, character evolution, and a new classification of the Calamoid palms. Systematic Botany 25: 297-322

Baker W. J. T. A. Hedderson J. Dransfield 2000 Molecular phylogenetics of subfamily Calamoideae (Palmae) based on nrDNA ITS and cpDNA rps16 intron sequence data. Molecular Phylogenetics and Evolution 14: 195-217

Bakker F. T. A. Culham C. E. Pankhurst M. Gibby 2000 Mitochondrial and chloroplast DNA-based phylogeny of Pelargonium (Geraniaceae). American Journal of Botany 87: 727-734[Abstract/Free Full Text]

Baldwin B. G. M. J. Sanderson J. M. Porter M. F. Wojciechowski C. S. Campbell M. J. Donoghue 1995 The ITS region of nuclear ribosomal DNA: a valuable source of evidence on angiosperm phylogeny. Annals of the Missouri Botanical Gardens 82: 247-277[CrossRef][Web of Science]

Ballard J. W. O. M. K. Thayer A. F. Newton E. R. Grismer 1998 Data sets, partitions, and characters: philosophies and procedures for analyzing multiple data sets. Systematic Biology 47: 367-396

Baum D. A. R. L. Small J. F. Wendel 1998 Biogeography and floral evolution of baobabs (Adansonia, Bombacaceae) as inferred from multiple data sets. Systematic Biology 47: 181-207

Bechmann B. W. Luke G. Hunsmann 1990 Improvement of PCR amplified DNA sequencing with the aid of detergents. Nucleic Acids Research 18: 1309[Free Full Text]

Berg O. G. 1999 Synonymous nucleotide divergence and saturation: effects of site-specific variations in codon bias and mutation rates. Journal of Molecular Evolution 48: 398-407

Besendahl A. Y. L. Qiu J. D. Palmer D. Bhattacharya 2000 The cyanobacterial origin and vertical transmission of the plastid tRNA(Leu) group-I intron. Current Genetics 37: 12-23

Bonen L. 1993 Trans-splicing of pre-mRNA in plants, animals, and protists. FASEB Journal 7: 40-46[Abstract]

Boudvillain M. A. de Lencastre A. M. Pyle 2000 A tertiary interaction that links active-site domains to the 5' splice site of a group II intron. Nature 406: 315-318[CrossRef][Medline]

Boudvillain M. A. M. Pyle 1998 Defining functional groups, core structural features and inter-domain tertiary contacts essential for group II intron self-splicing: a NAIM analysis. EMBO Journal 17: 7091-7104[CrossRef][Web of Science][Medline]

Bremer B. J. F. Manen 2000 Phylogeny and classification of the subfamily Rubioideae (Rubiaceae). Plant Systematics and Evolution 225: 43-72

Bremer K. 1994 Branch support and tree stability. Cladistics 10: 295-304[CrossRef][Web of Science]

Buckley T. R. C. Simon G. K. Chambers 2001 Exploring among-site rate variation models in a maximum likelihood framework using empirical data: effects of model assumptions on estimates of topology, branch lengths, and bootstrap support. Systematic Biology 50: 67-86

Campagna M. L. S. R. Downie 1998 The intron in chloroplast gene rpl16 is missing from the flowering plant families Geraniaceae, Goodeniaceae, and Plumbaginaceae. Transactions of the Illinois State Academy of Science 91: 1-11

Chanderbali A. S. H. van der Werff S. S. Renner 2001 Phylogeny and historical biogeography of Lauraceae: evidence from the chloroplast and nuclear genomes. Annals of the Missouri Botanical Gardens 88: 104-134[CrossRef][Web of Science]

Chanfreau G. A. Jacquier 1994 Catalytic site components common to both splicing steps of a group II intron. Science 266: 1383-1387[Abstract/Free Full Text]

Chapdelaine Y. L. Bonen 1991 The wheat mitochondrial gene for subunit I of the NADH dehydrogenase complex: a trans-splicing model for this gene-in-pieces. Cell 65: 465-472[CrossRef][Web of Science][Medline]

Cho Y. J. D. Palmer 1999 Multiple acquisitions via horizontal transfer of a group I intron in the mitochondrial cox1 gene during evolution of the Araceae family. Molecular Biology and Evolution 16: 1155-1165[Abstract]

Cho Y. Y.-L. Qiu P. Kuhlman J. D. Palmer 1998 Explosive invasion of plant mitochondria by a group I intron. Proceedings of the National Academy of Sciences, USA 95: 14244-14249

Chu V. T. Q. Liu M. Podar P. S. Perlman A. M. Pyle 1998 More than one way to splice an RNA: branching without a bulge and splicing without branching in group II introns. RNA 4: 1186-1202[Abstract]

Clausing G. S. S. Renner 2001 Molecular phylogenetics of Melastomataceae and Memecylaceae: implications for character evolution. American Journal of Botany 88: 486-498[Abstract/Free Full Text]

Clegg M. T. B. S. Gaut G. H. J. Learn B. R. Morton 1994 Rates and patterns of chloroplast DNA evolution. Proceedings of the National Academy of Sciences, USA 91: 6795-6801

Conant G. C. P. O. Lewis 2001 Effects of nucleotide composition bias on the success of the parsimony criterion in phylogenetic inference. Molecular Biology and Evolution 18: 1024-1033[Abstract/Free Full Text]

Cook L. G. P. J. Gullan H. G. Trueman In press A preliminary phylogeny of the scale insects (Hemiptera: Sternorrhyncha: Coccoidea) based on SSU rRNA. Molecular Phylogenetics and Evolution

Costa M. E. Deme A. Jacquier F. Michel 1997 Multiple tertiary interactions involving domain II of group II self-splicing introns. Journal of Molecular Biology 267: 520-536[CrossRef][Web of Science][Medline]

Costa M. F. Michel 1995 Frequent use of the same tertiary motif by self-splicing RNAs. EMBO Journal 14: 1276-1285[Web of Science][Medline]

Costa M. F. Michel 1999 Tight binding of 5' exon to domain I of a group II self-splicing intron requires completion of the intron active site. EMBO Journal 18: 1025-1037[CrossRef][Web of Science][Medline]

Costa M. F. Michel E. Westhof 2000 A three-dimensional perspective on exon binding by a group II self-splicing intron. EMBO Journal 19: 5007-5018[CrossRef][Web of Science][Medline]

Cousineau B. S. Lawrence D. Smith M. Belfort 2000 Retrotransposition of a bacterial group II intron. Nature 404: 1018-1021[CrossRef][Medline]

Demesure B. N. Sodzi R. J. Petit 1995 A set of universal primers for amplification of polymorphic noncoding regions of mitochondrial and chloroplast DNA in plants. Molecular Ecology 4: 129-131[Medline]

Dib-Hajj S. D. S. C. Boulanger S. K. Hebbar C. L. Peebles J. S. Franzen P. S. Perlman 1993 Domain 5 interacts with domain 6 and influences the second transesterification reaction of group II intron self-splicing. Nucleic Acids Research 21: 1797-1804[Abstract/Free Full Text]

Dixon M. T. D. M. Hillis 1993 Ribosomal RNA secondary structure: compensatory mutations and implications for phylogenetic analysis. Molecular Biology and Evolution 10: 256-267[Abstract]

Doetsch N. A. M. R. Favreau N. Kuscuoglu M. D. Thompson R. B. Hallick 2001 Chloroplast transformation in Euglena gracilis: splicing of a group III twintron transcribed from a transgenic psbK operon. Current Genetics 39: 49-60

Downie S. R. D. S. Katz-Downie 1999 Phylogenetic analysis of chloroplast rps16 intron sequences reveals relationships within the woody southern African Apiaceae subfamily Apioideae. Canadian Journal of Botany 77: 1120-1135[CrossRef]

Downie S. R. D. S. Katz-Downie K.-J. Cho 1996 Phylogenetic analysis of Apiaceae subfamily Apioideae using nucleotide sequences from the chloroplast rpoC1 intron. Molecular Phylogenetics and Evolution 6: 1-18

Downie S. R. D. S. Katz-Downie E. J. Rogers H. L. Zujewski E. Small 1998a Multiple independent losses of the plastid rpoC1 intron in Medicago (Fabaceae) as inferred from phylogenetic analyses of nuclear ribosomal DNA internal transcribed spacer sequences. Canadian Journal of Botany 76: 791-803[CrossRef]

Downie S. R. D. S. Katz-Downie M. F. Watson 2000 A phylogeny of the flowering plant family Apiaceae based on chloroplast DNA rpl16 and rpoC1 intron sequences: towards a suprageneric classification of subfamily Apioideae. American Journal of Botany 87: 273-292[Abstract/Free Full Text]

Downie S. R. E. Llanas D. S. Katz-Downie 1996 Multiple independent losses of the rpoC1 intron in angiosperm chloroplast DNA's. Systematic Botany 21: 135-151

Downie S. R. R. G. Olmstead G. Zurawski D. E. Soltis P. S. Soltis J. C. Watson J. D. Palmer 1991 6 independent losses of the chloroplast DNA rpl2 intron in dicotyledons—molecular and phylogenetic implications. Evolution 45: 1245-1259[CrossRef][Web of Science]

Downie S. R. J. D. Palmer 1992 Use of Chloroplast DNA rearrangements in reconstructing plant phylogeny. In D. E. Soltis et al. [eds.], Molecular systematics of plants, 14–35. Chapman and Hall, New York, New York, USA

Downie S. R. J. D. Palmer 1994 A chloroplast DNA phylogeny of the Caryophyllales based on structural and inverted repeat restriction-site variation. Systematic Botany 19: 236-252

Downie S. R. S. Ramanath D. S. Katz-Downie E. Llanas 1998b Molecular systematics of Apiaceae subfamily Apioideae: phylogenetic analyses of nuclear ribosomal DNA internal transcribed spacer and plastid rpoC1 intron sequences. American Journal of Botany 85: 563-591[Abstract]

Doyle J. J. J. L. Doyle J. D. Palmer 1995 Multiple independent losses of two genes and one intron from legume chloroplast genomes. Systematic Botany 20: 272-294

Edwards K. J. P. A. Gadek 2001 Evolution and biogeography of Alectryon (Sapindaceae). Molecular Phylogenetics and Evolution 20: 14-26

Eickbush T. H. 1999 Mobile introns: retrohoming by complete reverse splicing. Current Biology 9: R11-R14

Ems S. C. C. W. Morden C. K. Dixon K. H. Wolfe C. W. dePamphilis J. D. Palmer 1995 Transcription, splicing and editing of plastid RNAs in the nonphotosynthetic plant Epifagus virginiana. Plant Molecular Biology 29: 721-733[CrossRef][Web of Science][Medline]

Felsenstein J. 1978 Cases in which parsimony and compatibility methods will be positively misleading. Systematic Zoology 27: 401-410

Felsenstein J. 1983 Parsimony in systematics: biological and statistical issues. Annual Review of Ecology and Systematics 14: 313-333

Felsenstein J. 1985 Confidence limits on phylogenies: an approach using the bootstrap. Evolution 39: 783-791[CrossRef][Web of Science]

Felsenstein J. 2001 Taking variation of evolutionary rates between sites into account in inferring phylogenies. Journal of Molecular Evolution 53: 447-455

Freudenstein J. V. M. W. Chase 2001 Analysis of mitochondrial nad1b-c intron sequences in Orchidaceae: utility and coding of length-change characters. Systematic Botany 26: 643-657

Gatesy J. C. Hayashi R. DeSalle E. Vrba 1994 Rate limits for mispairing and compensatory change: the mitochondrial ribosomal DNA of antelopes. Evolution 48: 188-196[CrossRef][Web of Science]

Goddard M. R. A. Burt 1999 Recurrent invasion and extinction of a selfish gene. Proceedings of the National Academy of Sciences, USA 23: 13880-13885

Goldman N. 1990 Maximum likelihood inference of phylogenetic trees, with special reference to a Poisson process model of DNA substitution and to parsimony analyses. Systematic Zoology 39: 345-361

Graham S. W. R. G. Olmstead 2000 Utility of 17 chloroplast genes for inferring the phylogeny of the basal angiosperms. American Journal of Botany 87: 1712-1730[Abstract/Free Full Text]

Graham S. W. P. A. Reeves A. C. E. Burns R. G. Olmstead 2000 Microstructural changes in noncoding chloroplast DNA: interpretation, evolution, and utility of indels and inversions in basal angiosperm phylogenetic inference. International Journal of Plant Science 161: S83-S96

Guo H. T. M. Karberg M. Long J. P. Jones B. Sullenger A. M. Lambowitz 2000 Group II introns designed to insert into therapeutically relevant DNA target sites in human cells. Science 289: 452-457[Abstract/Free Full Text]

Gutell R. R. J. J. Cannone Z. Shang Y. Du M. J. Serra 2000 A story: unpaired adenosine bases in ribosomal RNAs. Journal of Molecular Biology 304: 335-354[CrossRef][Web of Science][Medline]

Harris-Kerr C. L. M. Zhang C. L. Peebles 1993 The phylogenetically predicted base-pairing interactions between alpha and alpha' is required for group II intron splicing in vitro. Proceedings of the National Academy of Sciences, USA 90: 10658-10662

Hendy M. D. D. Penny 1989 A framework for the quantitative study of evolutionary trees. Systematic Zoology 38: 297-309

Herdenberger F. V. Holländer U. Kuck 1994 Correct in vivo RNA splicing of a mitochondrial intron in algal chloroplasts. Nucleic Acids Research 22: 2869-2875[Free Full Text]

Hershkovitz M. A. E. A. Zimmer W. J. Hahn 1999 Ribosomal DNA sequences and angiosperm systematics. In P. M. Hollingsworth, R. M. Bateman, and R. J. Gornall [eds.], Molecular systematics and plant evolution, 268–326. Taylor & Francis, London, UK

Hickson R. E. C. Simon A. Cooper G. S. Spicer J. Sullivan D. Penny 1996 Conserved sequence motifs, alignment, and secondary structure for the third domain of animal 12S rRNA. Molecular Biology and Evolution 13: 150-169[Abstract]

Hickson R. E. C. Simon S. W. Perrey 2000 The performance of several multiple-sequence alignment programs in relation to secondary-structure features for an rRNA sequence. Molecular Biology and Evolution 17: 530-539[Abstract/Free Full Text]

Holländer V. U. Kück 1998 Splicing of the mitochondrial group-II intron rI1: conserved intron-exon interactions diminish splicing efficiency. Current Genetics 33: 117-123

Holländer V. U. Kück 1999 Group II intron splicing in chloroplasts: identification of mutations determining intron stability and fate of exon RNA. Nucleic Acids Research 27: 2345-2353[Abstract/Free Full Text]

Holst-Jensen A. M. Vaage T. Schumacher S. Johansen 1999 Structural characteristics and possible horizontal transfer of group I introns between closely related plant pathogenic fungi. Molecular Biology and Evolution 16: 114-126[Abstract]

Hoot S. B. J. D. Palmer 1994 Structural rearrangements, including parallel inversions, within the chloroplast genome of Anemone and related genera. Journal of Molecular Evolution 38: 274-281

Huelsenbeck J. P. R. Neilsen 1999 Effect of nonindependent substitution on phylogenetic accuracy. Systematic Biology 48: 317-328

Jacquier A. 1996 Group II introns: elaborate ribozymes. Biochimie 78: 474-487[Medline]

Jacquier A. Jacquesson-Breuleux 1991 Splice site selection and role of the lariat in a group II intron. Journal of Molecular Biology 219: 415-428[CrossRef][Web of Science][Medline]

Jacquier A. F. Michel 1987 Multiple exon-binding sites in class II self-splicing introns. Cell 50: 17-29[CrossRef][Web of Science][Medline]

Jacquier A. F. Michel 1990 Base-pairing interactions involving the 5' and 3'-terminal nucleotides of group II self-splicing introns. Journal of Molecular Biology 213: 437-447[CrossRef][Web of Science][Medline]

Jenkins B. D. J. Kulhanek A. Barkan 1997 Nuclear mutations that block group II RNA splicing in maize chloroplasts reveal several intron classes with distinct requirements for splicing factors. Cell 9: 283-296[Web of Science]

Jordan W. C. M. W. Courtney J. E. Neigel 1996 Low levels of intraspecific genetic variation at a rapidly evolving chloroplast DNA locus in North American duckweeds (Lemnaceae). American Journal of Botany 83: 430-439[CrossRef][Web of Science]

Jurica M. S. B. L. Stoddard 1999 Homing endonucleases: structure, function and evolution. Cellular and Molecular Life Sciences 55: 1304-1326

Katayama H. Y. Ogihara 1996 Phylogenetic affinities of the grasses to other monocots as revealed by molecular analysis of chloroplast DNA. Current Genetics 29: 572-581

Kelchner S. A. 2000 The evolution of noncoding chloroplast DNA and its application in plant systematics. Annals of the Missouri Botanical Gardens 87: 482-498[CrossRef][Web of Science]

Kelchner S. A. R. J. Bayer R. J. Chinnock M. D. Crisp J. G. West 2000 A data partition analysis of noncoding sequence evolution in Bontia and Myoporaceae. American Journal of Botany 87: 136 (Abstract)

Kelchner S. A. L. G. Clark 1997 Molecular evolution and phylogenetic utility of the chloroplast rpl16 intron in Chusquea and the Bambusoideae (Poaceae). Molecular Phylogenetics and Evolution 8: 385-397

Kelchner S. A. J. F. Wendel 1996 Hairpins create minute inversions in noncoding regions of chloroplast DNA. Current Genetics 30: 259-262

Kellogg E. A. N. D. Juliano 1997 The structure and function of RuBisCO and their implications for systematic studies. American Journal of Botany 84: 413-428[Abstract]

Kimura M. 1968 Evolutionary rate at the molecular level. Nature 217: 624-626[CrossRef][Medline]

Kimura M. 1983 The neutral theory of molecular evolution. Cambridge University Press, Cambridge, UK

Kjer L. M. 1995 Use of rRNA secondary structure in phylogenetic studies to identify homologous positions: an example of alignment and data presentation from the frogs. Molecular Phylogenetics and Evolution 4: 314-330

Knoop V. M. Altwasser A. Brennicke 1997 A tripartite group II intron in mitochondria of an angiosperm plant. Molecular and General Genetics 255: 269-276

Knoop V. A. Brennicke 1993 Group II introns in plant mitochondria—trans-splicing, RNA editing, evolution and promiscuity. In A. Brennicke and U. Kück [eds.], Plant mitochondria, 220–232. VCH, Weinheim, New York, New York, USA

Koch J. L. S. C. Boulanger S. D. Dib-Hajj S. K. Hebbar P. S. Perlman 1992 Group II introns deleted for multiple substructures retain self-splicing activity. Molecular and Cellular Biology 12: 1950-1958

Konforti B. B. Q. Liu A. M. Pyle 1998 A map of the binding site for catalytic domain 5 in the core of a group II intron ribozyme. EMBO Journal 17: 7105-7117[CrossRef][Web of Science][Medline]

Kraus F. L. Jarecki M. Miyamoto S. Tanhauser P. Laipis 1992 Mispairing and compensational changes during the evolution of mitochondrial ribosomal RNA. Molecular Biology and Evolution 9: 770-774[Web of Science][Medline]

Kuhner M. K. J. Felsenstein 1994 A simulation of phylogeny algorithms under equal and unequal evolutionary rates. Molecular Biology and Evolution 11: 459-468[Abstract]

Kwakman J. H. D. Konings H. J. Pel L. A. Grivell 1989 Structure-function relationships in a self-splicing group II intron: a large part of domain II of the mitochondrial intron aI5 is not essential for self-splicing. Nucleic Acids Research 17: 4205-4216[Abstract/Free Full Text]

Lai M. J. Sceppa J. A. Ballenger J. J. Doyle R. P. Wunderlin 1997 Polymorphism for the presence of the rpl2 intron in chloroplast genomes of Bauhinia (Leguminosae). Systematic Botany 22: 519-528

Lambowitz A. M. M. Belfort 1993 Introns as mobile genetic elements. Annual Review of Biochemistry 62: 587-622[CrossRef][Web of Science][Medline]

Laroche J. J. Bousquet 1999 Evolution of the mitochondrial rps3 intron in perennial and annual angiosperms and homology to nad5 intron 1. Molecular Biology and Evolution 16: 441-452[Abstract]

Lazowska J. B. Meunier C. Macadre 1994 Homing of a group II intron in yeast mitochondrial DNA is accompanied by unidirectional co-conversion of upstream-located markers. EMBO Journal 13: 4963-4972[Web of Science][Medline]

Learn G. H. J. J. S. Shore G. R. Furnier G. Zurawski M. T. Clegg 1992 Constraints on the evolution of plastid introns: the group II intron in the gene encoding tRNA-Val(UAC). Molecular Biology and Evolution 9: 856-871[Abstract]

Lee B.-Y. S. R. Downie 2000 Phylogenetic analysis of cpDNA restriction sites and rps16 intron sequences reveals relationships among Apiaceae tribes Caucalideae, Scandiceae and related taxa. Plant Systematics and Evolution 221: 35-60

Lee J. T. Hymowitz 2001 A molecular phylogenetic study of the subtribe Glycininae (Leguminosae) derived from the chloroplast DNA rps16 intron sequences. American Journal of Botany 88: 2064-2073[Abstract/Free Full Text]

Lewis P. O. 1998 Maximum likelihood as an alternative to parsimony for inferring phylogeny using nucleotide sequence data. In D. E. Soltis, P. S. Soltis, and J. J. Doyle [eds.], Molecular systematics of plants II: DNA sequencing, 132–163. Kluwer Academic, Boston, Massachusetts, USA

Liden M. T. Fukuhara J. Rylander B. Oxelman 1997 Phylogeny and classification of Fumariaceae, with emphasis on Dicentra s.l., based on the plastid gene rps16 intron. Plant Systematics and Evolution 206: 411-420

Liston A. J. A. Wheeler 1994 The phylogenetic position of the genus Astragalus (Fabaceae): evidence from the chloroplast genes rpoC1 and rpoC2. Biochemical Systematics and Ecology 22: 377-388

Lockhart P. J. A. W. Larkum M. A. Steel P. J. Waddell D. Penny 1996 Evolution of chlorophyll and bacteriochlorophyll: the problem of invariant sites in sequence analysis. Proceedings of the National Academy of Sciences, USA 93: 1930-1934

Lockhart P. J. M. A. Steel M. D. Hendy D. Penny 1994 Recovering evolutionary trees under a more realistic model of sequence evolution. Molecular Biology and Evolution 11: 605-612[Web of Science][Medline]

Lutzoni F. P. Wagner V. Reeb S. Zoller 2000 Integrating ambiguously aligned regions of DNA sequences in phylogenetic analyses without violating positional homology. Systematic Biology 49: 628-651

Lydeard C. W. E. Holznagel M. N. Schnare R. R. Gutell 2000 Phylogenetic analysis of moluscan mitochondrial LSU rDNA sequences and secondary structures. Molecular Phylogenetics and Evolution 15: 83-102

Maddison D. R. M. D. Baker K. A. Ober 1999 Phylogeny of carabid beetles as inferred from 18S ribosomal DNA (Coleoptera: Carabidae). Systematic Entomology 24: 103-138

Malek O. V. Knoop 1998 Trans-splicing group II introns in plant mitochondria: the complete set of cis-arranged homologs in ferns, fern allies, and a hornwort. RNA 4: 1599-1609[Abstract]

McClellan D. A. 2000 The codon-degeneracy model of molecular evolution. Journal of Molecular Evolution 50: 131-140

Michel F. B. Dujon 1983 Conservation of RNA secondary structures in two intron families including mitochondrial-, chloroplast- and nuclear-encoded members. EMBO Journal 2: 33-38[Web of Science][Medline]

Michel F. J.-L. Ferat 1995 Structure and activities of group II introns. Annual Review of Biochemistry 64: 435-461[CrossRef][Web of Science][Medline]

Michel F. K. Umesono H. Ozeki 1989 Comparative and functional anatomy of group II catalytic introns—a review. Gene 82: 5-30[CrossRef][Web of Science][Medline]

Mikheeva S. H. L. Murray H. Zhou B. M. Turczyk K. A. Jarrell 2000 Deletion of a conserved dinucleotide inhibits the second step of group II intron splicing. RNA 6: 1509-1515[Abstract]

Miyamoto M. M. M. W. Allard R. M. Adkins L. L. Lanecek R. L. Honeycutt 1994 A congruence test of reliability using linked mitochondrial DNA sequences. Systematic Biology 43: 236-249

Mohr G. P. S. Perlman A. M. Lambowitz 1993 Evolutionary relationships among group II intron-encoded proteins and identification of a conserved domain that may be related to maturase function. Nucleic Acids Research 21: 4991-4997[Abstract/Free Full Text]

Mohr G. D. Smith M. Belfort A. M. Lambowitz 2000 Rules for DNA target-site recognition by a lactococcal group II intron enable retargeting of the intron to specific DNA sequences. Genes and Development 14: 559-573

Moran J. V. S. Zimmerly R. Eskes J. C. Kennell A. M. Lambowitz R. A. Butow P. S. Perlman 1995 Mobile group II introns of yeast mitochondrial-DNA are novel site-specific retroelements. Molecular and Cellular Biology 15: 2828-2838

Morrison D. A. J. T. Ellis 1997 Effects of nucleotide sequence alignment on phylogeny estimation: a case study of 18S rDNAs of Apicomplexa. Molecular Biology and Evolution 14: 428-441[Abstract]

Mort M. E. P. S. Soltis D. E. Soltis M. L. Mabry 2000 Comparison of three methods for estimating internal support on phylogenetic trees. Systematic Biology 49: 160-170

Mueller M. W. M. Allmaier R. Eskes R. J. Schweyen 1993 Transposition of group II intron aI1 in yeast and invasion of mitochondrial genes at new locations. Nature 366: 174-176[CrossRef][Medline]

Muse S. V. 1995 Evolutionary analyses of DNA sequences subject to constraints on secondary structure. Genetics 139: 1429-1439[Abstract]

Olmstead R. G. C. W. dePamphilis A. D. Wolfe N. D. Young W. J. Elisons P. A. Reeves 2001 Disintegration of the Scrophulariaceae. American Journal of Botany 88: 348-361[Abstract/Free Full Text]

Olmstead R. G. P. A. Reeves A. C. Yen 1998 Patterns of sequence evolution and implications for parsimony analysis of chloroplast DNA. In D. S. Soltis, P. S. Soltis, and J. J. Doyle [eds.], Molecular systematics of plants II: DNA sequencing, 164–187. Chapman and Hall, New York, New York, USA

Oxelman B. M. Backlund B. Bremer 1999 Relationships of the Buddlejaceae s.l. investigated using parsimony jackknife and branch support analysis of chloroplast ndhF and rbcL sequence data. Systematic Botany 24: 164-182

Oxelman B. M. Liden D. Berglund 1997 Chloroplast rps16 intron phylogeny of the tribe Sileneae (Caryophyllaceae). Plant Systematics and Evolution 206: 393-410

Page R. D. M. E. C. Holmes 1998 Molecular evolution: a phylogenetic approach. Blackwell Science, Cambridge, UK

Palmer J. D. K. L. Adams Y. Cho C. L. Parkinson Y.-L. Qiu K. Song 2000 Dynamic evolution of plant mitochondrial genomes: mobile genes and introns and highly variable mutation rates. Proceedings of the National Academy of Sciences, USA 97: 6960-6966

Palmer J. D. C. F. Delwiche 1998 The origin and evolution of plastids and their genomes. In D. E. Soltis, P. S. Soltis, and J. J. Doyle [eds.], Molecular systematics of plants II: DNA sequencing, 375–409. Kluwer Academic, Boston, Massachusetts, USA

Peebles C. L. P. S. Perlman K. L. Mecklenberg M. L. Petrillo J. H. Tabor K. A. Jarrell H.-L. Cheng 1986 A self-splicing RNA excises an intron lariat. Cell 44: 213-223[CrossRef][Web of Science][Medline]

Peebles C. L. M. Zhang P. S. Perlman J. S. Franzen 1995 Catalytically critical nucleotides in domain 5 of a group II intron. Proceedings of the National Academy of Sciences, USA 92: 4422-4426

Persson C. 2000 Phylogeny of the Gardenieae (Rubiaceae) based on chloroplast DNA sequences from the rps16 intron and trnL(UAA)-trnF(GAA) intergenic spacer. Nordic Journal of Botany 20: 257-269

Pfeil B. E. C. L. Brubaker L. A. Craven M. D. Crisp 2002 Phylogeny of Hibiscus and the tribe Hibisceae (Malvaceae) using chloroplast DNA sequences of ndhF and the rpl16 intron. Systematic Botany 27: 333-350

Piesschaert F. L. Andersson S. Jansen S. Dessein E. Robbrecht E. Smets 2000 Searching for the taxonomic position of the African genus Colletoecema (Rubiaceae): morphology and anatomy compared to an rps16-intron analysis of the Rubioideae. Canadian Journal of Botany 78: 288-304[CrossRef]

Podar M. P. S. Perlman 1999 Photocrosslinking of 4-thio uracil-containing RNAs supports a side-by-side arrangement of domains 5 and 6 of a group II intron. RNA 5: 318-329[Abstract]

Podar M. P. S. Perlman R. A. Padgett 1998 The two steps of group II intron self-splicing are mechanistically distinguishable. RNA 4: 890-900[Abstract]

Popp M. B. Oxelman 2001 Inferring the history of the polyploid Silene aegaea (Caryophyllaceae) using plastid and homoeologous nuclear DNA sequences. Molecular Phylogenetics and Evolution 20: 474-481

Posada D. K. A. Crandall 2001 Selecting the best-fit model of nucleotide substitution. Systematic Biology 50: 580-601

Pyle A. M. 1996 Catalytic reaction mechanisms and structural features of group II intron ribozymes. In F. Eckstein and D. M. J. Lilley [eds.], Catalytic RNA, 75–107. Springer Verlag, New York, New York, USA

Reeves J. H. 1992 Heterogeneity in the substitution process of amino acid sites of proteins coded for by mitochondrial DNA. Journal of Molecular Evolution 35: 17-31

Renner S. S. 1999 Circumscription and phylogeny of the Laurales: evidence from molecular and morphological data. American Journal of Botany 86: 1301-1315[Abstract/Free Full Text]

Renner S. S. A. S. Chanderbali 2000 What is the relationship among Hernandiaceae, Lauraceae, and Monimiaceae, and why is this question so difficult to answer?. International Journal of Plant Sciences 161: S109-S119

Renner S. S. D. B. Foreman D. Murray 2000 Timing transantarctic disjunctions in the Atherospermataceae (Laurales): evidence from coding and noncoding chloroplast sequences. Systematic Biology 49: 579-591

Ritland K. J. E. Eckenwalder 1992 Polymorphism, hybridization, and variable evolutionary rate in molecular phylogenies. In P. S. Soltis, D. E. Soltis, and J. J. Doyle [eds.], Molecular systematics of plants, 404–428. Chapman and Hall, New York, New York, USA

Rousset F. M. Pelandakis M. Solignoc 1991 Evolution of compensatory substitutions through G-U intermediate state in Drosophila rRNA. Proceedings of the National Academy of Sciences, USA 88: 10032-10036

Sainsard-Chanet A. O. Begel Y. d'Aubenton-Carafa 1998 Two co-existing mechanisms account for the large-scale deletions of mitochondrial DNA in Podospora anserina that involve the 5' border of a group-II intron. Current Genetics 34: 326-335

Saldanha R. G. Mohr M. Belfort A. M. Lambowitz 1993 Group-I and group-II introns. FASEB Journal 7: 15-24[Abstract]

Samuel R. F. Ehrendorfer M. W. Chase H. Greger 2001 Phylogenetic analyses of Aurantioideae (Rutaceae) based on noncoding plastid DNA sequences and phytochemical features. Plant Biology 3: 77-87

Sanderson M. J. 1995 Objections to bootstrapping phylogenies: a critique. Systematic Biology 44: 299-320

Sanderson M. J. J. Kim 2000 Parametric phylogenetics?. Systematic Biology 49: 817-829

Schmelzer C. M. W. Müller 1987 Self-splicing of group II introns in vitro: lariat formation and 3'-splice site selection in mutant RNAs. Cell 51: 753-762[CrossRef][Web of Science][Medline]

Schmelzer C. R. J. Schweyen 1986 Self-splicing of group II introns in vitro: mapping of the branch point and mutational inhibition of lariat formation. Cell 46: 557-565[CrossRef][Web of Science][Medline]

Schnabel A. J. F. Wendel 1998 Cladistic biogeography of Gleditsia (Leguminosae) based on ndhF and rpl16 chloroplast gene sequences. American Journal of Botany 85: 1753-1765[Abstract/Free Full Text]

Schuster W. A. Brennicke 1994 The plant mitochondrial genome: physical structure, information content, RNA editing, and gene migration to the nucleus. Annual Review of Plant Physiology and Plant Molecular Biology 45: 61-78

Schwarzbach A. E. J. W. Kadereit 1995 Rapid radiation of North American desert genera of the Papaveraceae: evidence from restriction site mapping of PCR-amplified chloroplast DNA fragments. Plant Systematics and Evolution 9: 159-170

Seelanan T. C. L. Brubaker J. M. Stewart L. A. Craven J. F. Wendel 1999 Molecular systematics of Australian Gossypium section Grandicalyx (Malvaceae). Systematic Botany 24: 183-208

Sellem C. H. O. Begel A. Sainsard-Chanet 2000 Recombinant mitochondrial DNA molecules suggest a template switching ability for group-II-intron reverse transcriptase. Current Genetics 37: 24-28

Simmons M. P. H. Ochoterena 2000 Gaps as characters in sequence-based phylogenetic analyses. Systematic Biology 49: 369-381

Small R. L. J. A. Ryburn R. C. Cronn T. Seelanan J. F. Wendel 1998 The tortoise and the hare: choosing between noncoding plastome and nuclear ADH sequences for phylogeny reconstruction in a recently diverged plant group. American Journal of Botany 85: 1301-1315[Abstract/Free Full Text]

Soltis P. S. D. E. Soltis 1998 Molecular evolution of 18S rDNA in angiosperms: implications for character weighting in phylogenetic analysis. In D. E. Soltis, P. S. Soltis, and J. J. Doyle [eds.], Molecular sytematics of plants II: DNA sequencing, 188–210. Kluwer Academic, Boston, Massachusetts, USA

Springer M. L. J. Hollar A. Burk 1995 Compensatory substitutions and the evolution of the mitochondrial 12S rRNA gene in mammals. Molecular Biology and Evolution 12: 1138-1150[Abstract]

Steel M. D. Huson P. J. Lockhart 2000 Invariable sites models and their use in phylogeny reconstruction. Systematic Biology 49: 225-232

Steel M. D. Penny 2000 Parsimony, likelihood, and the role of models in molecular phylogenetics. Molecular Biology and Evolution 17: 839-850[Abstract/Free Full Text]

Sullivan J. K. E. Holsinger C. Simon 1996 The effect of topology on estimates of among-site rate variation. Journal of Molecular Evolution 42: 308-312

Sullivan J. D. L. Swofford 2001 Should we use model-based methods for phylogenetic inference when we know that assumptions about among-site rate variation and nucleotide substitution pattern are violated?. Systematic Biology 50: 723-729

Swofford D. L. 1998 PAUP*: phylogenetic analysis using parsimony (* and other methods). Sinauer Associates, Sunderland, Massachusetts, USA

Swofford D. L. G. J. Olsen P. J. Waddell D. M. Hillis 1996 Phylogenetic inference. In D. M. Hillis, C. Moritz, and B. Mable [eds.], Molecular systematics, 407–514. Sinauer, Sunderland, Massachusetts, USA

Tanner N. K. 1999 Ribozymes: the characteristics and properties of catalytic RNAs. FEMS Microbiology Reviews 23: 257-275

Till B. C. Schmitz-Linneweber R. Williams-Carrier A. Barkan 2001 CRS1 is a novel group II intron splicing factor that was derived from a domain of ancient origin. RNA 7: 1227-1238[Abstract]

Toor N. G. Hausner S. Zimmerly 2001 Coevolution of group II intron RNA structures with their intron-encoded reverse transcriptases. RNA 7: 1142-1152[Abstract]

Tufféry P. P. Darlu 2000 Exploring a phylogenetic approach for the detection of correlated substitutions in proteins. Molecular Biology and Evolution 17: 1753-1759[Abstract/Free Full Text]

Tuffley C. M. Steel 1997 Links between maximum likelihood and maximum parsimony under a simple model of site substitution. Bulletin of Mathematical Biology 59: 581-607

Turmel M. V. Cote C. Otis J. P. Mercer M. W. Gray K. M. Lonergan C. Lemieux 1995 Evolutionary transfer of ORF-containing group-I introns between different subcellular compartments (chloroplast and mitochondrion). Molecular Biology and Evolution 12: 533-545[Abstract]

Varani G. W. H. McClain 2000 The G-U wobble base pair: a fundamental building block of RNA structure crucial to RNA function in diverse biological systems. EMBO Reports 1: 18-23[CrossRef][Web of Science][Medline]

Vawter L. W. M. Brown 1993 Rates and patterns of base change in the small subunit ribosomal RNA gene. Genetics 134: 597-608[Abstract]

Vogel J. T. Börner W. R. Hess 1999 Comparative analysis of splicing of the complete set of chloroplast group II introns in three higher plant mutants. Nucleic Acids Research 27: 3866-3874[Abstract/Free Full Text]

Wallander E. V. A. Albert 2000 Phylogeny and classification of Oleaceae based on rps16 and trnL-F sequence data. American Journal of Botany 87: 1827-1841[Abstract/Free Full Text]

Wang X.-R. Y. Tsumura H. Yoshimaru K. Nagasaka A. E. Szmidt 1999 Phylogenetic relationships of Eurasian pines (Pinus, Pinaceae) based on chloroplast rbcL, matK, rpl20-rps18 spacer, and trnV intron sequences. American Journal of Botany 86: 1742-1753[Abstract/Free Full Text]

Wank H. J. SanFilippo R. N. Singh M. Matsuura A. M. Lambowitz 1999 A reverse transcriptase/maturase promotes splicing by binding at its own coding segment in a group II intron RNA. Molecular Cell 4: 239-250

Wanntorp L. H. E. Wanntorp B. Oxelman M. Kallersjo 2001 Phylogeny of Gunnera. Plant Systematics and Evolution 226: 85-107

Watson L. E. H. Sayed-Ahmed A. Badr 2000 Molecular phylogeny of Old World Trifolium (Fabaceae), based on plastid and nuclear markers. Plant Systematics and Evolution 224: 153-171

Wesson D. M. C. H. Porter F. H. Collins 1992 Sequence and secondary structure comparisons of ITS rDNA in mosquitoes (Diptera: Culicidae). Molecular Phylogenetics and Evolution 1: 253-269

Wheeler W. C. R. L. Honeycutt 1988 Paired sequence difference in ribosomal RNAs: evolutionary and phylogenetic implications. Molecular Biology and Evolution 5: 90-96[Abstract]

Wilgenbusch J. K. deQueiroz 2000 Phylogenetic relationships among the Phrynosomatid sand lizards inferred from mitochondrial DNA sequences generated by heterogeneous evolutionary processes. Systematic Biology 49: 592-612

Winship P. R. 1989 An improved method for direct sequencing PCR amplified material using dimethyl sulfoxide. Nucleic Acids Research 17: 1266[Free Full Text]

Wissinger B. W. Schuster A. Brennicke 1991 Trans-splicing in Oenothera mitochondria: nad1 mRNAs are edited in exon and trans-splicing group II intron sequences. Cell 65: 473-482[CrossRef][Web of Science][Medline]

Wolfe K. H. W.-H. Li P. M. Sharp 1987 Rates of nucleotide substitution vary greatly among plant mitochondrial, chloroplast, and nuclear DNAs. Proceedings of the National Academy of Sciences, USA 84: 9054-9058

Wolfe K. H. C. W. Morden J. D. Palmer 1992 Function and evolution of a minimal plastid genome from a nonphotosynthetic parasitic plant. Proceedings of the National Academy of Sciences, USA 89: 10648-10652

Yang J. G. Mohr P. S. Perlman A. M. Lambowitz 1998 Group II intron mobility in yeast mitochondria—target DNA-primed reverse transcription activity of al1 and reverse splicing into DNA transposition sites in vitro. Journal of Molecular Biology 282: 505-523[CrossRef][Web of Science][Medline]

Yang Z. 1994 Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: approximate methods. Journal of Molecular Evolution 39: 306-314

Yang Z. 1996 Among-site rate variation and its impact on phylogenetic analyses. Trends in Ecology and Evolution 11: 367-372

Yang Z. N. Goldman A. Friday 1994 Comparison of models for nucleotide substitution used in maximum likelihood phylogenetic estimation. Molecular Biology and Evolution 11: 316-324[Abstract]

Young N. D. C. W. dePamphilis 2000 Purifying selection detected in the plastid gene matK and flanking ribozyme regions within a group II intron of nonphotosynthetic plants. Molecular Biology and Evolution 17: 1933-1941[Abstract/Free Full Text]

Zhang W. 2000 Phylogeny of the grass family (Poaceae) from rpl16 intron sequence data. Molecular Phylogenetics and Evolution 15: 135-146

Zhang W. C. Reading A. B. Deisseroth 1992 Improved PCR sequencing with formamide. Trends in Genetics 8: 332

Zimmerly S. H. Guo R. Eskes J. Yang P. S. Perlman A. M. Lambowitz 1995a A group II intron RNA is a catalytic component of a DNA endonuclease involved in intron mobility. Cell 83: 529-538[CrossRef][Web of Science][Medline]

Zimmerly S. H. Guo P. S. Perlman A. M. Lambowitz 1995b Group II intron mobility occurs by target DNA-primed reverse transcriptase. Cell 82: 545-554[CrossRef][Web of Science][Medline]

Zuker M. 1989 On finding all suboptimal foldings of an RNA molecule. Science 244: 48-52[Abstract/Free Full Text]


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Facebook Facebook   Add to Reddit Reddit   Add to Technorati Technorati   Add to Twitter Twitter    What's this?


This article has been cited by other articles:


Home page
Mol Biol EvolHome page
D. M. Simon, S. A. Kelchner, and S. Zimmerly
A Broadscale Phylogenetic Analysis of Group II Intron RNAs and Intron-Encoded Reverse Transcriptases
Mol. Biol. Evol., December 1, 2009; 26(12): 2795 - 2808.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Bot.Home page
K. J. Wurdack and C. C. Davis
Malpighiales phylogenetics: Gaining ground on one of the most recalcitrant clades in the angiosperm tree of life
Am. J. Botany, August 1, 2009; 96(8): 1551 - 1570.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Bot.Home page
R. J. Bayer, D. J. Mabberley, C. Morton, C. H. Miller, I. K. Sharma, B. E. Pfeil, S. Rich, R. Hitchcock, and S. Sykes
A molecular phylogeny of the orange subfamily(Rutaceae: Aurantioideae) using nine cpDNA sequences
Am. J. Botany, March 1, 2009; 96(3): 668 - 685.
[Abstract] [Full Text] [PDF]


Home page
RNAHome page
D. M. Simon, N. A.C. Clarke, B. A. McNeil, I. Johnson, D. Pantuso, L. Dai, D. Chai, and S. Zimmerly
Group II introns in Eubacteria and Archaea: ORF-less introns and new varieties
RNA, September 1, 2008; 14(9): 1704 - 1713.
[Abstract] [Full Text] [PDF]


Home page
ANN BOT (LOND)Home page
C. Figueroa, G. A. Salazar, H. A. Zavaleta, and E. M. Engleman
Root Character Evolution and Systematics in Cranichidinae, Prescottiinae and Spiranthinae (Orchidaceae, Cranichideae)
Ann. Bot., March 1, 2008; 101(4): 509 - 520.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Bot.Home page
M. J. Moore, A. Tye, and R. K. Jansen
Patterns of long-distance dispersal in Tiquilia subg. Tiquilia (Boraginaceae): implications for the origins of amphitropical disjuncts and Galapagos Islands endemics
Am. J. Botany, August 1, 2006; 93(8): 1163 - 1177.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
G. Hausner, R. Olson, D. Simon, I. Johnson, E. R. Sanders, K. G. Karol, R. M. McCourt, and S. Zimmerly
Origin and Evolution of the Chloroplast trnK (matK) Intron: A Model for Evolution of Group II Intron RNA Structures
Mol. Biol. Evol., February 1, 2006; 23(2): 380 - 391.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
C. Lohne and T. Borsch
Molecular Evolution and Phylogenetic Utility of the petD Group II Intron: A Case Study in Basal Angiosperms
Mol. Biol. Evol., February 1, 2005; 22(2): 317 - 332.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Bot.Home page
J. Shaw, E. B. Lickey, J. T. Beck, S. B. Farmer, W. Liu, J. Miller, K. C. Siripun, C. T. Winder, E. E. Schilling, and R. L. Small
The tortoise and the hare II: relative utility of 21 noncoding chloroplast DNA sequences for phylogenetic analysis
Am. J. Botany, January 1, 2005; 92(1): 142 - 166.
[Abstract] [Full Text] [PDF]


Home page
Syst BiolHome page
M. Popp and B. Oxelman
Evolution of a RNA Polymerase Gene Family in Silene (Caryophyllaceae)--Incomplete Concerted Evolution and Topological Congruence Among Paralogues
Syst Biol, December 1, 2004; 53(6): 914 - 932.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Bot.Home page
S. S. Renner
Bayesian analysis of combined chloroplast loci, using multiple calibrations, supports the recent arrival of Melastomataceae in Africa and Madagascar
Am. J. Botany, September 1, 2004; 91(9): 1427 - 1435.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Bot.Home page
A. Weeks and B. B. Simpson
Molecular genetic evidence for interspecific hybridization among endemic Hispaniolan Bursera (Burseraceae)
Am. J. Botany, June 1, 2004; 91(6): 976 - 984.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
D. M. Crayn, K. Winter, and J. A. C. Smith
Multiple origins of crassulacean acid metabolism and the epiphytic habit in the Neotropical family Bromeliaceae
PNAS, March 9, 2004; 101(10): 3703 - 3708.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Bot.Home page
E. R. Sanders, K. G. Karol, and R. M. McCourt
Occurrence of matK in a trnK group II intron in charophyte green algae and phylogeny of the Characeae
Am. J. Botany, April 1, 2003; 90(4): 628 - 633.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Web of Science (57)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Kelchner, S. A.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Kelchner, S. A.
Agricola
Right arrow Articles by Kelchner, S. A.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Facebook   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS