Am. J. Bot. Li-Cor Advertisement
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


  Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Facebook   Add to Reddit   Add to Technorati   Add to Twitter
What's this?
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Web of Science (1)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Barthelson, R. A.
Right arrow Articles by Woosley, R. L.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Barthelson, R. A.
Right arrow Articles by Woosley, R. L.
Agricola
Right arrow Articles by Barthelson, R. A.
Right arrow Articles by Woosley, R. L.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Facebook   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?
(American Journal of Botany. 2006;93:566-574.)
© 2006 Botanical Society of America, Inc.


Phytochemistry

Development of a comprehensive detection method for medicinal and toxic plant species1

Roger A. Barthelson4, Padma Sundareshan, David W. Galbraith and Raymond L. Woosley

2Department of Plant Sciences, University of Arizona, Tucson, Arizona 85721 USA; and 3 Department of Medicine, Sarver Heart Center, University of Arizona, Tucson, Arizona 85721 USA

Received for publication April 18, 2005. Accepted for publication January 31, 2006.

ABSTRACT

Pharmacologically active ingredients in plants can cause significant morbidity through their increasingly common use in herbal alternative medicines and dietary supplements. Monitoring consumer products for the presence of toxic plants is encumbered by the lack of rapid and specific assays. To create a sensitive, reliable, fast, and broad-spectrum assay for medicinal or toxic plant species, we tested multiplexed ligation-dependent probe amplification (MLPA), which requires partial genomic DNA sequences from species of plants that are not well represented in currently available genetic databases. Genomic DNA was obtained from 21 species of medicinal and/or toxic plants. The PCR products were amplified from these plants and cloned for sequencing. The MLPA method was successful with DNA samples from many different species. The use of a microarray to facilitate screening of potentially thousands of plants in a single assay also was successful. The combination of the specificity of the MLPA assay with the broad-scale capabilities of microarray technology should make this an especially useful tool in screening in foods and commercial herbal preparations to identify the plant compounds actually present. Other applications could potentially extend to the identification of any plant species in samples for academic botanical studies and for biodefense and forensics applications.

Key Words: CODEHOP • cytochrome P450 • dietary supplement • medicinal plants • microarray • multiplexed ligation-dependent probe amplification (MLPA) • toxic plants

Plants are unique in their ability to produce an extraordinary array of different secondary metabolites. Many of these metabolites have medicinal and/or toxic attributes. The medicinal qualities of plants have been valued by man for centuries, and this tradition continues today in the increasing use of alternative medicines and dietary supplements in our society.

With increased consumption of herbal preparations comes an increased risk of toxic side effects, because of the intended and sometimes unintended plant species that may be found in these preparations. Many herbal preparations are derived from plants collected under uncontrolled conditions, and the collectors may have limited skill in the identification of plant species. Accidental poisonings occur because of mistakes and because of the unwitting use of related herbs that have different properties. The identification of herbs can be confused by the multiple and overlapping names used to describe them (Chan, 2003 ). For example, toxicity was reported for infants accidentally exposed to Japanese star anise, which was confused with Chinese star anise, which is not toxic (Ize-Ludlow et al., 2004 ).

A diverse range of plants may be included in herbal preparations, and many of them can have serious toxic side effects. Ephedra use has documented effects on cardiovascular health, including death, which has resulted in its sale being prohibited in the United States (Naik and Freudenberger, 2004 ; Soni et al., 2004 ). Some herbal preparations cause serious drug interactions, especially by altering expression of liver enzymes (Izzo et al., 2002 ; Karliova et al., 2000 ; Thabrew et al., 2004 ). Alternative medicines and supplements have been linked also with hepatotoxicity, nephrotoxicity, neurotoxicity, carcinogenicity, and allergic reactions (Dasgupta, 2003 ; Stedman, 2002 ).

The use and study of herbal preparations would benefit from a comprehensive method for identifying the species of plants contained in them. To develop such a method, we chose the multiplexed ligation-dependent probe amplification (MLPA) method (Schouten et al., 2002 ) in conjunction with microarray technology. The MLPA method uses a ligation reaction for strong specificity, coupled with a PCR amplification step for sensitivity. The ligation reaction joins two oligonucleotide probes that match one continuous sequence of plant genomic DNA. The PCR step uses a single pair of primers that match primer-binding sites added to the outer ends of the ligation probes. The MLPA has some capacity for multiplexing by combining multiple pairs of ligation probes in a single tube reaction. The addition of a microarray step simplifies the expansion of the number of plant species detected with the assay. The microarray consists of oligonucleotides that match each of the possible ligation products, so that these products may be identified upon hybridization. Because DNA microarrays can have tens of thousands of DNA oligonucleotides printed on them, they provide unmatched breadth for parallel testing.

For this method, species-specific genomic DNA sequences are needed for each plant to be identified. Many plant species have few gene sequences in the public databases or none at all. Some gene sequences are so well conserved between species that they are not useful for this application. Thus, gene samples are needed from many different species of plants. In our studies, PCR was used to amplify gene sequences from the various plant species to be included ultimately in a comprehensive assay. We chose to target the genes for cytochrome P450 enzymes, because of several advantageous properties. They are ubiquitous and heavily represented in plants. They are conserved within some regions of the molecule and widely divergent in others. Some of the cytochrome P450 enzymes are involved specifically in the synthesis of the very unique and diverse secondary metabolites that are of toxic or medicinal interest in some plants. We selected the CYP71 family of enzymes to target, because many of these, and their relatives within the CYP71 clan, are involved in the synthesis of defensive compounds and other secondary metabolites (Morant et al., 2003 ; Nelson et al., 2004 ).

To amplify cytochrome P450 gene fragments from plants, we used the consensus-degenerate hybrid oligonucleotide primers (CODEHOP) program to design the PCR primers (Rose et al. 2003 ; Rose et al. 1998 ). This program (http://blocks.fhcrc.org/blocks/codehop.html) uses blocks of conserved protein sequence to design primers and has been employed successfully to amplify plant cytochrome P450 sequences (Morant et al., 2002 ). We applied the CODEHOP method to design PCR primers, amplified samples of genomic DNA, and cloned and sequenced the resulting PCR products. We used the sequences to design probes for the MLPA assay and successfully tested these probes. This approach is an effective way to rapidly accumulate sequence information from the long list of plants that will be included in a multiplexed assay.

Ultimately, CODEHOP may help rapidly accumulate hundreds or thousands of genomic DNA sequence samples from diverse plant species. The sequence samples will allow for designing MLPA probes for detecting the plants, and a custom microarray will facilitate parallel testing of potentially thousands of species in a single tube by providing a streamlined procedure for designing the probes and for reading the results.

MATERIALS AND METHODS

Plant species
Plants used traditionally as alternative medicines or as dietary supplements were included, as well as species that one would wish to exclude from these products: Aconitum napellus Herb., (Ranunculaceae); Arabidopsis thaliana L., (Brassicaceae); Artemesia absinthium L., (Asteraceae); Artemisia vulgaris L. (Asteraceae); Atropa belladonna L. (Solanaceae); Capsicum annuum var. glabriusculum L. (Dunal) Heiser & Pickersgill, (Solanaceae); Caulophyllum thalictroides L. Michx. (Berberidaceae); Citrus aurantium L. (Rutaceae); Datura metel L. (Solanaceae); Digitalis lanata Ehrh. (Scrophulariaceae); Echinacea angustifolia, DC., (Asteraceae); Ephedra viridis Coville, (Ephedraceae); Glycyrrhiza uralensis Fisch. ex DC. (Fabaceae); Hypericum perforatum L. (Clusiaceae); Lawsonia inermis L. (Lythraceae); Lobelia inflata L. (Campanulaceae); Mentha pulegium L. (Lamiaceae); Symphytum officinale L. (Boraginaceae); Tanacetum vulgare L. (Asteraceae); Teucrium canadense L. (Lamiaceae); Teucrium chamaedrys L. (Lamiaceae); Tussilago farfara L. (Asteraceae).

Cultivation of plants
Plant specimens and seeds were obtained from Companion Plants, Athens, Ohio, USA; Crimson Sage Nursery, Colton, Oregon, USA; and Richters Herbs, Goodwood, Ontario, Canada. Plants were grown in peat-based potting soil in artificial light. Watering schedules and light intensity were varied to provide conditions appropriate to each species.

Use of the CODEHOP program
Degenerate PCR primers were designed with the aid of the CODEHOP program. The primers were derived from 24 different protein sequences for CYP71, selected from diverse plant species. The program employs a codon usage table to reverse translate the amino acid sequences to DNA sequences. Six different codon usage tables were used to represent six different families of plants: Brassicaceae (Arabidopsis), Asteraceae, Rutaceae (citrus), Fabaceae, Scrophulariaciae, and Solanaceae. The codon usage table for each group (or a representative for the group) was used to generate 2–3 different primers each, for a total of 17 different primers (Table 1). The primers were generated from three different sections of the CYP71 sequence, all near the heme-binding domain of cytochrome P450. For each plant family represented, one primer was chosen to be a sense strand primer (those including the letter C), and two others to be antisense strand primers (D and E, Fig. 1). Each primer was chosen from a group of primer candidates provided by the CODEHOP program. The final sequence of each primer was determined after examining its suitability for PCR, with the aid of Vector NTI (Invitrogen, Carlsbad, California, USA). The primer pairs employed in these studies are given in Table 2.


View this table:
[in this window]
[in a new window]
 
Table 1. CODEHOP primers for PCR amplification of fragments of genomic DNA from targeted plant species

 

Figure 1
View larger version (4K):
[in this window]
[in a new window]
 
Fig. 1. Schematic diagram of theoretical CODEHOP binding sites for primers 71arabCs (C), 71arabDas (D), and 71arabEas (E), and the heme-binding region (Umemoto et al., 1993 ) of Arabidopsis thaliana CYP71A2

 

View this table:
[in this window]
[in a new window]
 
Table 2. PCR primer pairs for the amplification of segments of genomic DNA from targeted plant species

 
Amplification of cytochrome P450 sequences
Genomic DNA samples were prepared with DNeasy kits (Qiagen, Valencia, California, USA). The PCR reactions were performed in 96-well microplates with a touchdown protocol that used two consecutive sets of cycles: (1) 22 cycles of 94°C for 45 s, 70°C (decreased 1° each cycle) for 2 min, and 72°C for 5 min and (2) 22 cycles of 94°C for 45 s, 55°C for 2 min, and 72°C for 5 min. The amplification was completed with an additional 5 min at 72°C. The amplification mixture contained Herculase DNA polymerase (Stratagene,, La Jolla, California, USA) and the accompanying buffer, 0.2 µmol/L dNTPs, 1 µmol/L primers designed by the CODEHOP protocol, and approximately 500 ng of genomic DNA in 50 µL. Multiple reactions were performed simultaneously in 96-well microplates in different combinations of primer pairs and genomic DNA. The PCR products were separated by electrophoresis on 2% agarose gels in TAE buffer, and bands of the appropriate size were excised and purified with the Qiagen MinElute kit.

Cloning and sequencing
The PCR products were cloned with the Zero Blunt Topo kit (Invitrogen, Carlsbad, California, USA). Bacterial colonies were screened by PCR using sequencing primer sites on the plasmid. The positive colonies were grown in CircleGrow medium (Qbiogene, Carlsbad, California, USA) and purified with the Montage Plasmid Miniprep96 kit (Millipore, Billerica, Massachusetts, USA). Bidirectional sequencing was performed by the Genomic Analysis and Technology Core of the University of Arizona. Raw sequences were assembled into contigs and trimmed with the help of ContigExpress/VectorNTI (Invitrogen, Carlsbad, California, USA). The resulting sequences were analyzed by Blastx analysis on the National Center for Biotechnology Information website (http://www.ncbi.nlm.nih.gov/BLAST). The sequences were compared with the AlignX program of VectorNTI (Invitrogen, Carlsbad, California, USA).

MLPA assay
The MLPA assay was performed as described by Schouten et al. (2002) . Briefly, the MLPA probes were chosen to match the sequences obtained by sequencing. They were designed to have a melting temperature (Tmelt) of approximately 65°C, and PCR primer sequences (the same ones used by Schouten et al. [2002 ]) were added to the 5' end of one MLPA probe, and the 3' end of the other. The genomic DNA sample (e.g., 50–200 ng) and the MLPA probes (3.75 fmol) were incubated together in 0.28 mol/L KCl, 0.047 mol/L Tris-HCl pH 8.5, and 0.19 EDTA in 8 µL at 98°C for 5 min, then for 16 h at 65°C. For the ligation step, the following MLPA kit components (MRC-Holland, Amsterdam, Netherlands) were combined with 25 µL water: SALSA PCR buffer 3 µL, NAD buffer 3 µL, and 1 µL Ligase-65. This mixture was added to the annealed probe–genomic DNA mixture and incubated for 15 min at 54°C, followed by 98°C for 5 min. A 10-µL sample of the ligation mixture was transferred to 40 µL of a PCR reaction mixture. The PCR mixture contained Surestart Taq DNA polymerase (Stratagene, La Jolla, California, USA), the appropriate buffer components, and 2.5 µL each of 10 µmol/L stocks of the PCR primers. The PCR protocol consisted of 95°C for 9 min, 34 cycles of (94°C 45 s, 60°C 1 min, 72°C 1 min), followed by 72°C for 7 min. The PCR products were separated on 3% NuSieve (Cambrex East Rutherford, New Jersey, USA), 1% agarose gels.

Microarray printing
A small microarray was created by printing 70-mer oligonucleotides in 3x sodium chloride/sodium citrate buffer (SSC) on aminosilane-coated glass slides (Telechem International, Sunnyvale, California, USA) with an Omnigrid 100 (Genomic Solutions, Ann Arbor, Michigan, USA). The array oligos consisted of two specific oligos, and seven randomly chosen Arabidopsis genome oligos from an Array-Ready Oligo set (Operon Biotechnologies, Huntsville, Alabama, USA). The specific oligos represented the specific portions of the plus strand of the MLPA products for Datura metel and Lobelia inflata. The oligo sequences for Arabidopsis thaliana are proprietary, but they were designed for the following genes: at3g01700, at4g00810, at1g44150, at5g57480, at3g22310, at5g57480, and at1g29430.

Microarray hybridization
An MLPA was performed with oligos for the following plants: Artemesia absinthium, Atropa belladonna, Caulophyllum thalictroides, Datura metel, Digitalis lanata, Echinacea angustifolia, Glycyrrhiza uralensis, Hypericum perforatum, Lobelia inflata, Mentha pulegium, Symphytum officinale, Tanacetum vulgare, Teucrium canadense, T. chamaedrys, and Tussilago farfara, and the genomic DNA for Datura metel alone. The PCR step of the MLPA was modified to include a reverse primer that was Cy5-labeled. The PCR product was purified and used for hybridization with the small microarray described. Before hybridization, the array was rehydrated over a 55°C water bath four times for ~4–5 s each. They were then cross-linked to the surface with UV treatment three times with 120 mJ each time. The array was washed with 0.1% SDS for 10 min, then rinsed in nuclease-free water, and dried with a nitrogen jet. The hybridization reaction was performed with the PCR product in 90 µL of 2x SSC, 6% Liquid Block (Amersham Biosciences, Piscataway, New Jersey, USA), and 0.1% SDS. The mixture, after boiling for 2 min, was applied by capillary action under a 24 x 60 mm Lifterslip (Erie Scientific, Portsmouth, New Hampshire, USA) positioned on a microarray. After incubating in the dark at 58°C for 5 h, the microarray was washed with 2x SSC, 0.5% SDS at 58°C for 5 min. Two subsequent washes were at room temperature with 0.5x SSC. After drying, the slide was scanned at 635 nm in a Genepix 4200AL scanner (Molecular Devices, Sunnyvale, California, USA).

RESULTS

Collection of genomic DNA sequence samples from targeted plants
The PCR was used to amplify portions of the cytochrome P450 genes from target plant species to aid in their identification. In the absence of specific sequence information for the targeted species, the CODEHOP program was employed to design degenerate primers, starting with the protein sequences available for other plant cytochrome P450 genes. Different combinations of primers (Tables 1, 2) and genomic DNA were used to increase the odds of producing a useful product, but emphasis was placed on pairing the genomic DNA with primer sequences produced with the codon usage table for a related plant species. Typically, approximately 25% of the reactions produced products of the appropriate size for the expected cytochrome P450 product. When paired with a C primer, the D and E primers (Fig. 1) produced products of approximately 190 and 420 bp respectively.

After the PCR products were isolated and cloned, the resulting plasmid inserts were sequenced. A summary of the sequenced PCR products is given in Table 3. All of the sequences were trimmed to remove primer sequences. Some of the sequences represented were truncated, because of nonsense or unlikely stop codons in their proposed translations. The 32 PCR product sequences represent 21 different species. Information from submitting the sequences to Blastx analysis indicates that almost all of the sequences are likely derived from plant cytochrome P450 genes. Even for the sequences with the higher expected values (e.g., 0.0004 for the Capsicum annuum sequence), the best match available in the databases is a cytochrome P450 gene. One sequence, obtained from Ephedra nevadensis, had no significant match using Blastx (not shown).


View this table:
[in this window]
[in a new window]
 
Table 3. Summary of sequence data for PCR-amplified segments of genomic DNA from targeted plant species

 
Some of the Blastx matches indicate a close relation to the cytochrome CYP71 family for five of the PCR products, which is consistent with the protein sequences used with the CODEHOP program. Other sequences best matched members of the greater CYP71 clan (Nelson et al. 2004 ), including the CYP98 family in plants (see Fig. 2) and the CYP76, CYP80, and CYP93 families. The alignment in Fig. 2 shows the high level of homology within all five sequences that matched CYP98 family members by Blastx analysis. By comparison, alignments for sequences that matched the CYP71, CYP76, and CYP93 families were significantly more divergent.


Figure 2
View larger version (128K):
[in this window]
[in a new window]
 
Fig. 2. Alignment of the sequences of the CYP98-like PCR products. All sequences were matched by Blastx analysis to a CYP98 family member with an Expect value of 2e-47, or smaller. The alignment was performed with the AlignX program of VectorNTI (Invitrogen, Carlsbad, California, USA) using the default settings. Bases highlighted with dark gray are 100% identical. Those with light gray show 50% or greater identity, and those with white are 20% or less identical. Identities were 50.9% and positives 85.9% overall or 56.3% and 95.0%, respectively, in the portion that was common to all. The complement of sequence AY870903 was used for the alignment, and so it is marked rc (reverse copy)

 
The P12 primer pair, which was generated with the Solanum tuberosum codon usage table, was especially successful at producing PCR products. The P12 pair produced 14 of the sequences in these experiments, indicating that they were especially good matches with some plant cytochrome P450 genes. This pair produced three matches for CYP71 family members, five matches for the CYP98 family members, and four matches for CYP76 family members. Twelve of the 14 products of the P12 pair of primers were obtained with genomic DNA from plants of the Asteridae subclass. For four of the P12 products, the first- or second-best match by Blastx was p-coumaroyl shikimate 3-hydroxylase (cyp98A13) from Ocimum basilicum.

Design and testing of MLPA probes for targeted plants
The sequence samples obtained from the various plant species were used in the design of probes for the MLPA assay. Primer sequences were taken, as much as possible, from the least conserved sections of each sequence. They were designed to have a Tmelt of approximately 65°C in 100 mmol/L NaCl. Hairpin and dimer formations were avoided. The probes were tested for their suitability for identifying specific target plants in a series of experiments.

The MLPA probes (Table 4) were tested in MLPA assays with genomic DNA from different species. Initially, the probes were tested with genomic DNA from the appropriate species and with Arabidopsis thaliana genomic DNA as a negative control. Arabidopsis thaliana DNA was tested also with MLPA probes designed specifically for Arabidopsis thaliana to provide a positive control. The results of two such experiments are given in Table 5. All the ligase-dependent PCR products for these experiments were typically 110–120 bp (Fig. 3), because they represent the ligation product of two MLPA probes, which were from 45 to 60 bp each. The reaction with the Arabidopsis probes and Arabidopsis genomic DNA yielded a product. Otherwise, all but one MLPA probe set produced a product with their cognate plant genomic DNA, but not with Arabidopsis DNA. The exception was for the Symphytum officinale probes, which did not produce a product at all.


View this table:
[in this window]
[in a new window]
 
Table 4. MLPA probes used in plant identification tests

 

View this table:
[in this window]
[in a new window]
 
Table 5. Initial test of MLPA probe to determine their specificity for the intended species. The results of the MLPA reactions are summarized with (+) indicating a PCR product was formed, and (–) indicating no PCR product was formed

 

Figure 3
View larger version (40K):
[in this window]
[in a new window]
 
Fig. 3. Test of species specificity for the MLPA ligation probes. An MLPA experiment was performed with different sets of ligation probes and different samples of genomic DNA. After the ligation probes were annealed to the genomic DNA for 16 h, the ligation reaction was performed, followed by PCR amplification of any ligated sequences. The PCR products were separated on a 3% NuSieve agarose (Cambrex, Rockland, Maine, USA)/1% agarose electrophoresis gel and stained with SybrGold (Invitrogen, Carlsbad, California, USA). Ligation probes designed for Artemisia absinthium (lanes 2–4), Atropa belladonna (lanes 5–7), Caulophyllum thalictroides (lanes 8–10), Datura metel (lanes 11–13), Echinacea angustifolia (lanes 14–16), Lawsonia inermis (lanes 17–19), and Lobelia inflata (lanes 20–22) were incubated with genomic DNA from the cognate species (lanes 2, 5, 8, 11, 14, 17, 20) and with genomic DNA from differing species: Artemisia vulgaris (lane 3); Echinacea angustifolia (lane 4); Datura metel (lane 6); Hyocyamis niger (lane 7); Aconitum napellus (lane 9); Lobelia inflata (lane 10); Atropa belladonna (lane 12); Hyocyamus niger (lane 13); Lobelia inflata (lane 15); Artemisia absinthium (lane 16); Larrea tridenta (lane 18); Citrus aurantium (lane 19); Echinacea angustifolia (lane 21); and Artemisia absinthium (lane 22). A 100-bp ladder (100-1000 bp) is in lanes 1 and 23

 
In another set of experiments, the MLPA probes were tested against genomic DNA from their cognate genomic DNA and against DNA from two other species. An emphasis was placed, as much as possible, on matching the plant genus or family for the plant species used in these negative controls. In all cases, the MLPA probes produced a product when paired with the appropriate genomic DNA (Figs. 3 and 4). Additionally, MLPA tests for Artemisia absinthium, Glycyrrhiza uralensis, Teucrium canadense, and Tanacetum vulgare also produced products with DNA from other species. For Artemisia absinthium and Teucrium canadense, plants in the same genus, Artemisia vulgaris and T. chamaedrys, respectively, produced PCR products. In the case of the MLPA probes for Tanacetum vulgare, two different plants in the same family (Asteraceae) produced products. Thus, some of these MLPA probe sequences may represent genes common among many plant relatives, while others may be limited to one genus.


Figure 4
View larger version (40K):
[in this window]
[in a new window]
 
Fig. 4. Further MLPA probe tests. MLPA probes for the following species were tested (details as in Fig. 3): Digitalis lanata (lanes 1–3); Glycyrrhiza uralensis (lanes 4–6); Hypericum perforatum (lanes 7–9); Mentha pulegium (lanes 10–12); Teucrium canadense (lanes 14–16); Teucrium chamaedrys (lanes 17–19); Tussilago farfara (lanes 20–22); and Tanacetum vulgare (lanes 23–25). Genomic DNA was from the cognate species (lanes 1, 4, 7, 10, 14, 17, 20, 23) and from the following species: Artemisia absinthium (lane 2); Echinacea angustifolia (lane 3); Capsicum annuum (lane 5); Larrea tridenta (lane 6); Capsicum annuum (lane 8); Echinacea angustifolia (lane 9); Symphytum officinale (lane 11); Teucrium canadense (lane 12); Teucrium chamaedrys (lane 15); Mentha pulegium (lane 16); Teucrium canadense (lane 18); Mentha pulegium (lane 19); Echinacea angustifolia (lane 21); Tanacetum vulgare (lane 22); Artemisia absinthium (lane 24); Echinacea angustifolia (lane 25). The 100-bp ladder was in lane 13

 
Use of a custom microarray for simultaneous testing for multiple plant species
The concept of using a microarray as the final readout for a highly multiplexed plant identification assay was tested, by printing a custom microarray with two 70-mer oligos that matched the MLPA PCR products for Datura metel and Lobelia inflata. Lobelia inflata was chosen because of the other probes used in these experiments, the probe sequences for this species most closely matched those of D. metel. The rest of the array consisted of control oligos that had no relationship to any of the plant probes used in our assays. An MLPA ligation reaction was performed with 15 different sets of probes, including the set for a toxic plant, D. metel, and a set for Lobelia inflata. Only the genomic DNA for D. metel was added to the ligation reaction. For testing the microarray, the PCR reaction included a Cy5-labeled reverse primer, and the PCR product was used in a hybridization on the array. The fluorescence scan of the microarray is given in Fig. 5. A red reflectance scan showed all nine oligos printed on the microarrays, grouped in four spots per oligo. By comparison, in the fluorescence scan only one oligo group, corresponding to the D. metel oligos, was visible. The oligo group printed just below the D. metel group corresponds to L. inflata and was not fluorescent. This experiment demonstrated that a microarray hybridization can readily determine which ligation probes successfully recognized their target genomic DNA in a sample and which do not.


Figure 5
View larger version (24K):
[in this window]
[in a new window]
 
Fig. 5. Microarray for the identification of species detected in a multiplexed MLPA reaction. The MLPA reaction included 15 different pairs of ligation primers for diverse plant species and genomic DNA from one species (Datura metel). The microarray included nine different sets of oligonucleotides complementary to various plant sequences. After hybridization, the array was scanned at 635 nm at full laser power with a red filter (left) to reveal the spots that hybridized with the Cy5-labeled target. The hybridized spots corresponded to the oligos complementary to sequences derived from Datura metel. An untreated array from the same print set was scanned at 635 nm with laser power at 46% and with a neutral density filter (ND3) to show the location of all spots by reflectance off salt deposits (right)

 
DISCUSSION

The CODEHOP method (Rose et al., 2003 ) for degenerate PCR primer design was developed for amplifying unsequenced genes with homology to known sequences. The method starts with a group of related protein sequences and uses conserved blocks of aligned protein sequences to design PCR primers. To accomplish this, CODEHOP employs a user-selected codon usage table to reverse-translate the protein sequences. The CODEHOP-derived primers have been used previously to amplify successfully cytochrome P450 genes in plants (Morant et al., 2002 ). We felt that this would be an ideal method and an ideal family of proteins to use for finding representative sequence tags for species of plants underrepresented in the databases. We chose the CYP71 family of proteins for their ubiquity in plants and because many members of the extended CYP71 clan function in the synthesis of secondary metabolites related to defense, as opposed to the more conserved metabolic functions found in all vascular plants (Nelson et al., 2004 ).

We chose as a starting place a well-conserved section of the cytochrome P450 molecule, the heme-binding domain. Indeed, this was the only section of the CYP71 sequence that generated useful primers with the CODEHOP program, without excessively increasing the degeneracy (32–64 or less was preferred) or using a relatively high strictness number (typically 0.4 or less). A higher degeneracy essentially reduces the effective concentration of the PCR primers. A higher strictness number means that the primers are increasingly biased towards the most common base in each position and could limit severely the range of sequences that the primers could amplify. We began by using a limited number of plant codon usage tables to provide a collection of primers to test.

The CODEHOP primers worked very well at producing PCR products. Although not all combinations of primers and genomic DNA samples produced products immediately, 89% of the genomic DNA samples produced products without a need for altering our initial PCR conditions. We were successful at cloning most of these for sequencing.

A Blastx analysis of the PCR product sequences indicated that some likely represented CYP71 family members. Other products were probably from families that are considered part of a greater CYP71 clan, including CYP76, CYP98, and CYP93.

The sequences allowed us to test the MLPA method for identifying the species of origin of samples containing plant genomic DNA. The MLPA method was consistently successful at recognizing genomic DNA from the appropriate species. Preliminary tests for each probe set against DNA samples from Arabidopsis thaliana and two other species of plants demonstrated that 11 of 15 of the MLPA probes were indeed specific to one plant species (within the limitations of our tests). Four of the ligation probe sets tested recognized a DNA sample from a species other than that intended. In three of these cases, another plant in the same genus produced a product. This reveals a predictable problem in trying to construct a reliable assay for any species. Related species may be very difficult to distinguish genetically, unless sequence information is found that helps distinguish one related species from another. The weaker PCR bands that resulted for these false positives indicate that raising the annealing temperature for the MLPA could improve the selectivity of these assays. For the T. vulgare probes, DNA harvested from plants of the same family (Asteraceae) produced false positives, thus indicating that these probes are inadequate for use in plant identification.

A way to improve the specificity of the ligation probes is to use sequence information from less-conserved sections of a gene. In these studies, our probes were all derived from the protein sequences that surround the heme-binding region of CYP71 family members. This is a relatively well-conserved section of the cytochrome P450 molecule, because of its clear importance to enzymatic function. If a section of the molecule closer to the amino terminus were used to produce PCR primers, then longer, less-conserved stretches of sequence could be obtained. To produce effective primers from other regions of the molecule may require the acceptance of a higher level of degeneracy in the PCR primers than tolerated in these experiments. The CODEHOP primers have some ability to compensate for degeneracy of the primers, because all primers include a long stretch of sequence in common on the 5' end, which enables all primers in a degenerate mixture to anneal to amplification products of the other primers, at least at lower annealing temperatures.

Another way to improve the specificity of ligation probes would be to avoid using sequences that likely derive from well-conserved types of cytochrome P450. For example, five different PCR products derived from different plants were very close matches by Blastx analysis to members of the CYP98 family, which are coumaroyl ester 3-hydroxylases (Morant et al., 2002 ; Gang et al., 2002 ). This group of enzymes is important to the synthesis of lignin monomers and therefore has a conserved, primary function for the plant. Over time, as more sequences are entered in the public databases, we may be able to improve our arsenal of CODEHOP primers, so that they are less likely to amplify sequences from such conserved genes. The list of sequences that may be used to generate primers with CODEHOP increases and may provide for more precise primer design to avoid the more-conserved members of the broad, cytochrome P450 family. Similarly, we may learn to avoid primers that are likely to produce less-useful sequences. In our tests, the P12 primer pair was used to amplify all five of the CYP98 family members in Fig. 2. This primer pair was also by far the most productive one for these experiments, having produced almost half of the cloned sequences. Minor modifications of the P12 primer pair may help avoid the amplification of CYP98 sequences. Ultimately these improvements could help speed the scale-up of this method to a higher throughput.

The MLPA method itself is a good choice for this application, because it uses for selectivity long probes at relatively high annealing temperatures, combined with a strict requirement for a perfect match between probes and target DNA at the point where the two probes meet. Ideally, the point in the genomic DNA sequence chosen for the joining of the probes should be poorly conserved and thus more likely unique for each species.

Applications for MLPA assays typically employ multiple probe sets, and the ligation probes are designed to include a spacer between the PCR primer-binding site and the gene-specific probe sequence (Schouten et al., 2002 ). In multiplexed assays, the resulting final products are different sizes so that they may be distinguished by agarose electrophoresis. The discrimination of different size bands by electrophoresis is potentially a strong limitation of the multiplexing of MLPA assays. In our studies, we did not include these spacer sequences and plan to use instead a microarray step to identify any or all amplified sequences in a multiplexed assay. This modified method would use a custom microarray with oligo DNA probes that match perfectly the gene-specific portions of the ligated probes. This approach should make it much easier to design and produce MLPA probes and may enable us to use a higher degree of multiplexing in the ultimate plant diagnostic assay planned.

We tested the concept of using a microarray with the MLPA assay in a simple experiment that included 15 sets of probes in a single tube and genomic DNA from one species. The resulting fluorescent scan clearly indicated that the MLPA assay can be used to test for multiple species simultaneously and that a microarray can distinguish which species is being recognized by the assay. The test study also demonstrated that this method could identify an alternative medicine or other consumer herbal preparation containing potentially toxic Datura plant leaves rather than, or in addition to, the intended plant materials. In the future, this assay could simultaneously monitor for hundreds or thousands of other toxic plant species.

We plan to continue expanding the number of plant species that can be tested in this assay. As more unique sequences are obtained for diverse plant species, more oligos can be added to the custom microarray used for the final readout. We expect that with development, this method will provide a critical tool for monitoring the content of herbal or alternative medicines and dietary supplements. This tool should help protect the consumer and facilitate more reliable research into the efficacy of these products. The combined specificity and scalability of this method means that it could be applied also to identification of plant materials for general botanical studies or studies in archaeology, paleontology, for biodefense, or for forensic studies.

Conclusions
Degenerate, consensus, PCR primers were used to amplify partial cytochrome P450 sequences from 20 different species of plants used in preparation of alternative medicines or considered potential contaminants of these preparations. The resulting PCR products were cloned and sequenced. The sequences were used to design probes for use in MLPA assays for identification of plant specimens that contain genomic DNA. Tests of the MLPA probes indicate that they can be used to identify plant specimens, although care must be taken to consider that some probes may not be specific to one unique species, but may recognize closely related species. When many MLPA probe sets are used in a single tube assay, a microarray can be used to determine the genetic source of genomic DNA in a sample. This method can be applied ultimately to the identification of the source of any sample that contains plant genomic DNA.

FOOTNOTES

1 The authors thank the Center for Disease Control for financial support of this research (Grant no. H75/CCH923339-01) and Payam Morgan for his technical support. Back

4 Author for correspondence (rogerab{at}ag.arizona.edu ) Back

LITERATURE CITED

Chan K.. 2003. Some aspects of toxic contaminants in herbal medicines. Chemosphere 52: 1361-1371.[Medline]

Dasgupta A.. 2003. Review of abnormal laboratory test results and toxic effects due to use of herbal medicines. American Journal of Clinical Pathology 120: 127-137.[CrossRef][Web of Science][Medline]

Gang D. Beuerle T. Ullmann P. Werck-Reichhart D. Pichersky E.. 2002. Differential production of meta hydroxylated phenylpropanoids in sweet basil peltate glandular trichomes and leaves is controlled by the activities of specific acyltransferases and hydroxylases. Plant Physiology 130: 1536-1544.[Abstract/Free Full Text]

Ize-Ludlow D. Ragone S. Bruck I. Bernstein J. Duchowny M. Garcia-Peña B.. 2004. Neurotoxicity in infants seen with consumption of star anise tea. Pediatrics 114: 653-656.[CrossRef]

Izzo A. Borrelli F. Capasso R.. 2002. Herbal medicine: the dangers of drug interaction. Trends in Pharmacological Sciences 23: 358-359.[CrossRef][Medline]

Karliova M. Treichel U. Maolago M. Frilling A. Gerken G. Broelsch C.. 2000. Interaction of Hypericum perforatum (St. John's wort) with cyclosporin A metabolism in a patient after liver transplantation. Journal of Hepatology 33: 853-855.[CrossRef][Web of Science][Medline]

Morant M. Bak S. Moller B. Werck-Reichhart D.. 2003. Plant cytochromes P450: tools for pharmacology, plant protection and phytoremediation. Current Opinion in Biotechnology 14: 151-162.[CrossRef][Web of Science][Medline]

Morant M. Hehn A. Werck-Reichhart D.. 2002. Conservation and diversity of gene families explored using the CODEHOP strategy in higher plants. Plant Biology 2: 7-12.[CrossRef][Medline]

Naik R. Freudenberger S.. 2004. Ephedra-associated cardiomyopathy. Annals of Pharmacotherapy 38: 400-403.[Abstract/Free Full Text]

Nelson D. Schuler M. Paquette S. Werck-Reichhart D. Bak S.. 2004. Comparative genomics of rice and Arabidopsis: Analysis of 727 cytochrome P450 genes and pseudogenes from a monocot and a dicot. Plant Physiology 135: 756-772.[Abstract/Free Full Text]

Rose T. Schultz E. Henikoff J. Pietrokovski S. McCallum C. Henikoff S.. 1998. Consensus-degenerate hybrid oligonucleotide primers for amplification of distantly related sequences. Nucleic Acids Research 26: 1628-1635.[Abstract/Free Full Text]

Rose T. Henikoff J. Henikoff S.. 2003. CODEHOP (Consensus-degenerate hybrid oligonucleotide primer) PCR primer design. Nucleic Acids Research 31: 3763-3766.[Abstract/Free Full Text]

Schouten J. McElgunn C. Waaijer R. Zwijnenburg D. Diepvens F. Pals G.. 2002. Relative quantification of 40 nucleic acid sequences by multiplex ligand-dependent probe amplification. Nucleic Acids Research 30: e57.[Abstract/Free Full Text]

Soni M. Ioana G. Carabin I. Griffiths J. Burdock G.. 2004. Safety of ephedra: lessons learned. Toxicology Letters 150: 97-110.[CrossRef][Web of Science][Medline]

Stedman C.. 2002. Herbal hepatotoxicity. Seminars in Liver Disease 22: 195-206.[CrossRef][Web of Science][Medline]

Thabrew I. Munasinghe J. Chackrewarthi S. Senarath S.. 2004. The effects of Cassia auriculata and Cardiospermum halicacabum teas on the steady state blood level and toxicity of carbamazepine. Journal of Ethnopharmacology 90: 145-150.[CrossRef][Web of Science][Medline]

Umemoto N. Kobayashi O. Ishizaki-Nishizawa O. Toguri T.. 1993. cDNAs sequences encoding cytochrome P450 (CYP71 family) from eggplant seedlings. Federation of European Biochemical Societies Letters 330: 169-173.[CrossRef][Web of Science][Medline]


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Facebook Facebook   Add to Reddit Reddit   Add to Technorati Technorati   Add to Twitter Twitter    What's this?



This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Web of Science (1)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Barthelson, R. A.
Right arrow Articles by Woosley, R. L.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Barthelson, R. A.
Right arrow Articles by Woosley, R. L.
Agricola
Right arrow Articles by Barthelson, R. A.
Right arrow Articles by Woosley, R. L.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Facebook   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS