|
|
||||||||
Population Biology |
Department of Forestry, Michigan State University, East Lansing, Michigan 48824 USA
Received for publication May 23, 2003. Accepted for publication November 20, 2003.
| ABSTRACT |
|---|
|
|
|---|
Key Words: dispersal eastern white pine genetic structure microsatellite Pinus strobus population genetics spatial autocorrelation
| INTRODUCTION |
|---|
|
|
|---|
Eastern white pine (Pinus strobus L.) is an outcrossing (Beaulieu and Simon, 1995
), wind-pollinated forest tree, having wind-dispersed seed. In an earlier study using several isozyme loci (Epperson and Chung, 2001
), we obtained estimates of the amount of spatial autocorrelation in two study populations located in northern Michigan. In one natural, old-growth population, weak but consistent structure was indicated in both adults and seedlings. In a second population, which was logged around the turn of the 20th century, the genotypes of adults were nearly randomly distributed (possibly as a direct result of logging), but the seedlings appeared to have spatial autocorrelation similar to that in the natural condition. This situation provided an opportunity to study the effects of a single generation of limited seed dispersal. However, it was not possible to make statistical contrasts in the results, because of an insufficient number of alleles and a lack of available proper statistical tests in the literature.
In this article, we examine the spatial structures in seedlings in detail, using more than 20 alleles. We examine a number of nuclear microsatellite loci, which typically have much larger average numbers of alleles than do isozyme loci (e.g., Rajora et al., 2000
). The increase in information contained in spatial distributions with increasing numbers of alleles has been characterized recently (Epperson, 2003
). As long as there are more than three or four alleles at a locus, the spatial distribution of each allele represents a nearly independent realization of the processes of limited dispersal. Whereas the isozyme data had 58 separate spatially informative allele patterns, the microsatellite loci had 2124, thus representing a three- to fivefold increase in information. In addition, new results on the correlations of spatial coefficients for different alleles (Epperson, 2003
) allow proper statistical tests and standard errors to be constructed for multiallelemultilocus averages. It should be possible to obtain very precise estimates of spatial structure, which may allow statistical contrasts among the samples, even though the amounts of spatial autocorrelation are small.
Moreover, microsatellites and isozymes have rarely been studied in the same populations and almost never in studies of spatial structure within populations (an exception is Streiff et al., 1998
). Microsatellite loci are appealing markers in part because typically they are hypervariable owing to unusually high rates of mutation. However, they also typically have somewhat different arrays of allele frequencies, often having relatively large numbers of low-frequency alleles. In samples with moderate numbers of sampled individuals, alleles that have very low frequencies (e.g., when an allele occurs in only one sampled individual) are not useful for studies of spatial structure, although they can be useful for other types of studies, such as those directed at parentage assignment. Moreover, some microsatellites mutate at such high rates that we may begin to question if mutation rates can have direct influences on spatial structure (Epperson, 1990
), in addition to their indirect effects operating through numbers and frequencies of alleles. We examine some possible effects of high mutation rate in our data.
| MATERIALS AND METHODS |
|---|
|
|
|---|
120 trees/ha. One-year-old seedlings were sampled by collecting the one closest to each of the adult trees, which were mapped in the previous study. Originally, in 1998, 90 seedlings were sampled. However, because the seedlings were small and sampled nondestructively, sufficient tissue was available only for 46 of the seedlings. Consequently, in 1999 we sampled 43 new 1- yr-old seedlings to replace those that were lost, by sampling again from near the corresponding adult trees. Casual inspection of several hundred seedlings in the entire sample plot area revealed that nearly all were 1 yr old.
A similar size area was sampled in 1998 from a second population (second site, SS), which was logged approximately 100 yr ago, but undisturbed since. The SS population has regenerated and is well represented in all age classes. Seedlings of all sizes were present, in contrast to the OG site. Each of the sampled seedlings (121) was chosen as the one that was less than 30 cm in height and closest to each adult tree, as reported in Epperson and Chung (2001)
. Adult trees were considered as those having diameters >20 cm. The density of seedlings was obviously much greater at SS than at OG, although we did not measure the densities. Both populations are not pure stands, and other species include red pine, maple, beech, hemlock at OG and these plus black spruce and balsam fir at SS.
DNA isolation
DNA was isolated using the QIAGEN DNeasy Plant Kit (QIAGEN, Santa Clarita, California, USA). Needle tissue (
70 mg) was homogenized in a FastPrep FP120 instrument (BIO 101, Savant Instruments, Holbrook, New York, USA). A ceramic ball was placed at the bottom of a 2-mL screw cap centrifuge tube, the tissue was added, a ceramic cylinder was placed on top of the tissue, and then 600 µL of lysis buffer was dispensed. Samples were ground for 45 s at the 4.5 setting. Remaining steps were performed according to the QIAGEN DNeasy protocol. DNA concentrations were measured by fluorometry, and the average yield was 8 µg/extraction.
Nuclear microsatellite markers
Nuclear microsatellite markers were developed for eastern white pine (Pinus strobus) by Echt et al. (1996)
and are commercially available as primer pairs from Research Genetics (Huntsville, Alabama, USA). In this study, the following primer pairs were used: Rps1b, Rps2, Rps34, Rps39, Rps50, and Rps127. Forward primers were labeled with one of three different fluorescent phosphoramidite dyes, either 6-FAM, HEX, or TET (Applied Biosystems, Foster City, California, USA). Rps2 was scored only for the OG sample.
The PCR amplification protocol has been described elsewhere (Walter and Epperson, 2001
). Multiplexing was carried out by co-loading multiple PCR reactions with different fluorescent dyes in the same lane of the gel. This allowed simultaneous analysis and detection of polymorphic loci, even when they overlap in size. Electrophoresis was performed on a 0.40-mm 6% denaturating polyacrylamide gel (7 mol/L urea) at 2500 V/30 W for 5.56 h. The GeneScan sequencer (ABI 373; Applied Biosystems), with a 350-bp internal standard was used to determine the fragment sizes. In addition, we used two lanes with a known external standard on each gel to control for any potential gel-to-gel variations (which in fact did not occur). This procedure resolved fragment size to a single base pair. Repeatability of one-base-pair resolution was further verified by repeating the PCR amplifications and GeneScan gel runs for heterozygous individuals. Alleles are denoted by their nominal sizes in numbers of nucleotides; these should not be considered necessarily as absolute rather as relative sizes.
Statistical analysis
The distribution of allele frequencies among populations was assessed using the
estimator (Cockerham and Weir, 1993
) of FST. Observed values of
were compared to the null hypothesis of the absence of differentiation by bootstrapping over loci. Measures of Wright's (1922)
inbreeding coefficients FIS were also calculated. All of these statistics and their significance levels were analyzed using the GDA program of Lewis and Zaykin (2001)
.
Spatial autocorrelation analysis was calculated separately for each allele of each locus (Sokal and Oden, 1978
). Alleles that occurred in only a single tree in a sample were considered to be spatially noninformative and omitted from further analysis. For diallelic loci, calculations were done for only one allele because the information provided by the second allele would be completely complementary. For the remaining alleles, spatial autocorrelation analysis proceeded by treating nominal (join-count) correlations of genotypes converted into gene frequencies. Allele frequency values were assigned to genotypes as 1.0, 0.5, or 0.0 for homozygotes, heterozygotes, or no copies of that allele, respectively. For each allele, Moran's I statistics (Sokal and Oden, 1978
) was calculated. The individuals were paired as joins (using Euclidean distances) representing one of 10 distance classes. Joins were formulated in a way that formed a continuous network and that ensured adequate numbers of nearest neighbors. Special attention was paid to the first distance class, by choosing its upper limit as
1.5 times the square root of the inverse of sample density (Epperson, 2003
). By doing this, it could be guaranteed that the majority of near-neighbor pairs are included in the first distance class. Spatial autocorrelation coefficients (Moran's I) were calculated for each of the 10 distance classes, separately for both populations. For each coefficient, a two-tailed test of significance was applied, again separately for OG and SS. In addition, a test for significance for an entire set of 10 distance classes for an allele for a population was assessed using a Bonferroni-type approximation for multiple tests. These statistics for spatial structure analysis were all computed using Wartenberg's SAAP program (version 4.3; Wartenberg, 1989
). For OG, separate analyses were done for the two years of sampling. In both years, all collected seedlings were 1 yr old. Because seed-cone production at OG is sparse and highly erratic, seedlings collected in 1999 likely came from quite different maternal sources than those collected in 1998.
To obtain summary statistics for the overall amount of spatial structure, we calculated the average,
, across alleles and loci for each sample. The average was not weighted, because theoretical results show that alleles should generally have very similar stochastic and statistical variances (Epperson et al., 1999
), which was supported by inspection of the estimated variances under the null hypothesis of a random spatial distribution. Significance tests for the average values were developed by estimating the standard error under the null hypothesis, by the formula:
![]()
| RESULTS |
|---|
|
|
|---|
between the two populations for five of the loci. In the OG population, genotypes for two loci, Rps34 and Rps50, occurred in ratios that were statistically significantly different from Hardy-Weinberg expectations, in contrast to the SS population in which only locus Rps34 was different. In both populations, homozygotes for Rps34 were in excess. This is likely due to the presence of null alleles, which have been found in other unpublished assays of Rps34 in Pinus strobus. The presence of null alleles presumably would have marginal effects on the spatial analyses, because its largest effect is to overestimate the presence of homozygotes for rare alleles. Rare heterozygotes have values of allele frequency (0.5) that are already far above the mean. Regarding measures of differentiation between the two populations, none of the per-locus
values nor the overall value of
were statistically significant.
|
|
|
|
| DISCUSSION |
|---|
|
|
|---|
Adults of the SS population have essentially random spatial genetic distributions (Epperson and Chung, 2001
), and the lack of spatial structure may have been caused by logging at the site prior to the establishment of what is now the adult cohort. The results show that a single generation of limited seed movement has reestablished the natural level of spatial genetic structure among seedlings, because very similar amounts of structure were found for the seedlings at the OG population. At OG, for the year-old seedlings collected in 1999 the average value for distance class 1 was 0.035, larger by 0.060 than the expected value under the null hypothesis of a random distribution. Not as much structure was indicated in the seedlings collected in 1998, but for the average of the two years, a clear pattern of isolation by distance was found. For the first distance class, the average autocorrelation across the two years is 0.012, which is statistically greater than the average expected value, 0.023 (Z score = 2.03), and the excess, 0.035, is similar to that observed at the SS population. There was also some evidence of positive autocorrelation at the second distance class, 1525 m, for some alleles. This is consistent with the distance that seeds may disperse in Pinus species (e.g., Govindaraju, 1988
). It is reasonable to expect to find (mostly half-sib) seeds from the same tree concentrated within a "seed shadow" having diameter of ca. 3050 m. However, the degree of concentration also depends on the degree of overlap of seed shadows of different trees.
An interesting cautionary point is that if the OG seedling data for the two years had been blindly combined and then analyzed, much of the structure would have been masked. This can be related to the low and sporadic production of cones. At the OG site in 1998, we found mature cones on only 13 trees. In 1999, again few trees had cones, and most of these cone-bearing trees did not have mature cones in 1998. The cones that matured in 1998 probably contributed substantially to the (year-old) seedlings collected in 1999, because the only other potential sources could be seeds in any seed bank or seeds from outside the immediate area (although casual inspection of trees in same also indicated few had cones). It is reasonable to expect that similarly low cone production occurred in 1997, probably the year that most 1998 year-old seedlings came from. It was not feasible to analyze seed banks, but the difference between 1998 and 1999 itself suggests that a seed bank contributed little to the seedlings, because it would tend to make the yearly cohorts similar.
It should also be noted that although the old, mature trees in the OG population have been "undisturbed" and the ecology that created them was natural, fire suppression in the twentieth century has made the present situation at ground level far from natural. There is a very dense understory canopy of mature individuals of beech, maple, and hemlock, all of which are highly shade tolerant. Year-old seedlings of relatively shade-intolerant white pine are suffering near-complete mortality. Similarly, no seedlings, not even 1-yr-olds, of red pine (Pinus resinosa) were found despite very large seed production. Similar mortality of P. strobus and P. resinosa seedlings was observed at the Hartwick OG forest some 20 years ago (Rose, 1984
). Low light availability and deep leaf litter at the forest floor are likely responsible for lack of regeneration, and both are largely attributable to the understory of deciduous trees, most of which would not be present if fires had gone through the site. Moreover, the adult trees at OG should be considered an overmature stand, and most of the trees had crowns only near their tops (e.g., for many trees the distance from the ground to their lowest live branch was >30 m). Leaf biomass and net primary productivity are low (Rose, 1984
).
Year of establishment varied naturally among the seedlings sampled at the SS population. Only seedlings less than 30 cm in height were collected, but within this class all sizes were well represented. Hence, in this population we were also measuring over multiple years. Because we did not age the seedlings, we cannot separate the sample by yearly cohorts. However, in our observations cone productivity was much greater than at OG, and the presence of autocorrelation at apparently natural levels in the total sample itself suggests that the seed sources were similar over generative years.
The amount of structure greatly differed among loci, and it can be demonstrated that the differences are greater than expected by chance. In particular, almost all alleles of Rps50 have very little or no spatial autocorrelation. Rps50 has by far the greatest numbers of alleles. The allele frequencies of Rps50 may partially explain the cause. Although allele frequency generally has little or no effect on spatial autocorrelation measures, when an allele's frequency is less than about 0.02 to 0.05, some reduction in short distance correlation results, on the order of ca.15% for populations with dispersal levels like those in Pinus strobus (Epperson et al., 1999
; Epperson, 2003
). Several alleles with such frequencies were retained in the spatial analyses especially for Rps50 (Tables 2 4). However, this would explain only part of the difference and only for the rarest alleles.
The fact that Rps50 has higher number of alleles likely means that it has a higher rate of mutation than the other microsatellites. Recent work has shown that differences over a certain range of mutation rates can cause differences in the amount of spatial structure (B. K. Epperson, unpublished manuscript). For species with dispersal levels similar to those for Pinus species, more precisely corresponding to a neighborhood size of 115, a mutation rate of 102 resulted in an average value of 0.0211 for distance class 1 (nearest neighbors), which is 40% smaller than the value of 0.035 produced in the same model without mutation. Large reductions were also generally observed for other distance classes and dispersal levels. In contrast, when mutation rates are less than or equal to 103, spatial structure is essentially unchanged from that produced by models with no mutation. Thus the range of 102103 is critical for direct effects of mutation on spatial structure in general. Microsatellite markers often have high mutation rates, on the order of 103 (Jarne and Lagoda, 1996
) and up to 102 (Bruford et al., 1992
), in comparison to allozymes which have rates on the order of 106 (Voelker et al., 1980
).
Although the actual rates of mutation of the SSRs used in this study are not known, it appears that the range of relative mutation rates is consistent with a differential mutation effect on spatial structure, using the well-known relationship of effective number of alleles ne to the effective population size (not to be confused with the neighborhood size) N and µ, Nµ = (ne 1)/4, under the genetic drift-infinite alleles mutation model (Ewens, 1979
). A total of 12 alleles of Rps50 were present in the samples, and the average effective number of alleles was 5.7, compared to an average of 3.8 alleles and average effective number 1.7, for the remaining loci. This leads to estimates of Nµ of approximately seven (6.7) times larger for Rps50 (1.175) than for the remaining loci (0.175). Because the effective population size N should be essentially constant over loci, the seven-fold increase should hold true for the mutation rate for Rps50.
The average Moran's I statistic for distance class 1 is 0.0118 for Rps50 in the SS seedlings, indistinguishable from the expected value, 0.0098, under the null hypothesis of a random distribution. The average among alleles at all other loci is 0.0477, which is statistically greater than both the average for Rps50 and the expected value under the null hypothesis. Moreover, this average is quite close to that observed for allozymes, 0.04 (Epperson and Chung, 2001
). In comparison to theoretical models, the excess [0.0477 (0.0098) = 0.0575] corresponds to a neighborhood size of ca. 100. This is probably a more accurate estimate than the value of 230 obtained when Rps50 is included. For the OG seedlings sampled in 1999, the average value of Moran's I statistic for distance class 1 is 0.0187 for Rps50, again indistinguishable from the null hypothesis, whereas for all other alleles combined the average has a quite large value of 0.0681. As may be expected, for the seedlings collected in 1998 both the average for Rps50 (0.0078) and that for all other alleles (0.0134) were small. Nonetheless, the average for all loci other than Rps50 averaged over the two sample years is 0.0274, which represents an excess of 0.0508 over the average expected value. This value is very similar to the excess, 0.05, observed for isozymes (Epperson and Chung, 2001
).
Interestingly, another study of small-scale autocorrelation, that assayed both isozymes and microsatellites, in Quercus robur and Q. petraea, found more structure for the latter. However, the authors attributed this to statistical noise, especially because of the much smaller numbers of alleles assayed for isozymes relative to microsatellites (Streiff et al., 1998
).
The spatial autocorrelations in Pinus strobus can be compared to that found in other tree species. Average values for short distance classes for juvenile populations of 0.052 and 0.06 for Gleditsia triacanthos (Schnabel and Hamrick, 1990
) and 0.098, 0.063, and 0.032 (small scale intervals: 01 m, 0 2 m, and 05 m, respectively) for Quercus laevis (Berg and Hamrick, 1995
) were obtained.
In our study, two loci, Rps34 and Rps50, in the OG stand and also Rps34 in the SS population, deviated significantly from Hardy-Weinberg expectations, but as noted, the deviations for Rps34 are probably due to null alleles. All of the isozymes were in Hardy-Weinberg proportions (Epperson and Chung, 2001
). Beaulieu and Simon (1995)
found in two filial eastern white pine populations that one of four polymorphic loci deviated. The mean values of the estimates of Wright's fixation indices (FIS) for eastern and western white pine progeny and adult populations are listed in Table 5. The positive values reveal a slight deficit of heterozygotes, indicating some form of inbreeding, and negative values represent a slight excess of heterozygotes. However, this could be due to null alleles in considerable frequency.
|
| FOOTNOTES |
|---|
| LITERATURE CITED |
|---|
|
|
|---|
Berg E. E. J. L. Hamrick 1995 Fine-scale genetic structure of a turkey oak forest. Evolution 49: 110-120[CrossRef][ISI]
Bruford M. W. O. Hanotte J. F. Y. Brookfield T. Burke 1992 Multi- and single-locus fingerprinting. In A. R. Hoelzel [ed.], Molecular analysis of populations: a practical approach, 225269. IRL Press, Oxford, UK
Cliff A. D. J. K. Ord 1981 Spatial processes: methods and applications. Pion, London, UK
Cockerham C. C. B. S. Weir 1993 Estimation of gene flow from F-statistics. Evolution 47: 855-863
Crawford T. J. 1984 The estimation of neighborhood parameters for plant populations. Heredity 52: 273-283[ISI]
Echt C. S. P. May-Marquardt M. Hseih R. Zahorchak 1996 Characterization of microsatellite markers in eastern white pine. Genome 39: 1102-1108[Medline]
El-Kassaby Y. A. M. D. Meagher J. Parkinson F. T. Portlock 1987 Allozyme inheritance, heterozygosity and outcrossing rate among Pinus monticola near Ladysmith, British Columbia. Heredity 58: 173-181[ISI]
Epperson B. K. 1990 Spatial autocorrelation of genotypes under directional selection. Genetics 124: 757-771[Abstract]
Epperson B. K. 2003 Geographical genetics. Princeton University Press, Princeton, New Jersey, USA
Epperson B. K. M. G. Chung 2001 Spatial genetic structure of allozyme polymorphisms within populations of Pinus strobus (Pinaceae). American Journal of Botany 88: 1006-1010
Epperson B. K. Z. Huang T.-Q. Li 1999 Measures of spatial structure in samples of genotypes for multiallelic loci. Genetical Research, Cambridge University Press 73: 251-261
Ewens W. J. 1979 Mathematical population genetics. Springer, New York, New York, USA
Govindaraju D. R. 1988 Life histories, neighbourhood sizes, and variance structure in some North American conifers. Biological Journal of the Linnean Society 35: 69-78[CrossRef]
Jarne P. P. J. L. Lagoda 1996 Microsatellites, from molecules to populations and back. Trends in Ecology and Evolution 11: 424-429[CrossRef]
Lewis P. O. D. Zaykin 2001 Genetic data analysis: computer program for the analysis of allelic data, version 1.0 (d16c). Computer program distributed by the authors, website: http://lewis.eeb.uconn.edu/lewishome/software.html
Neale D. B. W. T. Adams 1985 The mating system in natural and shelterwood stands of Douglas-fir. Theoretical and Applied Genetics 71: 201-207[ISI]
Rajora O. P. M. H. Rahman G. P. Buchert B. P. Dancik 2000 Microsatellite DNA analysis of genetic effects of harvesting in old- growth eastern white pine (Pinus strobus) in Ontario, Canada. Molecular Ecology 9: 339-348[CrossRef][Medline]
Rose W. M. 1984 Biomass, net primary production and successional dynamics of a virgin white pine (Pinus strobus) stand in northern Michigan. Ph.D. dissertation, Michigan State University, East Lansing, Michigan, USA
Schnabel A. J. L. Hamrick 1990 Organization of genetic diversity within and among populations of Gleditsia triacanthos (Leguminosae). American Journal of Botany 77: 1060-1069[CrossRef][ISI]
Sokal R. R. N. L. Oden 1978 Spatial autocorrelation in biology. 1. Methodology. Biological Journal of the Linnean Society 10: 199-228[CrossRef]
Streiff R. T. Labbe R. Bacilieri H. Steinkellner J. Glössl A. Kremer 1998 Within-population genetic structure in Quercus robur L. and Quercus petraea (Matt.) Liebl. assessed with isozymes and microsatellites. Molecular Ecology 7: 317-328[CrossRef]
Voelker R. A. H. E. Schaffer T. Mukai 1980 Spontaneous allozyme mutations in Drosophila melanogaster: rate of occurrence and nature of mutants. Genetics 94: 961-968
Walter R. B. K. Epperson 2001 Geographic pattern of genetic variation in Pinus resinosa: area of greatest diversity is not the origin of postglacial populations. Molecular Ecology 10: 103-111[CrossRef][Medline]
Wartenberg D. E. 1989 SAAP. A spatial autocorrelation analysis program, version 4.3. Exeter Software, Setauket, New York, USA
Wright S. 1922 Coefficients of inbreeding and relationship. American Naturalist 56: 330-338[CrossRef][ISI]
Yazdani R. D. Lingren D. Rudin 1985 Gene dispersion and selfing frequency in a seed tree stand of Pinus sylvestris (L). In H. R. Gregorius [ed.], Population genetics in forestry, 11391154. Lecture Notes in Biomathematics, vol. 60. Springer-Verlag, Berlin, Germany
This article has been cited by other articles:
![]() |
C. Pardo, P. Cubas, and H. Tahiri Genetic variation and phylogeography of Stauracanthus (Fabaceae, Genisteae) from the Iberian Peninsula and northern Morocco assessed by chloroplast microsatellite (cpSSR) markers Am. J. Botany, January 1, 2008; 95(1): 98 - 109. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Cubas, C. Pardo, and H. Tahiri Genetic variation and relationships among Ulex (Fabaceae) species in southern Spain and northern Morocco assessed by chloroplast microsatellite (cpSSR) markers Am. J. Botany, December 1, 2005; 92(12): 2031 - 2043. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |