Main

The geographic distribution of Y chromosome SNPs was compared with previously published studies of autosomal and mitochondrial DNA variation6,7,8 (Table 1). Theory based on the island model of migration indicates that FST (approximated as the variation among populations) is equal to 1/(1+4Nν) for diploid systems, and to 1/(1+Nν) for haploid systems (that is, mitochondrial and Y chromosome data), where N is the effective population size and ν is the sum of migration (m) and mutation (μ; more exactly, ν=m+μ-mμ ref. 9). The mutation rate of 3.5×10–8 per site, per year for mtDNA is approximately 10–20 times higher than the autosomal or Y-chromosomal rates of 1.9×10–9–5.4×10–9 (Refs 10,11). The mutation rates for all three systems, however, are substantially lower than any estimates of the migration rate12, and are unlikely to contribute to major differences among these three genetic systems. We assume that ν is equal to m.

Table 1 Single nucleotide polymorphism diversity

Estimates of Nν from FST between world populations are shown (Table 2). If N is the same for males and females, the female migration rate would exceed the male migration rate by a factor of 7.96. As expected, the autosomes displayed a value of Nν, which is intermediate between the values calculated from mtDNA and Y chromosomes (and close to their geometric mean).

Table 2 FST, Nν and their ratios for SNP data

We compared data for autosomal and Y-chromosomal microsatellites (short tandem repeat polymorphisms) in 14 African populations (Table 3). The fraction of the total variance found within populations was higher for the autosomes than for the Y chromosome. An analogue of FST for microsatellites (RST; Refs 13,14) indicated an Nν for the autosomes of 9.37, versus 4.26 for the Y. Their ratio of 2.20 was nearly the same as the ratio reported for SNPs (Table 2).

Table 3 Microsatellite diversity

Another approach, using the regression of genetic distance onto geographic distance (Fig. 1), leads to the same conclusion. The slope for the Y chromosome data was higher than for the mtDNA data by a factor of 8.22 (P<0.05), indicating a more rapid increase of genetic distance for the Y chromosome. This, too, was in close agreement with the FST analysis presented (Table 2).

Figure 1: Regressions of genetic distance (FST) onto geographic distance (km) in Europe.
figure 1

Three independent data sets were used (Y chromosome28, mtDNA (ref. 29) and classical markers30). The slope for the Y chromosome (1.108×10–4) is significantly greater than for the autosomes (1.014×10–5; P=0.0095) and mtDNA (1.348×10–5; P=0.0269).

Although a difference in the female versus male migration rate appears to be the most likely explanation for these results, several other factors may contribute. Natural selection could produce this pattern of geographically localized Y chromosome variation by eliminating variation from within populations. Evidence has been presented against recent Y chromosome selective sweeps in humans11, and our own data fail to indicate the significant involvement of natural selection.

Polygyny or higher rates of male mortality might also reduce the level of Y chromosome variation within populations relative to the autosomes and mtDNA. In strictly monogamous populations, the effective population size of autosomal genes is four times greater than for the haploid Y chromosome and mtDNA, because the autosomes are diploid and transmitted by both sexes. Polygyny decreases the effective population size of the Y chromosome (NY), although the autosomal N is also reduced by polygyny, to 4MF/(M+F), where M and F are the number of males and females, respectively. It can be shown that the ratio of the autosomal to Y-chromosomal N is approximately equal to where R is the ratio of married men to married women in a population.

For an extreme value of R=0.54, where 53% of the men of a population are married and have 2.6 wives on average15, NY differs from Nau only by a factor of 5.7, instead of the expected factor of 4. We have calculated a more realistic global value of Nau to NY of 4.27 from data on a sample of 66 populations16. This is a relatively minor, and probably undetectable, deviation from the expected ratio of 4. Although some two-thirds of traditional societies may allow or encourage polygyny, there are several reasons to expect our estimates to exaggerate its effects. First, very few males (typically fewer than 20%) control sufficient resources to support more than one wife. Second, approximately one-third of human societies practice sororal polygyny, in which additional wives must be sisters of the first wife. Sisters share identical mitochondrial genomes, and one-half of their autosomal genes (on average). In such cases, the Y chromosome should experience no reduction in effective population size relative to mtDNA, and its reduction relative to the autosomes should be much lower than if wives were unrelated. Third, we assume all women are equally fertile, when, in fact, women in polygynous marriages tend to be less fertile than their monogamous counterparts17,18,19.

The conclusion that males migrate less than females is not immediately obvious, as men do travel more than women in many societies. Among the Biaka Pygmies, the exploration range of men is 1.8 times greater than that of women20. The most important parameter from a genetic perspective is the distance between the birthplace of parents and children, and this is largely determined when one spouse leaves the natal household to join the other. Marriages in agricultural economies (and probably earlier foraging economies as well21), where the land is inherited by sons, are typically patrilocal. Thus women tend to be more mobile than men with respect to transgenerational movements. Precise estimates of the magnitude of the difference in migration rates produced by patrilocality are difficult to obtain, but approximately 70% of societies are patrilocal22,23.

There seem to be no comparable data among primates, although some may be forthcoming. The closest data available today are perhaps in macaques, where a contrasting pattern has been observed. Female macaques generally remain in their natal groups, whereas most (if not all) males disperse to join others. Although Y-chromosomal data from macaques are not available, a much smaller fraction of mitochondrial variation (approximately 9% of the total) is found within species when compared with the distribution of autosomal variation (where approximately 91% of the total is distributed within species24). By some measures, macaques are polygynous, with most offspring in a social group fathered by a single male in any particular year25. However, females rarely bear offspring with the same male on more than one occasion25, so the relative effective population sizes of males and females are not precisely known. If the effective population size of males is in fact smaller than for females, as some expect25, this provides further evidence against an extensive contribution of polygyny to our observations from humans.

Our data on humans are consistent in showing that a significantly greater fraction of the total variance is distributed between populations for the Y chromosome relative to both the autosomes and mitochondrial DNA. Our analysis indicates that a lower transgenerational migration rate of males compared with that of females must be the most significant factor in producing this observation. Polygyny makes a modest contribution, but it appears to account for less than 10% of the discrepancy.

Methods

Population samples.

DNA was analysed from 14 African populations. Informed consent was obtained from all DNA donors. The number of complete Y chromosome haplotypes is indicated in parentheses for each population. Sudan: samples from the Hadendowah and Beni Amer sections of the Beja (6) were collected in and around Kassala. Eleven populations from Ethiopia were sampled: Konso (8) from Konso town; Tsamako (5) and Ongota (9) near Weyto; Hamar (5) near Turmi; Dasenech (5) in Omorate; Dizi (4) and Surma (11) near Maji; Bume (or Nyangatom; 11) at Kibish; Bench (8) in Mizan Teferi; and Majangir (10) near Tepi. In Mali, collections from three populations were made: Dogon (7) and Peulh near Bandiagara (14.361 oN, 3.647 oW); Tuareg (9) near Timbuktu (16.661 oN, 3.240 oW); and Songhai (5) near Gao (16.287 oN, 0.044 oW). Three groups were defined in calculating the apportionment of diversity presented (Table 3): group1, Dizi, Dasenech, Hamar, Bench, Ongota, Beja, Tsamako and Konso; group 2, Nyangatom, Majangir and Surma; group 3, Dogon, Tuareg and Songhai.

DNA extraction.

DNA was extracted from whole blood (5 ml) drawn in EDTA anticoagulant. Extraction was begun in the field as quickly as possible (but always less than 2 d after collection) using a modified 'salting out' procedure26.

Y microsatellites.

Eight sets of primers were used to amplify 10 microsatellite loci on the Y chromosome: DYS19, DYS385I/II, DYS388, DYS389I/II, DYS390, DYS391, DYS392 and DYS393. Two of the primer sets typically produced two bands (DYS385 and DYS389). In many of the analyses, the two bands of DYS385 and DYS389 were treated as if from separate loci.

Autosomal microsatellites.

Primers for the following 29 loci were used (ABI Prism linkage mapping set). Primers for all members of a panel were multiplexed in the PCR and run on a single gel. Panel 11, D7S513, D7S550, D7S640, D7S657, D8S260 and D8S514; panel 12, D7S519, D7S531, D8S270, D8S277, D8S284 and D8S550; panel 21, D15S117, D15S130, D15S131, D15S153, D15S165, D16S411, D16S503, D16S515 and D16S520; panel 25, D19S210, D19S220, D19S420, D20S100, D20S115, D20S117, D20S171 and D20S196.

PCR amplification and allele size determination.

Microsatellite loci were amplified by PCR in reactions (5 μl) according to the manufacturer's protocols. Primers were multiplexed as noted above. Reactions included the Taqstart (Clontech) anti-Taq antibody.

We used a 'touchdown' cycling regime, beginning with 14 cycles of successive 0.5 °C decreases in annealing temperature (63–56.5 °C.) Twenty cycles with a constant 56 °C annealing temperature followed. Each step lasted 30 s, and a denaturing temperature of 94 °C and extension temperature of 72 °C were used throughout.

We ran PCR products on an automated DNA sequencer (ABI373a) with GS-350 or GS-500 dye-labelled size standard. Allele sizes were determined with GS analysis software (ABI).

The apportionment of diversity was calculated using the analysis of molecular variance (AMOVA), implemented in the Arlequin program8,27. We calculated all genetic distances, FST and RST using Microsat (E. Minch, available on request).