Abstract
Phylogeography of the mitochondrial lineages commonly found in Western Europe can be interpreted in the light of a postglacial resettlement of the continent. The center of this proposal lies in the Franco-Cantabrian glacial refuge, located in the northern Iberian Peninsula and Southwestern France. Recently, this interpretation has been confronted by the unexpected patterns of diversity found in some European haplogroups. To shed new lights on this issue, research on Iberian populations is crucial if events behind the actual genetics of the European continent are to be untangled. In this regard, the region of Asturias has not been extensively studied, despite its convoluted history with prolonged periods of isolation. As mitochondrial DNA is a kind of data that has been commonly used in human population genetics, we conducted a thorough regional study in which we collected buccal swabs from 429 individuals with confirmed Asturian ancestry. The joint analysis of these sequences with a large continent-wide database and previously published diversity patterns allowed us to discuss a new explanation for the population dynamics inside the Franco-Cantabrian area, based on range expansion theory. This approximation to previously contradictory findings has made them compatible with most proposals about the postglacial resettlement of Western Europe.
Similar content being viewed by others
Introduction
The genetic diversity of European populations has been an intensely researched topic because of its implications in the reconstruction of the history of the human species.1 In the last decade, many efforts have been dedicated to the search for patterns that could be related to known prehistoric events; for example, migrations during the Last Glacial Maximum or the Neolithic demic diffusion.2 Uniparental markers, such as mitochondrial DNA (mtDNA) and the Y-chromosome, have proven increasingly useful for delving into these patterns, allowing for a mostly accepted chronology of key events for European demography to be established throughout the years.3 Nevertheless, discussion of some of the inferences related to this chronology has caused controversies that remain open, including the presumed repopulation of most of Western Europe during the Magdalenian (ca.17–12 ka).4
The most common interpretation of the phylogeography of European mtDNA haplogroups currently supports this theory, as there is a strong geographical southwestern–northeastern cline in the frequencies of haplogroups with postglacial coalescence times, such as H1, H3 and V.5 Frequency peaks for those lineages can all be found in the northern Iberian Peninsula and the southwestern part of France, in a territory which formed a glacial refuge called the ‘Franco-Cantabria’.6 Molecular dissection studies and intensive samplings have shown these clines to be consistent and compatible with a postglacial (formerly late-glacial) resettlement of most European regions, which would be centered in this refuge.7, 8
Lately, this idea has been criticized with the argument that while the frequencies of European postglacial haplogroups are the highest in Franco-Cantabrian populations, they also show their lowest molecular diversities there.9 Furthermore, marked heterogeneities in haplogroup composition appear even between geographically close populations of this area, which are probably related to genetic drift due the steep relief of northern Iberia. Both evidences point to the possibility that the Franco-Cantabrian genetic makeup may have evolved discretely and differently from the rest of Europe, and may not be adequate for inferring continental patterns. Following this view, the proposed repopulation should be centered in other refuges, such as those located in Southern France,10 Italy,11 the Balkan Peninsula12, 13 or the Near East.14
But these low diversities can be interpreted in another way. It is known that human population size in Europe declined from the beginning of the Upper Paleolithic until the Bølling-Allerød interstadial, because of a dietary breadth contraction caused by prey extinction.15 A likely effect of this would have been an enhancement of genetic drift coupled with recurrent population bottlenecks, which are the major factors in the shaping of genetic diversity.16 During the Last Glacial Maximum, conditions would have been worsened, as paleoecological modeling has shown the maximum range of the glacial refuges to be small and constrained by permafrost areas.17 In this situation, carrying capacity of the Franco-Cantabrian refuge would not have been able to sustain a large, expanding population, and a series of fragmented populations would have been more likely, as has been inferred to have occurred in regions of the refuge in a previous ancient DNA study.18 In such a situation, incoming migration promotes increased competition for resources in the area, which in turn hinders population growth; meanwhile, fragmentation promotes ‘refuge effects’19 in which local populations can be entirely monomorphic, but the majority of lineages remain somewhere in the metapopulation.20 Such heterogeneity can be indirectly seen by the diversification and coexistence of Epigravettian and Solutrean cultures during glacial times.17, 21 After the improvement of climatic conditions, a major increase of carrying capacity would occur outside the refuge, promoting a range expansion of the refugee population and migration towards the newly available area. This would be paired with an increase of the metapopulation size, especially in the margins of the inhabited area, as has been inferred by archeological findings for this period.22 After migration, demographic expansions would follow, and if gene flow with the former refugia is not interrupted, genetic diversity will increase.23 This is compatible with the neutrality statistics and mismatch distributions of most European populations, which bear the signal of a population expansion.8 As for the parent population, still in the refuge, it will only expand if the carrying capacity of the area rises, which can be related with the fact that Franco-Cantabrian populations have been inferred to begin their expansion in the Neolithic (after the introduction of agriculture), whereas populations from Southern-Central Europe show earlier tardiglacial expansion times.24 In these conditions, it becomes possible that a glacial refuge, due to a long-term small carrying capacity and internal extinction–recolonization processes,25 might nowadays show low genetic diversity values in comparison with outside areas repopulated from that refuge.
If our hypothesis is true, three variables should positively correlate with the diversity of expanded lineages: geographical distance to the refuge (as northern territories would have been more densely repopulated),4 effective population size (which should be regarded as an historical mean)26 and the number of private lineages of each population (which would also increase due to mutation and the assimilation of the remnants of indigenous populations that survived in cryptic refugia).12
As for the heterogeneity found even in close extant Franco-Cantabrian populations, it is not surprising in the context of the Iberian Peninsula, which is a crossroad between mainland Europe, Northern Africa, the Atlantic Ocean and the Mediterranean Sea.27 The varied Iberian geography favors the existence of barriers and isolated populations all throughout the territory, which calls for detailed studies to account for localized historical processes.28 These have to be paired with extensive sampling efforts to have a chance of identifying the historic and demographic processes behind the distribution of mtDNA lineages.29 On that basis, one of the most understudied regions of the peninsula is the Principality of Asturias, which is located precisely in the western edge of the Franco-Cantabrian refuge (Figure 1).
So far, mtDNA studies in Asturias have been clinically oriented and based on classical restriction fragment-length polymorphisms analyses30, 31 or included only a few samples as a part of broader studies.32, 33 To our knowledge, the only other sizeable population genetic study is the one from Garcia et al.,9 which used 89 samples collected in Asturias. They noted the aforementioned pattern of local drift as a possible cause for the differences between Asturias and other regions of the Franco-Cantabrian area, but a greater number of samples is likely needed to achieve an unbiased representation of Asturian mtDNA lineages.34 Recent immigration should also be taken into account to avoid admixture biases in the sample, at least to some extent. Thus, we decided to perform an extensive ancestry-restricted population-wide study. The aim of the study was to perform spatial analyses that can be used to test the hypothesis of a postglacial range expansion compatible with the present-day genetic makeup of Western Europe. For that matter, new mtDNA sequences would be obtained from a population in the border of the presumed center of the expansion, which would be used to obtain picture of the pattern of local heterogeneities and drift processes.
Materials and methods
Study area providing new data within the Franco-Cantabrian refuge
Asturias is a Spanish autonomous community of around one million inhabitants, unequally distributed between the densely populated cities of the center and many smaller scattered towns. Its territory, spanning 10 000 km2, is the most mountainous region of Spain and one of the most mountainous region of Europe, with political limits that have existed since the Roman times.35 After the Roman invasion of the peninsula (29–19 BC), no important population movements into the territory were historically recorded, as its difficult terrain kept it outside the main routes of migration into Iberia for many centuries.36 In fact, in medieval times, Asturias was described as an isolated and impoverished province on the northern margin of the Spanish kingdom, a status that hardly changed even after the Industrial Revolution.37 Only in the years following 1950, the development of coal-mining facilities in the region improved the economy and favored the immigration of entire families from other Spanish regions, shaping the Asturian demography as it is today.38
Subjects
Samples of buccal cells were obtained from 429 volunteers recruited in various cities and towns of Asturias during the period 2009–2011, who confirmed having at least two generations of autochthonous maternal ancestry (i.e., grandmothers born in Asturias). No volunteers less than 18 years of age were allowed in the study, informed consent was obtained from each sample donor, and the study conformed to the Spanish Law for Biomedical Research (Law 14/2007—3 July). Sampling was performed using swabs with Dacron tips (Deltalab, Barcelona, Spain), and a protocol based on the Chelex-100 resin (Bio-Rad, Hercules, CA, USA) was used for DNA extraction.39
Sequence amplification and analyses
From each sample, the complete hypervariable segment I (HVS-I) of the control region of the mtDNA was amplified by PCR using the primers L15900 and H00016.40 Two independent amplifications were performed to reduce the possibility of sequence ambiguities derived from low-quality bases. Sequencing was carried out in an ABI 3730xl DNA Analyzer by the company Macrogen (Seoul, Korea).
Sequences were analyzed using the software Geneious v4.85,41 in which the highest quality consensuses were obtained and aligned with the revised Cambridge Reference Sequence.42 This alignment was checked manually for phylogenetic consistence following the recommendations of Bandelt and Parson43 and the EMPOP online tool.44 Haplogroup assignment of the samples was performed with the software HaploGrep,45 using version 13 of the mtDNA phylogenetic tree as a reference.46 Samples were included in paragroups when various equally probable assignments were possible due to the absence of diagnostic mutations. All samples that obtained a HaploGrep quality index lower than 80% and all samples allocated to the HV* paragroup were analyzed further by genotyping the coding-region single-nucleotide polymorphisms adequate for restriction fragment-length polymorphism assays,47 to assure the accuracy of their phylogenetic classification. Also, all samples allocated to the H* paragroup were restriction fragment-length polymorphism-tested for the presence of diagnostic mutations for the H1 and H3 haplogroups.48
Additional data acquisition
Asturian sequences were compared with an HVS-I data set of 9539 sequences from 45 European and North African populations (Supplementary Table 1). To include the maximum number of sequences in this data set, all of them were trimmed to a length of 342 nucleotides between positions 16 024 and 16 366 of the mitochondrial genome. This range is sufficient for our analyses, as it includes the most polymorphic interval of the entire hypervariable region.49
Data for the diversity of the haplogroups H1, H3, K, T2b, W, HV0 and V were computed from the Asturian haplotypes and obtained from the Garcia et al. study9 for nine European regions and Northern Africa, including the Franco-Cantabrian area. Values for the θK and KP parameters were also computed for these regions using the most approximate populations of our HVS-I data set (Supplementary Table S3), with the aim of performing model fitting on the diversity data. For Central and Eastern European populations not present in our data set, estimates previously computed by Helgason et al.50 were used.
Statistical analyses
The software Arlequin v3.51251 was used to obtain estimates of the haplotype and nucleotide diversities described by Nei52 and Tajima,53 respectively. Also, the number of private lineages found in each population was obtained, defined as the number of lineages that are only found in a given population when this is compared with all the others of our HVS-I data set. Comparison of the haplotype diversities of the pairs of populations was performed with the Test_h_diff software,54 which implements a Bayesian procedure. Differentiation between populations was assessed in terms of ΦST pairwise values, which were computed using the Tamura and Nei55 distance between sequences, with an unequal transition/transversion weighting of 4/5.56 To depict population relationships in a simpler manner, Slatkin57 generalized distances that were computed from the pairwise ΦST values were plotted into a two-dimensional multidimensional scaling, with the algorithm ALSCAL included in the software SPSS v17.58
The Watterson59 estimator θK (also called ‘population-mutation rate parameter’) was also computed according to the Ewens sampling formula.60 This value is an approximation, based on the number of haplotypes, to the widely known composite 2Nef·m, in which Nef is the effective number of females of the population and m is the mutation rate per nucleotide of the HVS-I region. Previous studies have defined the existence of a positive, significant correlation between θK and the proportion of private lineages of a group of populations.50, 61 This is because private lineages arise by mutation of a previously existing haplotype, which increases the θK estimates under the theoretical framework of an infinite-allele mutation model.
To properly compare different populations on the basis of their diversity patterns, sampling coverage estimates (i.e., the proportion of the true number of different haplotypes that have actually been sampled) in each population were calculated with the R statistical framework62 using the unseen2 package, which incorporates the formulae developed by Egeland and Salas63 for this matter. This same framework was also used to fit linear models and to compute Moran autocorrelation indexes for some statistics, the latter using package ape.64 Spatial mappings of population genetic statistics were performed with the Quantum GIS v1.73 software,65 whereas interpolation routines shown in the graphics were performed with the Kriging algorithm of the SDA4PP plugin.66 Distances between two populations, when needed, were computed from geographical coordinates using GDMG v1.23.67
Estimation of migration rates between populations was performed with the software MIGRATE-N v3.216.68 For each model that we tested, we used 80 random sequences from each included population and executed 5 separate runs under the Bayesian implementation. Each analysis involved two long Markov chains of 10 million generations, with a standard uniform slice priors on the M and θ parameters. A static heating scheme with four chains and standard temperatures was also used. Model likelihood was obtained by a thermodynamic integration with Bezier approximation, as implemented in the software. Finally, the median parameter values of each run were compared with the 95% credibility intervals (95% CI) of all other runs to ensure their fit and remove outliers if any.
Results
Haplogroup composition and diversity of the Asturian population
The HVS-I sequences from the Asturian volunteers, along with their haplogroup assignment, can be found in Supplementary Table 2. A total of 63 different haplogroups were identified in the sample, and their frequencies are shown in Table 1. The haplogroup composition of Asturias is dominated by nearly a 50% of the haplogroup H, as could be expected given a Southern European Atlantic location. A previous study highlighted the high frequency of haplogroup J in the region, which amounts to a 10.51% in our results (95% CI 7.85–13.71) and seems to be a local peak in respect to neighboring populations.9 About other haplogroups commonly encountered in the Franco-Cantabrian area, another peak appears in the frequency of T2b, which is a 4.66% of the sample (95% CI 2.94–7.04). Interestingly, this frequency is higher than those reported on the neighboring regions of Galicia, Leon and Cantabria, nearly reaching the 5%, which is common in the Mediterranean.69 In contrast, H1, HV, V and X haplogroups seem to experience local minimums in Asturias.
Some non-European haplogroups, of Near Eastern, African and Asian affinities or origins, were also found: M1, N1b, L3f1b4a and D4g1. Although finding such lineages is rare outside of their proposed areas of origin, they have been reported in other populations from the Iberian Peninsula28 and appear sporadically in European surveys in frequencies around 1% or less, as it is the case for Asturias.70
More detailed information can be inferred by observing the raw Slatkin ΦST matrix, which is in Supplementary Table 5. Regarding Asturias, this index is statistically significant for its comparisons with all other Spanish populations, with the exception of the Balearic Islands, but this could be an effect of their small sample size. It has to be noted that the regions immediately to the west and east of Asturias, Galicia and Cantabria, respectively (Figure 1), have low ΦST values of 0.003 and 0.004, whereas the value for the southern neighboring region, Leon (Figure 1), reaches 0.008. This is an even higher value than those estimated between Asturias and every region of the British Isles, with the exception of Orkney and the Hebrides. Also, a significant difference in haplotype diversity (P=0.002) between Asturias and Leon was found, which did not appear in comparisons with Galicia, Cantabria or even England. This suggests a relationship with the north (British regions), which can be seen also in the case of Galicia and Cantabria, without the steep differences with the south (Leon) in terms of diversity or ΦST. Graphically, the pattern can also be seen in the multidimensional scaling plot (Figure 2), in which the Asturian sample (ES-AS, marked with an arrow) is located in the edge of the Iberian group and farthest from the African group, surrounded by British and Scandinavian samples. As a sidenote, in this study, the term ‘Iberian’ includes the Balearic Islands and the French Province of Béarn (Figure 1) because of their deep historical relationship.71, 72
Effective population sizes and private lineages in Iberia
Estimates of sampling coverage, nucleotide and haplotype diversity, the θK parameter and the percentage of private lineages (KP) for each population of the HVS-I data set can be found in Table 2. We obtained a positive and significant correlation r=0.683 (P<0.001) between θK and the number of private lineages for the whole HVS-I data set and r=0.649 (P=0.006) for only the Iberian regions. The latter is graphically shown in Figure 3 along with the 95% confidence interval of the linear regression line, to highlight the different population groups that exist in territory of the Iberian Peninsula. Two of the populations of the Franco-Cantabrian refuge, Cantabria and Navarra, are outside the 95% CI because of a scarcity of private lineages, which is typical of areas in which both inward and outward migrations have been historically high.50 Asturias, on the contrary, is in the upper half, but outside the regression 95% CI for having an excess of private lineages relative to its effective number of females. It is, in fact, a population remarkable for having the highest proportion of private lineages of all our Iberian data set.
Spatial analysis of European populations
Spatial maps of the θK and KP values of the populations of our HVS-I data set can be found in Figures 4 and 5. As it can be seen, both statistics show a cline with the lowest values in the Franco-Cantabrian refuge area. As the θK and KP values were correlated (r2=0.458; P-value=0.018), only the most explanatory variable was used in the models to avoid collinearity. Diversity of the H1 haplogroup was fitted into a linear model that used the distance to the Franco-Cantabrian refuge and the θK value as predictors (r2=0.569; P-value=0.022). The same two predictors were adequate for fitting the diversity of the K haplogroup (r2=0.587; P-value=0.012). Diversity of the H3 haplogroup was fitted by using the distance to the refuge and the percentage of private lineages as predictors (r2=0.491; P-value=0.039). As for the V haplogroup, diversity in Asturias was much higher than in other Franco-Cantabrian areas (0.806 vs 0.653), which hindered the fit of any model that considered it. After removing this outlier value, diversity of the V haplogroup for the rest of the data set could be fitted to a high goodness by using a quadratic model that considered the distance to the refuge and the percentage of private lineages (r2=0.883; P-value=0.009). Diversities of the T2b, W and HV0 haplogroups could not be significantly fitted to a model containing any of the aforementioned variables. Plots of these models can be found in Supplementary Figures 1 to 4.
To further assess the population dynamics of the Franco-Cantabrian populations, we computed a MIGRATE analysis on seven Iberian, four French and one British population. All of them were inside or near the Franco-Cantabrian area. First, the likelihood of five models with differing levels of migration and panmixia were calculated, including a full model to serve as a reference (Supplementary Table 4). With the likelihood values, a model was chosen according to the Bayes factor criterion.73 This model supported the division of the Franco-Cantabrian refuge in two different areas: Asturias–Cantabria and Basques–Navarra–Bèarn, with panmictic populations inside each area. The migration matrix was constrained so that migration was only possible between directly adjacent areas, and this can be graphically seen in Figure 6. Reported values of migration parameters are their averages over the five runs, as no outlier results were found. They were rounded to integers, as they represent the effective number of migrants per generation (Nef·m). In all the regions tested, inward–outward gene flow was nearly symmetrical with the exception of trans-Pyrenean migrations, which were greater towards France. Migration from the Basque populations was nearly six times higher towards Catalonia and France (60) than towards Asturias and Cantabria (11).
Discussion
Population dynamics around the Franco-Cantabrian refuge
Both θK and KP calculated from our HVS-I data set show a cline in Europe (Figures 4 and 5), in which the lower values are coincident with the Franco-Cantabrian populations. This pattern is opposite to the frequency clines shown by haplogroups H1, H3 and V in earlier studies.7, 74, 75
In our analysis, diversities of H1, H3 and K can be explained with simple linear models using only two of the aforementioned covariates. The fit is hindered by some outlier populations (Supplementary Figures 1–3), but this fact can be related to local drift processes, such as those previously inferred in some regions from France76 or Scandinavia,77 and does not disrupt the general trend. Also, the historical migration pattern that enhanced the diversity of these haplogroups during the range expansion can be seen in the results of the MIGRATE analysis (Figure 6). The result of haplogroup K is particularly noteworthy, as its coalescence in Europe has been dated to the Initial Upper Paleolithic and not to postglacial times, as is the case for the other haplogroups.4 Nevertheless, its involvement in a postglacial repopulation was already proposed on the basis of the star-like mutational pattern of its common haplotypes.78
As for the diversity of the V haplogroup, a very good fit was obtained using a quadratic model (not accounting for the Asturian data), where increases of predictors are correlated with increases of diversity up to a point. This accounts for the founder effects that constrained the diversity of distant northern European populations,79 and is consistent with our interpretation of the refuge effect in Southern-Central Europe (Supplementary Figure 4). The outlier result of Asturias is probably a reflection of the heterogeneity of extant northern Iberian populations, which is discussed in the in-depth analysis of its sample. As for the high diversity of southern Iberian populations, it could be due to long-term contacts with Berber populations across the Gibraltar Strait,80 or be a signal of the proposed Iberian origin of this haplogroup.5
The Asturian edge of the Franco-Cantabrian area
In the genetic landscape of the northwestern Iberian Peninsula, a clear outlier in terms of effective size and percentage of private lineages can be found in Asturias (Figure 3). Such an outlier result could be explained by at least three basic causes:50 demographic isolation, significant gene flow from unsampled regions and/or higher sampling coverage than other regions. Before discussing the first two, it should be said that the latter is unlikely, given the sampling coverage estimated for the different Iberian regions, as most populations have been equally or more thoroughly sampled than Asturias, without finding a higher proportion of private lineages in them (Table 2).
The historical isolation of this region, coupled with its complex orography, is well documented. Consequently, it is reasonable to deduce such feature as the most probable cause of its great proportion of private lineages, probably because of internal genetic structuring or microdifferentiation, in a manner similar to cryptically subdivided populations.81 Also, particularly high inbreeding values for the region were inferred in a previous isonymy study.82 For the female case, ΦST pairwise P-values show that the mitochondrial pool of Asturians is statistically different from most of the other Iberian regions. However, the greater neighboring divergence, not counting the isolated Pasiegos and Maragatos, happens with Leon at the south. Significant differences in haplotype diversities between these two populations are consistent with a geographical–genetical structuring process, as it indicates that most of the differences between them would be related to mutational distances between non-shared haplotypes.83 Such a process could be promoted by a genetic drift, aided by a barrier that inhibits, at least partially, the flow of haplotypes after their apparition. A candidate barrier, in this case, would be the Cantabrian Mountain Range of the southern part of Asturias, a loftier and most intricate mountainous system called the Asturian Massif, which is the political border between the Province of Asturias and the autonomous community of Castilla–Leon.84
Our migration analysis also suggests restricted gene flow from the Basque population group. This is reminiscent of the distinct and fragmented mitochondrial pools that have been found in populations from different timeframes throughout the Franco-Cantabrian area.18 Nevertheless, there is no clear east–west spatial barrier in the geography of northern Spain; and archeological, climatological and paleontological contexts of Asturias, Cantabria and the Basque Country were all very similar during the Upper Paleolithic.85, 86 Alternative explanations for this low historical migration rate should be sought with a more in-depth regional study, in which relatively recent timeframes should be explicitly considered.87
The second explanation for an excess of private lineages in Asturias, gene flow with unsampled regions, cannot be discarded but does not seem to be the main cause, as previous studies did not show significant relationships between Spain and regions not present in our database, such as the eastern countries.88 In any case, lineages that reached Iberia by ancient southward migrations should be more or less homogenously distributed throughout it, so any bias in our results due to unrecognized shared haplotypes is probably very small.
Regarding similarities with other European regions, the ΦST patterns observed in the multidimensional scaling plot are consistent with the results of the migration analysis and suggest a close relationship between the northwestern Iberian coast and the British Isles, in accordance to the genetic makeup of the Atlantic façade evidenced in other studies.89 Neolithic coastal dispersions could be a parsimonious explanation,4 without excluding some input of ‘British Celtic’ markers like the haplogroup J2b1a (2.10% of Asturian sample; 95% CI 1.05–3.88), which is also shared with Mediterranean populations.90
Conclusions
The spatial patterns of classical population genetic statistics seem to correlate well with the diversity of haplogroups which were proposed to participate in the repopulation of Europe during the Magdalenian period. These diversities are low in the former territory of the Franco-Cantabrian refuge, which can be interpreted in the context of a spatially restricted refuge with small carrying capacity. A recolonization process in which the populations greatly expanded after arriving to the new territories would explain the positive association of diversity and distance to the refuge. In this way, diversity measures can be reconciled with previous interpretations based on haplogroup frequency distributions and paleontological findings. Nevertheless, any model that can be proposed based on extant DNA data will have to be tested by the analysis of ancient DNA of Franco-Cantabrian populations. Only such a direct approach can be used to find if the now common European haplogroups were actually present in the gene pool of Paleolithic Europeans and clarify their spatial distribution. In this regard, we have to highlight the recent study of Hervella et al.,18 which has been the first report of haplogroup H among Magdalenian Iberians.
Also, this study was a first attempt to characterize the mitochondrial gene pool of the Asturian population, by performing direct sequencing of the hypervariable HVS-I of the mtDNA control region. Regional peculiarities highlight a partial isolation of the Asturian mitochondrial pool that differentiates it from the rest of Franco-Cantabrian populations, with an unexpected high number of private lineages that would be compatible with a certain degree of genetic structuring inside the population. It seems that the complex history of the Franco-Cantabrian region might have left signals in the genetic makeup of its populations that have not been entirely uncovered yet.
References
Torroni, A., Achilli, A., Macaulay, V., Richards, M., Bandelt, H. J. Harvesting the fruit of the human mtDNA tree. Trends Genet. 22, 339–345 (2006).
Richards, M., Macaulay, V., Torroni, A., Bandelt, H. -J. In search of geographical patterns in European mitochondrial DNA. Am. J. Hum. Genet. 71, 1168–1174 (2002).
Forster, P. Ice Ages and the mitochondrial DNA chronology of human dispersals: a review. Philos. Trans. R Soc. Lond. B Biol. Sci. 359, 255 (2004).
Soares, P., Achilli, A., Semino, O., Davies, W., Macaulay, V., Bandelt, H. -J. et al. The archaeogenetics of Europe. Curr. Biol. 20, R174–R183 (2010).
Achilli, A., Rengo, C., Magri, C., Battaglia, V., Olivieri, A., Scozzari, R. et al. The molecular dissection of mtDNA haplogroup H confirms that the Franco-Cantabrian glacial refuge was a major source for the European gene pool. Am. J. Hum. Genet. 75, 910–918 (2004).
Gamble, C. The Palaeolithic settlement of Europe, Cambridge University Press: Cambridge, UK, (1986).
Pereira, L., Richards, M., Goios, A., Alonso, A., Albarrán, C., Garcia, O. et al. High-resolution mtDNA evidence for the late-glacial resettlement of Europe from an Iberian refugium. Genome Res. 15, 19 (2005).
Álvarez-Iglesias, V., Mosquera-Miguel, A., Cerezo, M., Quintáns, B., Zarrabeitia, M. T., Cuscó, I. et al. New population and phylogenetic features of the internal variation within mitochondrial DNA macro-haplogroup R0. PLoS ONE 4, e5112 (2009).
Garcia, O., Fregel, R., Larruga, J. M., Alvarez, V., Yurrebaso, I., Cabrera, V.M. et al. Using mitochondrial DNA to test the hypothesis of a European post-glacial human recolonization from the Franco-Cantabrian refuge. Heredity 106, 37–45 (2011).
Richard, C., Pennarun, E., Kivisild, T., Tambets, K., Tolk, H. -V., Metspalu, E. et al. An mtDNA perspective of French genetic variation. Ann. Hum. Biol. 34, 68–79 (2007).
Pala, M., Achilli, A., Olivieri, A., Kashani, B. H., Perego, U. A., Sanna, D. et al. Mitochondrial haplogroup U5b3: a distant echo of the Epipaleolithic in Italy and the legacy of the early Sardinians. Am. J. Hum. Genet. 84, 814–821 (2009).
Malyarchuk, B., Derenko, M., Grzybowski, T., Perkova, M., Rogalla, U., Vanecek, T. et al. The peopling of Europe from the mitochondrial haplogroup U5 perspective. PLoS ONE 5, e10285 (2010).
Malyarchuk, B., Grzybowski, T., Derenko, M., Perkova, M., Vanecek, T., Lazur, J. et al. Mitochondrial DNA phylogeny in Eastern and Western Slavs. Mol. Biol. Evol. 25, 1651–1658 (2008).
Pala, M., Olivieri, A., Achilli, A., Accetturo, M., Metspalu, E., Reidla, M. et al. Mitochondrial DNA signals of Late Glacial recolonization of Europe from Near Eastern refugia. Am. J. Hum. Genet. 90, 915–924 (2012).
Morin, E. Evidence for declines in human population densities during the early Upper Paleolithic in western Europe. Proc. Natl Acad. Sci. 105, 48–53 (2008).
Cavalli-Sforza, L. L., Menozzi, P., Piazza, A. The History and Geography of Human Genes, Princeton University Press: Princeton, USA, (1994).
Banks, W. E., d’Errico, F., Peterson, A. T., Vanhaeren, M., Kageyama, M., Sepulchre, P. et al. Human ecological niches and ranges during the LGM in Europe derived from an application of eco-cultural niche modeling. J. Archaeol. Sci. 35, 481–491 (2008).
Hervella, M., Izagirre, N., Alonso, S., Fregel, R., Alonso, A., Cabrera, V. M. et al. Ancient DNA from Hunter-Gatherer and Farmer groups from Northern Spain supports a random dispersion model for the Neolithic expansion into Europe. PLoS ONE 7, e34417 (2012).
Aguilée, R., Claessen, D., Lambert, A. Allele fixation in a dynamic metapopulation: founder effects vs refuge effects. Theor. Popul. Biol. 76, 105–117 (2009).
Arenas, M., Ray, N., Currat, M., Excoffier, L. Consequences of range contractions and range shifts on molecular diversity. Mol. Biol. Evol. 29, 207–218 (2011).
Banks, W. E., Zilhão, J., d’Errico, F., Kageyama, M., Sima, A., Ronchitelli, A. Investigating links between ecology and bifacial tool types in Western Europe during the Last Glacial Maximum. J. Archaeol. Sci. 36, 2853–2867 (2009).
Gamble, C., Davies, W., Pettitt, P., Hazelwood, L., Richards, M. The archaeological and genetic foundations of the European population during the Late Glacial: implications for ‘agricultural thinking’. Cambridge Archaeol. J. 15, 193–223 (2005).
Excoffier, L., Foll, M., Petit, R. J. Genetic consequences of range expansions. Annu. Rev. Ecol. Evol. Syst. 40, 481–501 (2009).
Shi, W., Ayub, Q., Vermeulen, M., Shao, R., Zuniga, S., Van Der Gaag, K. et al. A worldwide survey of human male demographic history based on Y-SNP and Y-STR data from the HGDP–CEPH populations. Mol. Biol. Evol. 27, 385–393 (2010).
Slatkin, M. Gene flow and genetic drift in a species subject to frequent local extinctions. Theor. Popul. Biol. 12, 253–262 (1977).
Charlesworth, B. Effective population size and patterns of molecular evolution and variation. Nat. Rev. Genet. 10, 195–205 (2009).
Dietler, M., López-Ruiz, C. eds. Colonial encounters in Ancient Iberia: Phoenician. Greek and Indigenous relations, University of Chicago Press: Chicago, USA, (2009).
Alvarez, L., Santos, C., Ramos, A., Pratdesaba, R., Francalacci, P., Aluja, M.P. Mitochondrial DNA patterns in the Iberian Northern plateau: population dynamics and substructure of the Zamora province. Am. J. Phys. Anthropol. 142, 531–539 (2010).
Cardoso, S., Alfonso-Sánchez, M. A., Valverde, L., Odriozola, A., Pérez-Miranda, A.M., Peña, J. A. et al. The maternal legacy of Basques in northern navarre: new insights into the mitochondrial DNA diversity of the Franco-Cantabrian area. Am. J. Phys. Anthropol. 145, 480–488 (2011).
Castro, M. G., Huerta, C., Reguero, J. R., Soto, M. I., Domenech, E., Alvarez, V. et al. Mitochondrial DNA haplogroups in Spanish patients with hypertrophic cardiomyopathy. Int. J. Cardiol. 112, 202–206 (2006).
Palacín, M., Alvarez, V., Martín, M., Díaz, M., Corao, A. I., Alonso, B. et al. Mitochondrial DNA and TFAM gene variation in early-onset myocardial infarction: evidence for an association to haplogroup H. Mitochondrion 11, 176–181 (2011).
Alvarez, J.C., Johnson, D.L.E., Lorente, J.A., Martinez-Espin, E., Martinez-Gonzalez, L.J., Allard, M. et al. Characterization of human control region sequences for Spanish individuals in a forensic mtDNA data set. Leg. Med. 9, 293–304 (2007).
Prieto, L., Zimmermann, B., Goios, A., Rodriguez-Monge, A., Paneto, G. G., Alves, C. et al. The GHEP-EMPOP collaboration on mtDNA population data--A new resource for forensic casework. Forensic Sci. Int. Genet. 5, 146–151 (2011).
Pereira, L., Cunha, C., Amorim, A. Predicting sampling saturation of mtDNA haplotypes: an application to an enlarged Portuguese database. Int. J. Legal Med. 118, 132–136 (2004).
Gómez-Pellón, E. Aproximación al estudio antropológico de Asturias. Rev. Antropol. Soc. 0, 31–63 (1991).
Santos, F. D. Historia de Asturias: Asturias romana y visigoda, Ayalga: Oviedo, Spain, (1977).
Puerta, M. C. La Asturias Medieval In: Fernández-Pérez A., Friera-Suárez F. eds. Historia de Asturias, KRK: Oviedo, Spain, (2005).
Infiesta, V. R. Asturias en los siglos XX y XXI In: Fernández-Pérez A., Friera-Suárez F. eds. Historia de Asturias, KRK: Oviedo, Spain, (2005).
Walsh, P. S., Metzger, D. A., Higuchi, R. Chelex 100 as a medium for simple extraction of DNA for PCR-based typing from forensic material. Biotechniques 10, 506–513 (1991).
Brandstätter, A., Niederstätter, H., Pavlic, M., Grubwieser, P., Parson, W. Generating population data for the EMPOP database—an overview of the mtDNA sequencing and data evaluation processes considering 273 Austrian control region sequences as example. Forensic Sci. Int. 166, 164–175 (2007).
Drummond, A., Ashton, B., Cheung, M., Heled, J., Kearse, M., Moir, R. et al. Geneious v4.85, Biomatters Ltd: Auckland, New Zealand, (2009).
Andrews, R. M., Kubacka, I., Chinnery, P. F., Lightowlers, R. N., Turnbull, D. M., Howell, N. Reanalysis and revision of the Cambridge reference sequence for human mitochondrial DNA. Nat. Genet. 23, 147 (1999).
Bandelt, H. -J., Parson, W. Consistent treatment of length variants in the human mtDNA control region: a reappraisal. Int. J. Legal Med. 122, 11–21 (2008).
Parson, W., Dür, A. EMPOP—a forensic mtDNA database. Forensic Sci. Int. Genet. 1, 88–92 (2007).
Kloss-Brandstätter, A., Pacher, D., Schönherr, S., Weissensteiner, H., Binna, R., Specht, G. et al. HaploGrep: a fast and reliable algorithm for automatic classification of mitochondrial DNA haplogroups. Hum. Mutat. 32, 25–32 (2011).
van Oven, M., Kayser, M. Updated comprehensive phylogenetic tree of global human mitochondrial DNA variation. Hum. Mutat. 30, E386–E394 (2009).
Pardiñas, A., Dopico, E., Roca, A., Garcia-Vazquez, E., Lopez, B. Introducing human population biology through an easy laboratory exercise on mitochondrial DNA. Biochem. Mol. Biol. Educ. 38, 110–115 (2010).
Ennafaa, H., Cabrera, V. M., Abu-Amero, K. K., González, A. M., Amor, M. B., Bouhaha, R. et al. Mitochondrial DNA haplogroup H structure in North Africa. BMC Genet. 10, 8 (2009).
Soares, P., Ermini, L., Thomson, N., Mormina, M., Rito, T., Röhl, A. et al. Correcting for purifying selection: an improved human mitochondrial molecular clock. Am. J. Hum. Genet. 84, 740–759 (2009).
Helgason, A., Hickey, E., Goodacre, S., Bosnes, V., Stefánsson, K., Ward, R. et al. mtDNA and the islands of the North Atlantic: estimating the proportions of Norse and Gaelic ancestry. Am. J. Hum. Genet. 68, 723–737 (2001).
Excoffier, L., Lischer, H.E.L. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol. Ecol. Res. 10, 564–567 (2010).
Nei, M. Molecular Evolutionary Genetics, Columbia University Press: New York, USA, (1987).
Tajima, F. Measurement of DNA polymorphism. In: Mechanisms of Molecular Evolution Introduction to Molecular Paleopopulation Biology eds Takahata N., Clark A. G. 37–39 Sinauer Associates, Inc: Sunderland, UK, (1993).
Weale, M. E. Testing for differences in H between two populations. http://www.ucl.ac.uk/mace-lab/resources/software. (University College London: London, UK, (2003).
Tamura, K., Nei, M. Estimation of the number of nucleotide substitutions in the control region of mtDNA in humans and chimpanzees. Mol. Biol. Evol. 10, 512–526 (1993).
Rubinstein, S., Dulik, M. C., Gokcumen, O., Zhadanov, S., Osipova, L., Cocca, M. et al. Russian Old Believers: genetic consequences of their persecution and exile, as shown by mitochondrial DNA evidence. Hum. Biol. 80, 203–237 (2008).
Slatkin, M. Isolation by distance in equilibrium and non-equilibrium populations. Evolution 47, 264–279 (1993).
SPSS Inc.. SPSS Statistics v17 for Windows, SPSS Inc: Chicago, USA, (2008).
Watterson, G. A. On the number of segregating sites in genetical models without recombination. Theor. Popul. Biol. 7, 256–276 (1975).
Ewens, W. J. The sampling theory of selectively neutral alleles. Theor. Popul. Biol. 3, 87–112 (1972).
Goodacre, S., Helgason, A., Nicholson, J., Southam, L., Ferguson, L., Hickey, E. et al. Genetic evidence for a family-based Scandinavian settlement of Shetland and Orkney during the Viking periods. Heredity 95, 129–135 (2005).
R Development Core Team.. R: A language and environment for statistical computing, R Foundation for Statistical Computing: Vienna, Austria, (2010).
Egeland, T., Salas, A. Estimating haplotype frequency and coverage of databases. PLoS ONE 3, e3988 (2008).
Paradis, E., Claude, J., Strimmer, K. APE: analyses of phylogenetics and evolution in R language. Bioinformatics 20, 289–290 (2004).
Quantum GIS Development Team. Quantum GIS Geographic Information System, http://qgis.osgeo.org. (Open Source Geospatial Foundation Project, (2011).
Kepoglu, V. Spatial Data Analysis Plugin for Point Pattern. http://code.google.com/p/. (Middle East Technical University: Ankara, Turkey, (2009).
Ersts, P.J. Geographic Distance Matrix Generator version 1.23. http://biodiversityinformatics.amnh.org/open_source/gdmg . American Museum of Natural History (Center for Biodiversity and Conservation: New York, USA, (2012).
Beerli, P. How to use migrate or why are Markov Chain Monte Carlo programs difficult to use?. In: Population Genetics for Animal Conservation eds Bertorelle G., Bruford M. W., Hauffe H. C., Rizzoli A., Vernesi C. 42–79 Cambridge University Press: Cambridge, UK, (2009).
Ottoni, C., Martinez-Labarga, C., Vitelli, L., Scano, G., Fabrini, E., Contini, I. et al. Human mitochondrial DNA variation in Southern Italy. Ann. Hum. Biol. 36, 785–811 (2009).
Irwin, J., Egyed, B., Saunier, J., Szamosi, G., O’Callaghan, J., Padar, Z. et al. Hungarian mtDNA population databases from Budapest and the Baranya county Roma. Int. J. Legal Med. 121, 377–383 (2007).
Rodríguez-Ezpeleta, N., Álvarez-Busto, J., Imaz, L., Regueiro, M., Azcárate, M., Bilbao, R. et al. High-density SNP genotyping detects homogeneity of Spanish and French Basques, and confirms their genomic distinctiveness from other European populations. Hum. Genet. 128, 113–117 (2010).
Ayuso, V. M. G. The Balearic Islands: prehistoric colonization of the furthest Mediterranean islands from the mainland. J. Mediterr. Archaeol. 14, 136–158 (2002).
Goodman, S. N. Toward evidence-based medical statistics. 2: The Bayes factor. Ann. Intern. Med. 130, 1005–1013 (1999).
Torroni, A., Bandelt, H. J., D'Urbano, L., Lahermo, P., Moral, P., Sellitto, D. et al. mtDNA analysis reveals a major late Paleolithic population expansion from southwestern to northeastern Europe. Am. J. Hum. Genet. 62, 1137–1152 (1998).
Torroni, A., Bandelt, H. -J., Macaulay, V., Richards, M., Cruciani, F., Rengo, C. et al. A signal, from human mtDNA, of postglacial recolonization in Europe. Am. J. Hum. Genet. 69, 844–852 (2001).
Dubut, V., Chollet, L., Murail, P., Cartault, F., Béraud-Colomb, E., Serre, M. et al. mtDNA polymorphisms in five French groups: importance of regional sampling. Eur. J. Hum. Genet. 12, 293–300 (2003).
Helgason, A., Nicholson, G., Stefánsson, K., Donnelly, P. A reassessment of genetic diversity in Icelanders: strong evidence from multiple loci for relative homogeneity caused by genetic drift. Ann. Hum. Genet. 67 (4), 281–297 (2003).
Richards, M., Corte-Real, H., Forster, P., Macaulay, V., Wilkinson-Herbots, H., Demaine, A. et al. Paleolithic and neolithic lineages in the European mitochondrial gene pool. Am. J. Hum. Genet. 59, 185 (1996).
Tambets, K., Rootsi, S., Kivisild, T., Help, H., Serk, P., Loogväli, E. -L. et al. The Western and Eastern roots of the Saami—the story of genetic “Outliers” told by mitochondrial DNA and Y-chromosomes. Am. J. Hum. Genet. 74, 661–682 (2004).
Coudray, C., Olivieri, A., Achilli, A., Pala, M., Melhaoui, M., Cherkaoui, M. et al. The Complex and Diversified Mitochondrial Gene Pool of Berber Populations. Ann. Hum. Genet. 73, 196–214 (2009).
Carvalho-Silva, D. R., Tyler-Smith, C. The grandest genetic experiment ever performed on man? A Y-chromosomal perspective on genetic variation in India. Int. J. Hum. Genet. 8, 21–29 (2008).
Rodriguez-Larralde, A., Gonzales-Martin, A., Scapoli, C., Barrai, I. The names of Spain: A study of the isonymy structure of Spain. Am. J. Phys. Anthropol. 121, 280–292 (2003).
Pons, O., Petit, R. Measuring and testing genetic differentiation with ordered versus unordered alleles. Genetics 144, 1237 (1996).
Muñoz-Jimenez, J. Geografía de Asturias 1: Geografía Física. El relieve, el clima y las aguas, Ayalga: Oviedo, Spain, (1982).
Sánchez, M.J., Arquer, P. F. New radiometric and geomorphologic evidences of a last glacial maximum older than 18 ka in SW European mountains: the example of Redes Natural Park (Cantabrian Mountains, NW Spain). Geodin. Acta. 15, 93–101 (2002).
Straus, L.G. Were there human responses to Younger Dryas in Cantabrian Spain? Quatern. Int. 242, 328–335 (2011).
Melchior, L., Lynnerup, N., Siegismund, H. R., Kivisild, T., Dissing, J. Genetic Diversity among Ancient Nordic Populations. PLoS ONE. 5, e11898 (2010).
Karachanak, S., Carossa, V., Nesheva, D., Olivieri, A., Pala, M., Hooshiar Kashani, B. et al. Bulgarians vs the other European populations: a mitochondrial DNA perspective. Int. J. Legal Med. 126, 497–503 (2011).
McEvoy, B., Richards, M., Forster, P., Bradley, D. G. The Longue Duree of genetic ancestry: multiple genetic marker systems and Celtic origins on the Atlantic facade of Europe. Am. J. Hum. Genet. 75, 693–702 (2004).
Forster, P., Romano, V., Calì, F., Röhl, A., Hurles, M. MtDNA markers for Celtic and Germanic language areas in the British Isles. In: Traces of Ancestry: Studies in Honour of Colin Renfrew ed. Jones M. 99–111 McDonald Institute for Archaeological Research: Oxford, UK, (2004).
Acknowledgements
Antonio F. Pardiñas is currently supported by a ‘Severo Ochoa’ FICYT-PCTI Grant from the Asturias Regional Government (ref. BP09038). We are grateful to all the donors who volunteered for sampling and to all those who helped during the organization of the sampling campaign, especially the mayors of Llanes and Ribadesella. We are also grateful to Dr Daniel Campo and Dr Hannes Schroeder for helpful comments, which greatly improved the manuscript. This research received support from the SYNTHESYS Project (http://www.synthesys.info/), which is financed by the European Community Research Infrastructure Action under the FP7 ‘Capacities’ Program.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
The authors declare no conflict of interest.
Additional information
Supplementary Information accompanies the paper on Journal of Human Genetics website
Rights and permissions
About this article
Cite this article
Pardiñas, A., Roca, A., Garcia-Vazquez, E. et al. Mitochondrial diversity patterns and the Magdalenian resettlement of Europe: new insights from the edge of the Franco-Cantabrian refuge. J Hum Genet 57, 717–726 (2012). https://doi.org/10.1038/jhg.2012.100
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/jhg.2012.100