Surveying the genome and constructing a high-density genetic map of napiergrass (Cenchrus purpureus Schumach)

Paudel, Dev; Kannan, Baskaran; Yang, Xiping; Harris-Shultz, Karen; Thudi, Mahendar; Varshney, Rajeev K.; Altpeter, Fredy; Wang, Jianping

doi:10.1038/s41598-018-32674-x

Download PDF

Article
Open access
Published: 26 September 2018

Surveying the genome and constructing a high-density genetic map of napiergrass (Cenchrus purpureus Schumach)

Scientific Reports volume 8, Article number: 14419 (2018) Cite this article

3173 Accesses
14 Citations
7 Altmetric
Metrics details

Subjects

Abstract

Napiergrass (Cenchrus purpureus Schumach) is a tropical forage grass and a promising lignocellulosic biofuel feedstock due to its high biomass yield, persistence, and nutritive value. However, its utilization for breeding has lagged behind other crops due to limited genetic and genomic resources. In this study, next-generation sequencing was first used to survey the genome of napiergrass. Napiergrass sequences displayed high synteny to the pearl millet genome and showed expansions in the pearl millet genome along with genomic rearrangements between the two genomes. An average repeat content of 27.5% was observed in napiergrass including 5,339 simple sequence repeats (SSRs). Furthermore, to construct a high-density genetic map of napiergrass, genotyping-by-sequencing (GBS) was employed in a bi-parental population of 185 F₁ hybrids. A total of 512 million high quality reads were generated and 287,093 SNPs were called by using multiple de-novo and reference-based SNP callers. Single dose SNPs were used to construct the first high-density linkage map that resulted in 1,913 SNPs mapped to 14 linkage groups, spanning a length of 1,410 cM and a density of 1 marker per 0.73 cM. This map can be used for many further genetic and genomic studies in napiergrass and related species.

Genotyping by sequencing provides new insights into the diversity of Napier grass (Cenchrus purpureus) and reveals variation in genome-wide LD patterns between collections

Article Open access 06 May 2019

High Density Genetic Maps of Seashore Paspalum Using Genotyping-By-Sequencing and Their Relationship to The Sorghum Bicolor Genome

Article Open access 21 August 2019

Genome-wide SNP identification in Prunus rootstocks germplasm collections using Genotyping-by-Sequencing: phylogenetic analysis, distribution of SNPs and prediction of their effect on gene function

Article Open access 30 January 2020

Introduction

Napiergrass (Cenchrus purpureus Schumach., syn. Pennisetum purpureum Schumach), also known as elephant grass, is a tropical perennial grass native to eastern and central Africa. It is cultivated primarily for forage and widely used by smallholder dairy farmers due to its high growth rate, leaf nutritive value, perennial nature, persistence, ease of propagation, and broad adaptation^1,2,3,4. As a C₄ grass species, napiergrass is a promising candidate feedstock for biofuel production due to its superior yield of biomass^5,6,7. Napiergrass cultivars are typically developed from natural out-crossings^1,8. It is an allotetraploid (2n = 4x = 28, A’A’BB)⁹ with an average amount of DNA per G₁ nucleus of 5.78 pg¹⁰. The chromosomes in the A’ genome of napiergrass are believed to be homologous to the A genome of pearl millet (Pennisetum glaucum, 2n = 2x = 14, AA)⁹. Pearl millet and napiergrass form a monophyletic group¹¹ and were initially classified as primary and secondary gene pool of the genus Pennisetum, respectively^11,12. Recently, species of Pennisetum and Odontelytrum were transferred to the unified genus Cenchrus⁴. Pearl millet and napiergrass can hybridize to produce hybrids called kinggrass¹³ or Pearl Millet-Napiergrass (PMN) hybrids^14,15,16. These hybrids are sterile due to triploidy (2n = 3x = 21, AA’B genome)¹⁷, thus preventing the unintended spreading into natural areas or other cropping systems by wind dispersed seeds. Some PMN hybrids show high heterosis for biomass yield and forage quality while the perennial, persistent nature is often reduced compared to napiergrass³.

The targeted improvement of napiergrass includes identification of agronomically superior genotypes and studies assessing genetic diversity and relatedness using random amplification of polymorphic DNA (RAPD), amplified fragment length polymorphism (AFLP), isozymes, and simple sequence repeats (SSRs) developed for other species like pearl millet and buffelgrass (Pennisetum ciliare)^{1,13,18,19,20,21,22,23}. Other than these, genetic information on napiergrass is very meager²³. A genetic map is lacking and molecular tools are not yet deployed in napiergrass breeding programs^21,24. Development of molecular markers for detection and utilization of DNA polymorphisms will help to understand the molecular basis of various agronomic traits²⁵. Molecular breeding for yield components, flowering date, nutrient uptake, abiotic and biotic stress tolerance will accelerate genetic improvement of napiergrass. This can be greatly facilitated by having access to marker resources like SSR, single nucleotide polymorphisms (SNPs), and genetic linkage maps. SSRs as molecular markers are very advantageous because they are locus specific, multi-allelic, co-dominant, and easy to detect by polymerase chain reaction (PCR)^26,27. SNP markers have gained increasing consideration in molecular breeding and linkage map construction as they occur in a large number and high density²⁸. Access to these resources will support marker-assisted selection (MAS) by making phenotypic predictions based on the genotype²⁹.

Recently, next generation sequencing (NGS) technology has simplified linkage map construction by using high throughput genotyping-by-sequencing (GBS), which allows simultaneous SNP discovery and genotyping across the whole genome of the population of interest^29,30,31. GBS has been effective for marker discovery, genetic mapping, quantitative trait locus (QTL) analysis, population genetics, and comparative genomics studies in several diploid species and has recently gained popularity in polyploid species such as wheat (Triticum aestivum)³², switchgrass (Panicum virgatum)³³, potato (Solanum tuberosum)³⁴, and sugarcane (Saccharum spp.)³⁵ among others. However, the presence of highly similar homeologous copies of two genomes in allopolyploid species complicates SNP detection which relies on delineating true allelic SNPs from homeologous SNPs because sequences from homeologous loci mimic allelic SNPs and can introduce false-positives. Distinguishing allelic SNPs from homeologous SNPs relies on the use of high-stringency sequence read alignment, specifically uniquely aligned reads³⁶. Despite challenges of using GBS for genotyping of polyploid species, genetic mapping without a reference genome has been carried out in switchgrass by defining linkage groups with the modulated modularity clustering (MMC) method³⁷ referring to the genome of foxtail millet (Setaria italica)³³. A genetic map of wheat was constructed by using the bin-mapping procedure with homozygous genotypes of a double-haploid population³⁸. Each program for calling variants utilizes different models or algorithms to identify potential polymorphisms, therefore, multiple software programs need to be evaluated in order to identify the best SNP caller for polyploids³⁶.

Linkage maps are important tools for map-based cloning, marker-assisted breeding, QTL identification, genome organization, and comparative genomics of important species. A number of linkage maps have been constructed for several grasses including pearl millet³⁹. However, so far, napiergrass SSR markers, genetic linkage map, or reference genome assembly are lacking. The purpose of this study was to survey the napiergrass genome and to construct a high-density genetic linkage map. Here, for the first time, we have surveyed whole genome sequences in napiergrass, developed SSR markers, and constructed high-density genetic map of napiergrass to investigate its genomic and genetic architecture.

Results

Napiergrass genome survey

Approximately 211 million raw reads from Illumina and 97 thousand raw reads from 454 sequencing were subjected to a sequence quality check. After filtration and trimming, 161 million clean Illumina reads and 96,000 clean 454 reads were aligned to the pearl millet genome v1⁴⁰. A total of 62.5 million (38.8%) reads were able to align with the pearl millet genome. Polymorphisms were detected between the napiergrass and pearl millet aligned reads, of which 619,708 SNPs and 24,135 indels were identified. Most of the sequence variations (58.7% SNPs) were in intergenic regions (Supplementary Fig. S1). The clean reads were assembled into 113,789 contigs with a total size of 44.5 Mbp and a N50 of 435 bp and a GC content 43.45%. The largest 10 contigs of the sequence assembly, which ranged from 8,506 to 25,329 bp in length, were selected as representative napiergrass genome fragments. The repeat content of the 10 longest contigs ranged from 5% to 90% with an average of 27.5% and a total of 164 repetitive elements (Supplementary Table S1). Two contigs had no hits in the pearl millet genome due to a high repeat content (over 80%). The rest of the contigs had one or more large hits (>500 bp) to the pearl millet genome. The micro-synteny showed mostly collinear relationship between the genomes of the two species (Supplementary Fig. S2). However, chromosome inverted duplications were also observed in the pearl millet genome (Supplementary Fig. S3), indicating that the chromosome rearrangement occurred after the speciation of napiergrass and pearl millet. The length of stringently (>500 bp and >80% sequence similarity) aligned regions accounted for 36.3% of the examined contig sequences of napiergrass (Supplementary Table S2). The total length of the alignment was 25.1% higher in pearl millet than in napiergrass aligned regions.

From the assembled napiergrass survey sequences, 5,339 SSRs were identified. Mono- type repeats were most common in napiergrass, followed by Tri-, Di- and Tetra- type repeats (Supplementary Table S3). From these identified SSRs, 1,926 were successfully used for primer design (Supplementary Table S4). All of the primer sequences aligned to the assembly of napiergrass and 89% of the primers were uniquely aligned. On the other hand, the overall alignment rate of the primers with pearl millet genome v1⁴⁰ was 31% with 15% uniquely aligned. These SSR primers will undoubtedly serve as an abundant resource for molecular markers in napiergrass.

Genotyping-by-sequencing

To construct the linkage map for napiergrass, an F₁ bi-parental mapping population was developed, which consisted of 185 true hybrid individuals⁴¹. These hybrids were subjected to GBS. A total of 549 million raw reads were generated. After trimming and filtering, 512 million high quality reads were retained. The average number of reads per sample was 2.6 million and ranged from 44 thousand to 5.4 million reads per sample. In silico digestion of the pearl millet genome v1⁴⁰ with PstI yielded DNA fragments in the range of 170–350 bp, which suggest that an estimated average depth for the mapping population was 36.5 reads per locus per sample (Supplementary Fig. S4), which should allow us to call the SNPs confidently at most of the loci.

A total of 695,602 unique tags were identified from the clean reads generated from the mapping population by using the TASSEL de-novo UNEAK pipeline. These tags were further clustered into 182,934 non-redundant tags by CD-HIT. To examine the sequence similarity between napiergrass and other grass species, we aligned the non-redundant tags of napiergrass against several grass species with complete genome sequences including rice (Oryza sativa) (Osativa_323_v7.0), Brachypodium (Bdistachyon_314_v3.0), maize (Zmays_284_AGPv3), sorghum (Sbicolor_313_v3.0), foxtail millet (Sitalica_312_v2), switchgrass (Pvirgatum_273_v1.0), wheat (Taestivum_296_v2), pearl millet v1⁴⁰, and barley (ASM32608v1), with Arabidopsis (Athaliana_167_TAIR9) as an outgroup control. The results showed that the percentage of napiergrass sequence tags aligned to these grass species ranged from 2.6% to 37.9% for barley and pearl millet genome, respectively (Table 1), indicating a relatively close relationship between napiergrass and pearl millet.

Table 1 Summary of the alignment of non-redundant tags of napiergrass (Cenchrus purpureus) to the available genomes of different species.

Full size table

SNP calling by various SNP callers

Three de-novo SNP calling pipelines, TASSEL-UNEAK, Stacks, and GBS-SNP-CROP identified 10,799, 6,871, and 4,521 SNPs, respectively. Reference based pipelines were also applied by using pearl millet v1⁴⁰ as the reference genome. However, the alignment rate was relatively low due to the differences between the napiergrass and pearl millet genomes. The percentage of clean reads aligned to the pearl millet genome using Bowtie2 ranged from 5.60% to 44.62% with an average of 39.68%. Two samples had a small number of sequences (<10% of the average number of sequences per sample) and also the lowest percentage of uniquely mapped reads (Supplementary Table S5, Supplementary Fig. S5). Therefore, these samples were removed from linkage map construction. Six different reference-based pipelines were employed to call SNPs viz., TASSEL 4.3⁴², Stacks 1.24⁴³, GBS-SNP-CROP⁴⁴, SAMtools 1.2 mpileup⁴⁵, FreeBayes 0.9.21⁴⁶, and GATK 3.3⁴⁷. TASSEL 4.3, Stacks, and SAMtools identified 7,326, 4,920, 27,082 SNPs, respectively in the mapping population, whereas FreeBayes, GBS-SNP-CROP, and GATK that can handle ploidy identified 25,193, 2,906 and 197,475 SNPs, respectively. The six reference-based SNP callers concordantly called only 11 SNPs (Fig. 1, only five programs are shown in figure due to Venn-diagram display limitations) and 207,391 non-redundant SNPs.

Genetic linkage map construction

From a total of 549,944 SNPs called by both reference based and de-novo pipelines, 287,093 SNPs were filtered for further analysis. Out of these, a total of 18,286 single-dose SNPs were genotyped in more than 180 progenies. Three individuals with more than 10% missing sites were removed from further analysis. For linkage map construction of each parental line, only the SNPs showing heterozygous in one parent but homozygous in the other parent were selected. A total of 3,276 loci were heterozygous in female parent but homozygous in male parent and segregated with an expected ratio of 1:1 in the population, thus can be used for female parent linkage map construction. Similarly, 3,417 loci were heterozygous in male parent but homozygous in female parent and segregated with an expected ratio of 1:1 in the population, thus can be used for male parent linkage map construction. For the female parental line, a total of 1,606 SNPs were grouped and 899 loci were mapped on 14 linkage groups with a total length of 1,555.17 cM averaging 1 marker every 1.72 cM (Supplementary Fig. S6). Inclusion of segregation distorted (SD) markers increased the genetic distance of the female parent map by 28.13%. For the male parent, a total of 1,509 markers were grouped into 14 linkage groups and 1,073 markers were mapped onto these 14 linkage groups with a total length of 1,939.19 cM averaging 1 marker every 1.80 cM (Supplementary Fig. S7). Inclusion of SD markers increased the total genetic distance of the male parent map by 38.41%.

A combined linkage map containing markers that segregated from both female and male parents was constructed, which can facilitate future QTL mapping of the population. To construct a combined linkage map, the markers showing heterozygous on both parents in addition to male-parent heterozygous and female-parent heterozygous markers were used. Therefore, a parent-averaged combined map was constructed by using 378 heterozygous markers for both parents that segregated in a 1:2:1 ratio in the population, in combination with 3,417 male-parent heterozygous and 3,276 female-parent heterozygous markers. In total, 4,058 markers were grouped into 14 linkage groups out of which 1,913 markers were mapped. The final composite linkage map spanned a length of 1,410.10 cM with an average of 0.73 cM between markers. The largest linkage group was Linkage group 02 (LG 02), which spanned 142.40 cM and contained 170 markers (Table 2). Length of each linkage group ranged from 70.18 cM to 142.40 cM and density ranged from 0.88 to 1.77 markers per cM (Fig. 2, Table 2, Supplementary Fig. S8). Results of the χ² test indicated that 114 (6.06%) of the 1,879 markers showed significant segregation distortion (0.001 < P < 0.05) on the combined map. These distorted markers showed clustered distribution on three segregation distortion regions (SDRs) in linkage groups LG07 and LG08 (Fig. 2).

Table 2 Summary of the combined linkage map of napiergrass and the percentage of gaps less than 5 cM in male and female parent linkage maps.

Full size table

Among the different reference-based SNP callers, GATK called the highest number of SNPs (197,475) followed by SAMtools and FreeBayes (Table 3). After accounting for segregation ratio and missing data, SAMtools retained the largest number of SNPs followed by TASSEL de-novo UNEAK. However, when considering the total number of markers mapped on the combined linkage groups, TASSEL de-novo UNEAK showed the highest percentage of SNPs mapped followed by Stacks (Table 3).

Table 3 Summary of napiergrass single nucleotide polymorphism (SNP) markers mapped on the combined linkage map using 9 different software pipelines.

Full size table

Comparison between genomes of napiergrass and pearl millet

Sequence tags of the markers that mapped on napiergrass linkage groups were extracted and compared to the pearl millet genome. Among the 1,156 TASSEL de-novo UNEAK tags positioned on the combined map, 663 were found to have significant sequence similarities to the genome sequence of pearl millet. Considerable collinearity was observed between the napiergrass and pearl millet genomes (Fig. 3). For each pearl millet pseudomolecule, two corresponding regions in the linkage groups (LGs) of napiergrass genome were identified (Figs 3 and 4). However, some pearl millet genomic regions had more than two corresponding regions on napiergrass genome. For example, pseudomolecule 3 of pearl millet had regions corresponding to three linkage groups LG03, LG12, and LG14 of napiergrass indicating possible chromosomal rearrangement between the two species after speciation (Figs 3 and 4).

Discussion

Despite its importance as a forage grass and its enormous potential as a biofuel crop, molecular, genetic, and genomic studies have been severely limited in napiergrass. Currently, there was no equivalent genome sequence in the public domain to be used as a reference for napiergrass. In this study, an initial comparison between the napiergrass survey sequences to 10 available grass genomes revealed that napiergrass genomic sequences had the highest similarity with the pearl millet genome, which could be explained by the presence of the A’A’ genome of napiergrass that is homologous to the AA genome of pearl millet. Consequently, in this study we have utilized pearl millet genome v1⁴⁰ as a reference for SNP calling and also performed de-novo SNP calling without a reference genome. A total of 38.8% of the napiergrass reads aligned to the pearl millet genome using Bowtie 2, which performed better over BWA, another popular aligner^48,49. The large portion of unaligned reads might be from the B genome or the divergent chromosome regions of A genome between the two species.

From the genome survey comparison, the total length of all the alignments of napiergrass reads was 25.1% longer in pearl millet indicating genic duplication or expansion in pearl millet and genomic rearrangements between the two species during evolution from their ancestral genome. This is consistent with a previously reported genomic in situ hybridization, which verified that the pearl millet genome A was 24% larger compared to the chromosomes of genome A’ of napiergrass⁵⁰. For the 10 longest contigs in our assembly, average repeat content (27%) was lower than reported from other grasses including sorghum (61%)⁵¹, maize (85%)⁵², foxtail millet (46%)⁵³, rice (43.3%)⁵⁴, and pearl millet (77%)⁴⁰. Low repeat content in napiergrass could be attributed to the loss of genomic sequences after hybridization. Rearrangements and loss of genomic sequences are common events after hybridization⁵⁵. Similar to other plant genomes, long-terminal repeat (LTR) retrotransposons comprised the most abundant class (62.19%) of repeats (Supplementary Table S1). Significant relationships between napiergrass, pearl millet, and P. squamulatum suggested their common origin and it was inferred that napiergrass and pearl millet had concomitantly diverged from a common ancestor^11,50,56 and the origin of napiergrass occurred at the interspecific hybridization event, by combining genome A of the ancestor with genome B of a still unknown second ancestor⁵⁰. Our study showed that the napiergrass genome had considerable microcolinearity with the pearl millet genome showing evidence of their relatedness and shared ancestry. Chromosome inverted duplications on pseudomolecule 3 of pearl millet showed possible rearrangement after speciation of napiergrass and pearl millet. Two corresponding regions on the napiergrass linkage groups for each pearl millet chromosome corroborate the hypothesis that these two genomes evolved from a common ancestor.

We developed a limited genomic assembly of napiergrass based on Illumina and 454 sequences. Nearly two thousand SSR markers were developed, which could be immediately useful for applications in napiergrass breeding and genetics. With the advancement of NGS, high throughput NGS-enabled genotyping technologies are becoming readily accessible. Yet, SSR markers remain as a popular tool for genetic studies, variety identification, monitoring of seed purity, and hybrid quality. They are particularly important in laboratories which have limited resources and lack access to NGS facilities or bioinformatic expertise. To our knowledge, this is the first study in napiergrass where SSR markers were developed based on napiergrass genome survey.

A genetic linkage map is an important tool to reveal the genome structure and to identify marker-trait associations⁵⁷ which ultimately help in MAS³³ to improve precision of selection. In this study, we used the GBS approach to construct a combined high-density linkage map that spanned 1,473.9 cM with 1,917 markers on 14 linkage groups, which is a very critical tool for further genetic and genomics studies of napiergrass. GBS has been extensively used for genotyping many diploid organisms, however, SNP calling from the NGS data in allotetraploids like napiergrass is particularly challenging due to existence of highly similar homeologous copies, one corresponding to A genome and the other to B genome⁵⁸. Therefore, different strategies have been devised to construct linkage map in allopolyploids. For example only uniquely aligned reads (single copy) were considered for SNP calling and subsequent map construction^59,60 to avoid the collapsed alignment of homoeologous reads due to low divergence, recent polyploidization event, and severe domestic bottlenecks⁶¹. SNP calling in allotetraploid Brassica napus L. (rapeseed; 2n = 4x = 38; AACC) was done by utilizing only uniquely mapped reads (single copy) and a read depth minimum of three to four reads at each potential SNP⁵⁹. Linkage map construction in zoysiagrass (Zoysia matrella) was performed by utilizing single-dose markers after calling SNPs using the maximum likelihood method in Stacks⁶². Similarly, single dose markers from TASSEL de-novo UNEAK were used to construct linkage maps in prairie cordgrass (Spartina pectinate)⁶³.

In this study, we applied multiple SNP callers and strategies to maximize SNP calling for linkage map construction for napiergrass. In the final combined genetic map, the number of markers identified by different software varied dramatically. GATK called the highest number of SNPs followed by SAMtools and FreeBayes initially. Both GATK and SAMtools apply Bayesian method to compute the posterior probability for each possible genotype and then choose the genotype with the highest probability as the consensus genotype⁶⁴. GBS-SNP-CROP and TASSEL showed a low matching percentage, which is similar to results from previous research⁴⁴. The number of useful markers for linkage group construction was the highest in SAMtools (47.75%) followed by TASSEL de-novo UNEAK (35.68%). However, the TASSEL de-novo UNEAK pipeline had the highest number of markers mapped on the linkage groups (60.43%) followed by Stacks (13.43%). This indicated that the network-based SNP discovery in TASSEL de-novo UNEAK and UStacks pipeline⁶⁵ could be efficiently utilized for constructing linkage maps in non-model species. Even though TASSEL was primarily designed for diploids, it is powerful enough to give a large number of mapped markers compared to other programs that handle polyploidy like FreeBayes, GATK, or GBS-SNP-CROP.

The SNP markers were relatively evenly distributed among the linkage groups with more than 97.45% of marker interval being less than 5 cM. To our knowledge, this linkage map with an average inter-marker distance of 0.7 cM is the first genetic linkage map of napiergrass to date. A study based on an interspecific population of a cross between pearl millet and napiergrass has been previously reported to link RAPD markers with biomass related traits in Pennisetum²³. The large number of markers and their even distribution in our study facilitate full-scale map coverage. Few regions where the interval space was >5 cM might be due to stretches of large repeats or due to low coverage sequencing of GBS^29,66. Segregation distortion is regarded as a potential evolutionary force and including these markers for linkage map construction could increase genome coverage as well as benefit QTL mapping^67,68. Including SDR markers and correcting for bias led to an increase in genetic distance between distorted markers⁶⁹. The deviation from expected Mendelian ratio shows disturbances in the transmission of genetic information from one generation to the next and can be caused by chromosome loss or rearrangements, genetic load, gametic selection, zygotic selection, or both^70,71,72. Napiergrass generally outcross through wind pollination that could result in high levels of gene flow leading to genetic load. The assignment of napiergrass linkage groups according to the pearl millet genome allows for future fine mapping and QTL analysis.

In summary, this study reports for the first time a high-density genetic linkage map using NGS-derived SNP markers, as well as the development of SSRs from napiergrass genomic sequences. The napiergrass genome showed considerable collinearity with the pearl millet genome and the genetic map contains 14 linkage groups with low inter-marker interval. The results will be useful for future molecular breeding programs such as identification of QTLs for important traits as well as MAS for the genetic improvement of napiergrass and comparative genomics. These resources will play a critical role in future whole genome sequencing projects and leveraging molecular breeding of napiergrass.

Methods

Napiergrass genome survey

The genomic DNA of napiergrass cultivars Merkeron and UF1 was sequenced using Illumina Genome Analyzer and 454 GS-FLX. For Illumina reads, reads that contained more than 50% low-quality bases (Q20) were removed and adapter sequences were trimmed. Quality and adapter trimming of 454 reads was done using default Newbler v2.8 (454 Life Sciences, Roche, Branford, CT) settings. Illumina reads were assembled using ABySS/1.3.4⁷³ with kmer size ranging from 25 to 60 at intervals of 5. The 454 reads were assembled using Newbler v2.8 (454 Life Sciences, Roche, Branford, CT) with default parameters. The assemblies were completed using CAP3⁷⁴. The largest 10 contigs of the assembly were selected for further analysis. Repeats on these contigs were masked using a comprehensive public repeat database compiled from TIGR plant repeats (http://plantrepeats.plantbiology.msu.edu/), Plant miniature inverted-repeat transposable elements (P-MITE) database (http://pmite.hzau.edu.cn/django/mite/), MIPS Repeat Element Database (http://mips.helmholtz-muenchen.de/plant/recat/), and Repbase from RepeatMasker software (http://www.repeatmasker.org/). Unique repeats were extracted from this database by removing redundant repeats with 98% identity using CD-HIT/4.6⁴⁶. SNPs and indels were called using FreeBayes/0.9.15⁴⁶ excluding alleles with depth less than 20. The annotation of SNPs was performed using SnpEff/4.0 (http://snpeff.sourceforge.net/)⁷⁵. In order to identify sequence similarity among the two genomes, clean reads from Illumina and 454 were aligned to the pearl millet genome v1⁴⁰ using bowtie2/2.2.5.

SSR identification and marker development

The napiergrass assembly was used to identify SSR markers that contained repeat motifs ranging in length from 1 to 6 bp. The minimum number of repeats was 10 for Mono-, 6 for Di-and 5 for Tri-, Tetra-, Penta- and Hexa-. SSRs were analyzed based on their types, number of repeats, and percentage frequency of occurrences of each SSR motif. SSRs in napiergrass were detected using MIcroSAtellite identification tool (MISA)⁷⁶ and primers were developed using primer3 software⁷⁷. SSR search results were input into scripts p3_in.pl and p3_out.pl in order to identify SSR primer pairs for napiergrass.

Plant materials and DNA extraction

A mapping population of 185 F₁ hybrid progenies were developed from a cross between two napiergrass accessions (N122 and N190) described previously^41,78. The 185 F₁ hybrids were planted in the field at the Plant Science Research and Education Unit (PSREU), Citra, Florida, along with the parental accessions.

Young and healthy leaf tissues were harvested from each individual of the mapping population. DNA extraction was done following the protocol described previously⁷⁹. The extracted DNA samples were run on a 2% agarose gel to check the quality and quantified with PicoGreen to meet the requirements of GBS. 185 F₁ plants that were confirmed to be true hybrids using SSR markers⁴¹ were selected for GBS.

Genotyping by sequencing

GBS data was generated at the Institute of Biotechnology, Cornell University as described previously³⁰. Briefly, DNA samples were digested with the restriction enzyme PstI followed by ligation of adapters, that consisted of Illumina sequencing primers and barcode adapters, to the DNA fragment ends. After ligation, 95 samples were combined into a pool and PCR amplification was performed to create a GBS library and sequenced on Illumina HiSeq 2000.

Comparative genomics

Unique tags of napiergrass from TASSEL de-novo UNEAK/3.0⁴² pipeline were used for comparative genomic analysis. CD-HIT/4.6.4⁸⁰ was used to cluster the tags. Genomes of rice (Osativa_323_v7.0), Brachypodium (Bdistachyon_314_v3.0), maize (Zmays_284_AGPv3), sorghum (Sbicolor_313_v3.0), foxtail millet (Sitalica_312_v2), switchgrass (Pvirgatum_273_v1.0), wheat (Taestivum_296_v2), Arabidopsis (Athaliana_167_TAIR9) were downloaded from Phytozome v11 (https://phytozome.jgi.doe.gov/pz/portal.html). The barley genome (ASM32608v1) was downloaded from Ensembl (http://www.ensembl.org). We used BLASTN (BLAST v2.5.0) with the default settings and an e-value cutoff of 1 × 10⁻⁸ to blast the uniquely clustered tags of napiergrass against different genomes in order to find the percentage similarity of napiergrass reads among the various grass species. Tags of 64 bp with >80% identity and alignment length >50 bp to the respective genomes were counted as a hit.

Sequence analysis and SNP calling

Raw data processing and SNP identification was performed using both de novo and reference-based approaches. Common software capable of calling SNPs de novo used in this research were TASSEL/3.0 UNEAK⁴², Stacks/1.24⁸¹, and GBS-SNP-CROP 1.1⁴⁴. For the reference based approach, pearl millet reference genome v1⁴⁰ was used. The reference genome consists of seven pseudomolecules. Six different reference based pipelines were evaluated to call SNPs viz., TASSEL 4.3⁴², Stacks 1.24⁴³, GBS-SNP-CROP 1.1⁴⁴, SAMtools 1.2 mpileup⁴⁵, FreeBayes 0.9.21⁴⁶, and GATK 3.3⁴⁷. Parameters used in each software were provided in Supplementary Table S6. Sequence variants called were filtered with a minimum depth of 48 per sample.

Linkage map construction

QC-filtered SNPs were further filtered by the following standards for map construction: (1) markers must be genotyped in at least 180 individuals; (2) Individuals with over 10% missing data were discarded; and (3) Redundant markers were removed by standard of similarity = 1. For each parental map construction, only single dose markers were used. Markers segregating at a distorted Mendelian ratio (expected ratio for ‘lmxll’ type and ‘nnxnp’ type is 1: 1, χ2 test, 0.001 < P < 0.05) were marked. The single dose markers from the maternal and paternal parent were analyzed separately using JoinMap 4.1⁸² and outcross pollinated family (CP) was selected as the population type. Markers that were heterozygous in N122 and homozygous in N190 (‘lmxll’ type) were selected to build N122 linkage groups. Markers that were heterozygous in N190 and homozygous in N122 (‘nnxnp’ type) were selected to build N190 linkage groups. The linkage groups were built using regression mapping algorithm, with a minimum logarithm of odds (LOD) value at 20, and a maximum recombination frequency at 0.40. Marker positioning calculation was performed with a goodness-of-fit jump at 5, followed by a “ripple” procedure (value = 1). Map distances were estimated using the Kosambi mapping function. Genetic distance between SDR markers were corrected using DistortedMap⁶⁹. Linkage maps were drawn with MapChart⁸³. For integrated map construction, markers that were heterozygous in both parents (‘hkxhk’ type) were selected to build combined linkage groups. Markers segregating at distorted Mendelian ratio (expected ratio for ‘hkxhk’ type is 1:2:1, χ2 test, 0.001 < P < 0.05) were marked. The retained markers were then added with the markers from male and female parents to construct a combined map. The linkage groups were built using regression mapping algorithm, with a minimum logarithm of odds (LOD) value at 20, and a maximum recombination frequency of 0.40. Other parameters were the same with linkage map construction above. Regions showing segregation distortion (0.001 < P < 0.05) with more than three adjacent loci were marked as SDR regions^84,85.

Comparison between napiergrass and pearl millet genome

Consensus sequence of mapped markers from TASSEL de-novo UNEAK were used to compare with the reference genome of pearl millet with same parameters (BLASTN defaults with an e-value cutoff of 1 × 10⁻⁸). Markers that showed significant hits to the genome sequence and/or gene models of pearl millet with >80% identity and alignment length >50 bp were extracted and used for comparative genomics study. A circos plot was drawn using circos/0.69-2⁸⁶.

Availability of Materials and Data

All raw data from the genotyping-by-sequencing runs is deposited at NCBI on PRJNA380523.

References

Bhandari, A. P., Sukanya, D. H. & Ramesh, C. R. Application of isozyme data in fingerprinting Napier grass (Pennisetum purpureum Schum.) for germplasm management. Genet. Resour. Crop Evol. 53, 253–264 (2006).
Article CAS Google Scholar
Farrell, G., Simons, S. A., Hillocks, R. J. & Farrell G., H. R. J. Pests, diseases and weeds of napier grass, Pennisetum purpureum: a review. Int. J. Pest Manag. 48, 39–48 (2002).
Article Google Scholar
Singh, B. P., Singh, H. P. & Obeng, E. In Biofuel Crops: Production, Physiology and Genetics (ed. Singh, B. P.) 271–291 (CAB International, 2013).
Chemisquy, M. A., Giussani, L. M., Scataglini, M. A., Kellogg, E. A. & Morrone, O. Phylogenetic studies favour the unification of Pennisetum, Cenchrus and Odontelytrum (Poaceae): A combined nuclear, plastid and morphological analysis, and nomenclatural combinations in. Cenchrus. Ann. Bot. 106, 107–130 (2010).
Article Google Scholar
Ra, K., Shiotsu, F., Abe, J. & Morita, S. Biomass yield and nitrogen use efficiency of cellulosic energy crops for ethanol production. Biomass and Bioenergy 37, 330–334 (2012).
Article CAS Google Scholar
Anderson, W. F., Casler, M. & Baldwin, B. In Genetic Improvement of Bioenergy Crops (ed. Vermerris, W.) (Springer Science + Business Media, LLC, 2008).
Somerville, C., Youngs, H., Taylor, C., Davis, S. C. & Long, S. P. Feedstocks for lignocellulosic biofuels. Science 329, 790–793 (2010).
Article ADS CAS Google Scholar
Augustin, E. & Tcacenco, F. A. Isoenzymatic characterization of elephantgrass (Pennisetum purpureum Schum.) germplasm. Rev. Bras. Genet. 16, 685–696 (1993).
CAS Google Scholar
Jauhar, P. P. Cytogenetics and breeding of pearl millet and related species. (Alan R. Liss, Inc. Progress and Topics in Cytogenetics, Volume 1, 1981).
Taylor, M. G. & Vasil, I. K. Analysis of DNA size, content and cell cycle in leaves of napier grass (Pennisetum purpureum Schum.). Theor. Appl. Genet. 74, 681–686 (1987).
Article CAS Google Scholar
Martel, E., Nay, D. De., Siljak-Yakovlev, S., Brown, S. & Sarr, A. Genome size variation and basic chromosome number in pearl millet and fourteen related Pennisetum species. J. Hered. 88, 139–143 (1997).
Article Google Scholar
Harlan, J. R. & Wet, J. M. J. de. Toward a rational classification of cultivated plants. Taxon 20, 509–517 (1971).
Article Google Scholar
Dowling, C. D., Burson, B. L. & Jessup, R. W. Marker-assisted verification of Kinggrass (Pennisetum purpureum Schumach. x Pennisetum glaucum [L.] R. Br.). Plant Omi. J. 7, 72–79 (2014).
Google Scholar
Hanna, W. & Monson, W. G. Yield, quality, and breeding of pearl millet × napiergrass interspecific hybrids. Agron. J. 72, 358–360 (1980).
Article Google Scholar
Burton, G. W. Hybrids between napier grass and cattail millet. J. Hered. 227–232 (1944).
Article Google Scholar
Muldoon, D. K. & Pearson, C. J. The hybrid between Pennisetum americanum and Pennisetum purpureum. Herb. Abstr. 49, 189–199 (1979).
Google Scholar
Gupta, S. C. & Mhere, O. Identification of superior pearl millet by napier hybrids and napiers in Zimbabwe. African Crop Sci. J. 5, 229–237 (1997).
Article Google Scholar
Lowe, A. J., Thorpe, W., Teale, A. & Hanson, J. Characterisation of germplasm accessions of napier grass (Pennisetum purpureum and P. purpureum × P. glaucum Hybrids) and comparison with farm clones using RAPD. Genet. Resour. Crop Evol. 50, 121–132 (2003).
Article CAS Google Scholar
Harris, K., Anderson, W. F. & Malik, R. Genetic relationships among napiergrass (Pennisetum purpureum Schum.) nursery accessions using AFLP markers. Plant Genet. Resour. 8, 63–70 (2010).
Article CAS Google Scholar
Kandel, R., Singh, H. P., Singh, B. P., Harris-Shultz, K. R. & Anderson, W. F. Assessment of genetic diversity in napier grass (Pennisetum purpureum Schum.) using microsatellite, single-nucleotide polymorphism and insertion-deletion markers from pearl millet (Pennisetum glaucum [L.] R. Br.). Plant Mol. Biol. Report. 34, 265–272 (2016).
Article CAS Google Scholar
Dowling, C. D., Burson, B. L., Foster, J. L., Tarpley, L. & Jessup, R. W. Confirmation of pearl millet-napiergrass hybrids using EST-derived simple sequence repeat (SSR)markers. Am. J. Plant Sci. 4, 1004–1012 (2013).
Article Google Scholar
López, Y. et al. Genetic diversity of biofuel and naturalized napiergrass (Pennisetum purpureum). Invasive Plant Sci. Manag. 7, 229–236 (2014).
Article Google Scholar
Smith, R. L., Schweder, M. E., Chowdhury, M. K. U., Seib, J. C. & Schank, S. C. Development and application of RFLP and RAPD DNA markers in genetic improvement of Pennisetum for biomass and forage production. Biomass and Bioenergy 5, 51–62 (1993).
Article Google Scholar
Negawo, A. T., Teshome, A., Kumar, A., Hanson, J. & Jones, C. S. Opportunities for napier grass (Pennisetum purpureum) improvement using molecular genetics. Agronomy 7, 1–21 (2017).
Article Google Scholar
Song, X., Ge, T., Li, Y. & Hou, X. Genome-wide identification of SSR and SNP markers from the non-heading Chinese cabbage for comparative genomic analyses. BMC Genomics 16, 328 (2015).
Article Google Scholar
Powell, W., Machray, G. C. & Provan, J. Polymorphism revealed by simple sequence repeats. Trends Plant Sci. 1, 215–222 (1996).
Article Google Scholar
Kannan, B. et al. Association analysis of SSR markers with phenology, grain, and stover-yield related traits in pearl millet (Pennisetum glaucum (L.) R. Br.). Sci. J. 2014, 14 (2014).
Google Scholar
Ganal, M. W., Altmann, T. & Röder, M. S. SNP identification in crop plants. Curr. Opin. Plant Biol. 12, 211–7 (2009).
Article CAS Google Scholar
Poland, J. A. & Rife, T. W. Genotyping-by-sequencing for plant breeding and genetics. Plant Genome 5, 92–102 (2012).
Article CAS Google Scholar
Elshire, R. J. et al. A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS One 6 (2011).
Article ADS CAS Google Scholar
Deschamps, S., Llaca, V. & May, G. D. Genotyping-by-sequencing in plants. Biology (Basel). 1, 460–483 (2012).
PubMed PubMed Central Google Scholar
Poland, J. et al. Genomic selection in wheat breeding using genotyping-by-sequencing. Plant Genome 5, 103–113 (2012).
Article CAS Google Scholar
Lu, F. et al. Switchgrass genomic diversity, ploidy, and evolution: novel insights from a network-based SNP discovery protocol. PLoS Genet. 9, e1003215 (2013).
Article CAS Google Scholar
Uitdewilligen, J. G. A. M. L. et al. A next-generation sequencing method for genotyping-by-sequencing of highly heterozygous autotetraploid potato. PLoS One 8, e62355 (2013).
Article ADS CAS Google Scholar
Yang, X. et al. Constructing high-density genetic maps for polyploid sugarcane (Saccharum spp.) and identifying quantitative trait loci controlling brown rust resistance. Mol. Breed. 37, 116 (2017).
Article Google Scholar
Clevenger, J., Chavarro, C., Pearl, S. A., Ozias-Akins, P. & Jackson, S. A. Single nucleotide polymorphism identification in polyploids: A review, example, and recommendations. Mol. Plant 8, 831–846 (2015).
Article CAS Google Scholar
Stone, E. A. & Ayroles, J. F. Modulated modularity clustering as an exploratory tool for functional genomic inference. PLoS Genet. 5 (2009).
Article Google Scholar
Poland, J. A., Brown, P. J., Sorrells, M. E. & Jannink, J.-L. Development of high-density genetic maps for barley and wheat using a novel two-enzyme genotyping-by-sequencing approach. PLoS One 7, e32253 (2012).
Article ADS CAS Google Scholar
Punnuri, S. M. et al. Development of a high-density linkage map and tagging leaf spot resistance in pearl millet using GBS markers. Plant Genome 9, 0 (2016).
Article Google Scholar
Varshney, R. K. et al. Pearl millet genome sequence provides a resource to improve agronomic traits in arid environments. Nat. Biotechnol. 35, 969–976 (2017).
Article CAS Google Scholar
Sinche, M. et al. Development and characterization of napiergrass (Pennisetum purpureum) mapping population for flowering time and biomass related traits. Submitt. Publ. (2018).
Glaubitz, J. C. et al. TASSEL-GBS: A high capacity genotyping by sequencing analysis pipeline. PLoS One 9 (2014).
Article ADS Google Scholar
Catchen, J. M. et al. Stacks: Building and genotyping loci de novo from short-read sequences. G3.Genes|Genomes|Genetics 1, 171–182 (2011).
Article CAS Google Scholar
Melo, A. T. O., Bartaula, R. & Hale, I. GBS-SNP-CROP: a reference-optional pipeline for SNP discovery and plant germplasm characterization using variable length, paired-end genotyping-by-sequencing data. BMC Bioinformatics 17, 29 (2016).
Article Google Scholar
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article Google Scholar
Garrison, E. & Marth, G. Haplotype-based variant detection from short-read sequencing. arXiv:1207.3907 9 (2012).
McKenna, A. et al. The genome analysis toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).
Article CAS Google Scholar
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat Methods 9, 357–359 (2012).
Article CAS Google Scholar
Yang, X. et al. Mining sequence variations in representative polyploid sugarcane germplasm accessions. BMC Genomics 18 (2017).
Reis, G. Bdos et al. Genomic homeology between Pennisetum purpureum and Pennisetum glaucum (Poaceae). Comp. Cytogenet. 8, 199–209 (2014).
Article Google Scholar
Paterson, A. H. et al. The Sorghum bicolor genome and the diversification of grasses. Nature 457, 551–556 (2009).
Article ADS CAS Google Scholar
Schnable, P., Ware, D., Fulton, R. & Stein, J. The B73 maize genome: complexity, diversity, and dynamics. Science 326, 1112–1115 (2009).
Article ADS CAS Google Scholar
Zhang, G. et al. Genome sequence of foxtail millet (Setaria italica) provides insights into grass evolution and biofuel potential. Nat. Biotechnol. 30, 549–54 (2012).
Article CAS Google Scholar
Yu, J. et al. A draft sequence of the rice genome (Oryza sativa L. ssp. indica). Science 296, 79–92 (2002).
Article ADS CAS Google Scholar
Kellis, M., Birren, B. W. & Lander, E. S. Proof and evolutionary analysis of ancient genome duplication in the yeast Saccharomyces cerevisiae. Nature 428, 617–624 (2004).
Article ADS CAS Google Scholar
Martel, E. et al. Chromosome evolution of Pennisetum species (Poaceae): Implications of ITS phylogeny. Plant Syst. Evol. 249, 139–149 (2004).
Article Google Scholar
Cai, C., Cheng, F. Y., Wu, J., Zhong, Y. & Liu, G. The first high-density genetic map construction in tree peony (Paeonia Sect. Moutan) using genotyping by specific-locus amplified fragment sequencing. PLoS One 10, e0128584 (2015).
Article Google Scholar
Nagy, I., Barth, S., Mehenni-Ciz, J., Abberton, M. T. & Milbourne, D. A hybrid next generation transcript sequencing-based approach to identify allelic and homeolog-specific single nucleotide polymorphisms in allotetraploid white clover. BMC Genomics 14, 100 (2013).
Article CAS Google Scholar
Trick, M., Long, Y., Meng, J. & Bancroft, I. Single nucleotide polymorphism (SNP) discovery in the polyploid Brassica napus using Solexa transcriptome sequencing. Plant Biotechnol. J. 7, 334–346 (2009).
Article CAS Google Scholar
Zhou, X. et al. Construction of a SNP-based genetic linkage map in cultivated peanut based on large scale marker development using next-generation double-digest restriction-site-associated DNA sequencing (ddRADseq). BMC Genomics 15, 351 (2014).
Article Google Scholar
Pandey, M. K. et al. Advances in Arachis genomics for peanut improvement. Biotechnol. Adv. 30, 639–651 (2012).
Article CAS Google Scholar
Huang, X. et al. Construction of high-resolution genetic maps of Zoysia matrella (L.) Merrill and applications to comparative genomic analysis and QTL mapping of resistance to fall armyworm. BMC Genomics 17, 1–16 (2016).
Article Google Scholar
Crawford, J., Brown, P. J., Voigt, T. & Lee, D. K. Linkage mapping in prairie cordgrass (Spartina pectinata Link) using genotyping-by-sequencing. Mol. Breed. 36, 1–12 (2016).
Article CAS Google Scholar
Yu, X. et al. Comparing a few SNP calling algorithms using low-coverage sequencing data. BMC Bioinformatics 14, 274 (2013).
Article Google Scholar
Kim, C. et al. Application of genotyping by sequencing technology to a variety of crop breeding programs. Plant Sci. 242, 14–22 (2015).
Article Google Scholar
Mathew, L. S. et al. A first genetic map of date palm (Phoenix dactylifera) reveals long-range genome structure conservation in the palms. BMC Genomics 15, 1–10 (2014).
Article MathSciNet Google Scholar
Xu, S. Quantitative trait locus mapping can benefit from segregation distortion. Genetics 180, 2201–2208 (2008).
Article Google Scholar
Taylor, D. R. & Ingvarsson, P. K. Common features of segregation distortion in plants and animals. Genetica 117, 27–35 (2003).
Article CAS Google Scholar
Xie, S. Q., Feng, J. Y. & Zhang, Y. M. Linkage group correction using epistatic distorted markers in F2 and backcross populations. Heredity. 112, 479–488 (2014).
Article Google Scholar
Faris, J. D., Laddomada, B. & Gill, B. S. Molecular mapping of segregation distortion loci in Aegilops tauschii. Genetics 149, 319–327 (1998).
CAS PubMed PubMed Central Google Scholar
Karkkainen, K., Koski, V. & Savolainen, O. Geographical variation in the inbreeding depression of Scots pine. Evolution (N. Y). 50, 111 (1996).
Google Scholar
Bodénès, C., Chancerel, E., Ehrenmann, F., Kremer, A. & Plomion, C. High-density linkage mapping and distribution of segregation distortion regions in the oak genome. DNA Res. 23, 115–124 (2016).
Article Google Scholar
Simpson, J. T. et al. ABySS: a parallel assembler for short read sequence data. Genome Res. 19, 1117–23 (2009).
Article CAS Google Scholar
Huang, X. CAP3: A DNA sequence assembly program. Genome Res. 9, 868–877 (1999).
Article CAS Google Scholar
Cingolani, P. et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff. Fly (Austin). 6, 80–92 (2012).
Article CAS Google Scholar
Thiel, T., Michalek, W., Varshney, R. K. & Graner, A. Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.). TAG. Theor. Appl. Genet. 106, 411–422 (2003).
Article CAS Google Scholar
Rozen, S. & Skaletsky, H. Primer3 on the WWW for general users and for biologist programmers. Methods Mol. Biol. 132, 365–386 (2000).
CAS PubMed Google Scholar
Sinche, M. V. S. Genetic improvement of elephantgrass (Pennisetum purpureum schum.) as a bioenergy feedstock. (University of Florida, 2013).
Dellaporta, S. L., Wood, J. & Hicks, J. B. A plant DNA minipreparation: version II. Plant Mol. Biol. Report. 1, 19–21 (1983).
Article CAS Google Scholar
Li, W. & Godzik, A. Cd-hit: A fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22, 1658–1659 (2006).
Article CAS Google Scholar
Catchen, J., Hohenlohe, P. A., Bassham, S., Amores, A. & Cresko, W. A. Stacks: An analysis tool set for population genomics. Mol. Ecol. 22, 3124–3140 (2013).
Article Google Scholar
Van Ooijen, J. W. JoinMap 4.0: Software for the calculation of genetic linkage maps in experimental populations. Wageningen, Netherlands Kyazma BV (2006).
Voorrips, R. E. MapChart: software for the graphical presentation of linkage maps and QTLs. J. Hered. 93, 77–78 (2002).
Article CAS Google Scholar
Paillard, S. et al. An integrative genetic linkage map of winter wheat (Triticum aestivum L.). Theor. Appl. Genet. 107, 1235–1242 (2003).
Article CAS Google Scholar
Zhang, Z. et al. Construction of a high-density genetic map by specific locus amplified fragment sequencing (SLAF-seq) and its application to Quantitative Trait Loci (QTL) analysis for boll weight in upland cotton (Gossypium hirsutum). BMC Plant Biol. 16 (2016).
Krzywinski, M. et al. Circos: an information aesthetic for comparative genomics. Genome Res. 19, 1639–1645 (2009).
Article CAS Google Scholar

Download references

Acknowledgements

This work was funded in part by the Plant Breeders Workgroup, University of Florida and by the USDA National Institute of Food and Agriculture, Hatch project 1003932. We like to thank William Anderson USDA-ARS, Tifton, GA for providing napiergrass accession N122 and Sun Gro Horticulture, Apopka, FL, for donation of the Fafard #2 potting mix.

Author information

Authors and Affiliations

Agronomy Department, IFAS, University of Florida, Gainesville, FL, 32611, USA
Dev Paudel, Baskaran Kannan, Xiping Yang, Fredy Altpeter & Jianping Wang
Crop Genetics and Breeding Research Unit, USDA-Agricultural Research Service, 115 Coastal Way, Tifton, GA, 31793, USA
Karen Harris-Shultz
Center of Excellence in Genomics & Systems Biology, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, 502324, Telangana State, India
Mahendar Thudi & Rajeev K. Varshney
Plant Molecular and Cellular Biology Program, Genetic Institute, University of Florida, Gainesville, FL, 32611, USA
Fredy Altpeter & Jianping Wang
Center for Genomics and Biotechnology, Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps, Ministry of Education, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, Fuzhou, Fujian, 350002, China
Jianping Wang

Authors

Dev Paudel
View author publications
You can also search for this author in PubMed Google Scholar
Baskaran Kannan
View author publications
You can also search for this author in PubMed Google Scholar
Xiping Yang
View author publications
You can also search for this author in PubMed Google Scholar
Karen Harris-Shultz
View author publications
You can also search for this author in PubMed Google Scholar
Mahendar Thudi
View author publications
You can also search for this author in PubMed Google Scholar
Rajeev K. Varshney
View author publications
You can also search for this author in PubMed Google Scholar
Fredy Altpeter
View author publications
You can also search for this author in PubMed Google Scholar
Jianping Wang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.W. and F.A. conceived the study. J.W., F.A. and D.P. designed and coordinated the experiments. D.P., B.K., and X.Y. conducted the experiment. R.K.V. and M.T. provided the reference genome. K.H. provided napiergrass sequences. F.A. provided the napiergrass mapping population. D.P. analyzed the data and drafted the manuscript. All authors reviewed the manuscript.

Corresponding author

Correspondence to Jianping Wang.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Tables and Figures

Supplementary Tables S3 and S4

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Paudel, D., Kannan, B., Yang, X. et al. Surveying the genome and constructing a high-density genetic map of napiergrass (Cenchrus purpureus Schumach). Sci Rep 8, 14419 (2018). https://doi.org/10.1038/s41598-018-32674-x

Download citation

Received: 28 March 2018
Accepted: 13 September 2018
Published: 26 September 2018
DOI: https://doi.org/10.1038/s41598-018-32674-x

Keywords

This article is cited by

Generation of Transgenic Napier Grass (Cenchrus purpureus Schum.) Plants by Biolistic Gene Transfer of a Minimal Expression Cassette
- F. G. Faleiro
- J. Y. Kim
- F. Altpeter
Tropical Plant Biology (2021)
QTL mapping of flowering time and biomass yield in tetraploid alfalfa (Medicago sativa L.)
- Laxman Adhikari
- Shiva Om Makaju
- Ali M. Missaoui
BMC Plant Biology (2019)
Unraveling candidate genes underlying biomass digestibility in elephant grass (Cenchrus purpureus)
- João Romero do Amaral Santos de Car Rocha
- Tiago de Souza Marçal
- Juarez Campolina Machado
BMC Plant Biology (2019)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Napiergrass genome survey

Genotyping-by-sequencing

SNP calling by various SNP callers

Genetic linkage map construction

Comparison between genomes of napiergrass and pearl millet

Discussion

Methods

Napiergrass genome survey

SSR identification and marker development

Plant materials and DNA extraction

Genotyping by sequencing

Comparative genomics

Sequence analysis and SNP calling

Linkage map construction

Comparison between napiergrass and pearl millet genome

Availability of Materials and Data

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

Keywords

This article is cited by

Comments

Search

Quick links