The first draft reference genome of the American mink (Neovison vison)

Cai, Zexi; Petersen, Bent; Sahana, Goutam; Madsen, Lone B.; Larsen, Knud; Thomsen, Bo; Bendixen, Christian; Lund, Mogens Sandø; Guldbrandtsen, Bernt; Panitz, Frank

doi:10.1038/s41598-017-15169-z

Download PDF

Article
Open access
Published: 06 November 2017

The first draft reference genome of the American mink (Neovison vison)

Scientific Reports volume 7, Article number: 14564 (2017) Cite this article

3968 Accesses
17 Citations
9 Altmetric
Metrics details

Subjects

Abstract

The American mink (Neovison vison) is a semiaquatic species of mustelid native to North America. It’s an important animal for the fur industry. Many efforts have been made to locate genes influencing fur quality and color, but this search has been impeded by the lack of a reference genome. Here we present the first draft genome of mink. In our study, two mink individuals were sequenced by Illumina sequencing with 797 Gb sequence generated. Assembly yielded 7,175 scaffolds with an N50 of 6.3 Mb and length of 2.4 Gb including gaps. Repeat sequences constitute around 31% of the genome, which is lower than for dog and cat genomes. The alignments of mink, ferret and dog genomes help to illustrate the chromosomes rearrangement. Gene annotation identified 21,053 protein-coding sequences present in mink genome. The reference genome’s structure is consistent with the microsatellite-based genetic map. Mapping of well-studied genes known to be involved in coat quality and coat color, and previously located fur quality QTL provide new knowledge about putative candidate genes for fur traits. The draft genome shows great potential to facilitate genomic research towards improved breeding for high fur quality animals and strengthen our understanding on evolution of Carnivora.

Genome analysis of American minks reveals link of mutations in Ras-related protein-38 gene to Moyle brown coat phenotype

Article Open access 28 September 2020

A chromosome-level genome assembly reveals genomic characteristics of the American mink (Neogale vison)

Article Open access 16 December 2022

Comprehensive genome assembly reveals genetic diversity and carcass consumption insights in critically endangered Asian king vultures

Article Open access 24 April 2024

Introduction

American mink (Neovison vison, 2n = 30) is a semiaquatic species of mustelids native to North America. The exact taxonomic position of the mink remains to be resolved¹. So far data from cytogenetics² and molecular phylogeny³ placed American mink and sea mink (Neovison macrodon) to a separate genus Neovison. Since the recent extinction of the sea mink⁴, the American mink is the only extant member of the genus Neovison in the order Carnivora. The estimated genome size of mink is 2.7 Gb similar to the ferret genome⁵. Besides, American mink has the smallest number of chromosomes among the Carnivora⁶. Because of this, mink was included in many fluorescent in situ hybridization of a chromosome-specific probe from one species to chromosomes from other species (ZOO-FISH) analyses^7,8,9.

American mink is the most common farmed animal for fur, exceeding the silver fox, sable, marten, and skunk in economic importance. Fur quality, fur color and fertility are the most important traits for the fur industry. Considerable genetic research has been done in order to uncover genes causing variation in these traits. The first linkage map based on microsatellite markers was published in 2007¹⁰. This map was updated twice by incorporating more markers and re-assigning markers order^11,12, yielding a total of 104 markers. Quantitative Trait Locus (QTL) mapping based on linkage analyses mapped some genes related to fur quality and fur color to chromosomes¹³. Comparative mapping identified 16 candidate genes for fur quality or color and were mapped to chromosome arms¹². Sequencing of bacterial artificial chromosome (BAC) clones based on hybridization was used to sequence these genes⁵. Also, a transcriptome profile of mink is available¹⁴. However, without a reference genome, prospects to find genes for economical important traits of interest to selective breeding remain limited.

With the first release of the human genome¹⁵ and the emergence of the next generation sequencing technology¹⁶, more and more genome sequences have become available. Even though we already have dog¹⁷, cat¹⁸ and ferret¹⁹ genomes, the divergent²⁰ and chromosome rearrangement²¹ of these species, it will be valuable to obtain the mink genome sequence. With a reference genome assembly for mink available, research and breeding based on genomic approaches will become possible. For example, gene function and expression analysis with RNA-seq (RNA sequencing)²², epigenetics and transcription factor analysis with ChIP-seq (chromatin immunoprecipitation sequencing)²³ and whole genome genotyping all depend on a reference genome. For genotyping, availability of a high quality reference greatly eases discovery and evaluation of single nucleotide polymorphisms (SNP), which will make genome-wide association studies (GWAS) possible²⁴. Also, genomic selection²⁵ which has been extremely successful in several farm animal species is greatly facilitated by easy and reliable marker discovery. In order to provide the fundamental information for genomic research for fur related traits, we present the first draft genome sequence of American mink, and show the application potential for fur industry.

Results

Sequencing and assembly

Whole genome shotgun sequencing (WGS) strategy was used to generate data from two individuals of American minks, one pearl male and one brown female. 383 Gb next-generation Illumina paired-end (PE) reads was generated by sequencing genome shotgun libraries with insert sizes of 150, 165 and 600 bp. In addition, 414 Gb of mate-paired (MP) reads were generated with insert sizes of 3, 5, 6, 8, 10, 14 and 32 kb. The estimated size of the mink genome of 2.7 Gb was covered 295 fold. Sequencing reads were assembled by ALLPATHS-LG²⁶ with three different combinations of data (Supplementary Table S2) due to server memory limitations (1TB memory available). We constructed four assemblies. The first one was the Pearl-mink-assembly (PMA) constructed using reads from the pearl mink. The second one was Brown-mink-assembly (BMA) using part of brown mink data (for details see method section). The third one was a hybrid-assembly (HA) of all pearl mink data and part of brown mink data (part of 150 bp, part of 600 bp, 8 kb, part of 10 kb, 14 kb and 32 kb libraries were used). The fourth assembly, draft-assembly, was constructed by an additional round of scaffolding with SSPACE²⁷ using the HA (lowest number of scaffolds) as the backbone and simulated long insert MP reads from BMA (longer scaffold N50) using ART²⁸. Finally this resulted in a 2.45 Gb assembly including gaps (Table 1), which corresponds to 90% of the estimated 2.7 Gb mink genome size²⁹. The draft genome consisted of 7,175 scaffolds with an N50 of 6.3 Mb, where the largest scaffold was 40.3 Mb. The detailed information on all the four assemblies was presented in Supplementary Table S2. We experienced that the draft assembly, which combined HA with additional scaffolding procedure largely improved the scaffold N50 of assembly and reduced the number of scaffolds.

Table 1 Summary details of the mink genome assembly.

Full size table

Assembly assessment

To assess the quality of the draft genome and the validity of the additional scaffolding procedure, we first mapped a part of the PE and MP sequence data back to the assembly. Insert size libraries of 150 bp, 600 bp and 3 kb were chosen to assess the quality of the assembly by aligning them to the four assemblies (Supplementary Table S3). The result showed that 98%, 98% and 91% of these libraries could be mapped to the draft assembly. In addition, 95.72%, 89.55% and 69.91% of pairs were properly paired (Supplementary Table S3). Compared draft assembly with the PMA, the percentage of alignment increased 0.04%, 0.04% and 0.11%, but properly pair decreased 0.23%, 0.11% and 0.47%. The decreased of properly pair mostly came from the HA, since the HA was built from reads from both individuals. After additional scaffolding, the properly pair for 150 bp and 600 bp libraries were the same in the draft assembly and the HA, and the properly pair for 3 kb library increased 0.42% from the HA. In all, the sequence alignment of these three libraries showed similar results among different assemblies and improvement of the additional scaffolding. However, because the tolerance of heterozygosity of two individuals, some alignments decrease by a small percentage from PMA to HA.

For the two BAC-end sequence libraries, after filtering out clones where it was not possible to find both ends, we had 1,795 pairs of BES (BAC-end sequence) from black mink³⁰ and 833 pairs BES from the CHORI-231 BAC library⁵. The BESs were mapped to the genome assembly. To estimate the properly paired reads for BES libraries, both ends should be uniquely aligned to the same scaffolds. Out of these, 99.32% of the reads from the first library and 89.7% of the reads from the second library could be mapped to the draft assembly. From the first library, 79.55% (1,428 out of 1,795) pairs and 55.58% (463 out of 833) of pairs from the second library were properly paired. The estimated insert sizes were close to the sizes targeted during the construction of the BAC libraries, which were 25 kb (20–50 kb BAC library) and 164 kb (170 kb BAC library) (Supplementary Table S3). Comparing the BES alignment of draft assembly with PMA and HA, we found the properly paired increased from 76.27% to 78.27% and at last to 79.55% for the first BES library, and increased from 40.58% to 50.54% and at last to 55.58% for the second BES library. This showed our strategy successfully solved some long distance linking.

The similar short reads mapping pattern was also presented in ferret assembly. We downloaded and aligned selected libraries of the ferret assembly (Supplementary Table S4). The 180 bp library had 88.93% of reads that could be aligned, and 79.20% of them were properly paired. The 3 kb library had 93.15% reads aligned and 80.64% paired properly. The 6 kb-10 kb and 40 kb libraries had 95.90% and 86.42% reads aligned and 82.06% and 63.77% properly paired, respectively. The poor properly paired reads of the BES libraries of mink and 40 kb library of the ferret assembly came mostly from two ends aligned to different scaffolds (Supplementary Table S3 and Supplementary Table S4). These comparisons showed that we had a comparable genome assembly for mink similar to ferret.

The completeness of the genes represented in the assembly was assessed by BUSCO version 2³¹. The mammalian orthologous gene set (4,104 genes) was used to assess the presence of standard genes in the genome assembly. The same analysis was also conducted with single-individual (Pearl mink and Brown mink) assembly and ferret genome. Results showed that 98% (95.8% complete and 2.2% fragmentary) of these 4,104 genes could be detected in the draft mink assembly (Supplementary Table S5). We could see the final scaffolding using the simulated mate-pair data improved the complete from 93.80% (PMA) and 94.9% (BMA) to 95.80%, fragment genes decreased from 3.8% and 2.7% to 2.2% and missing genes decreased from 2.4% (PMA and BMA) to 2.0% (Supplementary Table S5). The results from the draft assembly were comparable with that of ferret and showed that our strategy to build the draft assembly improved the assembly of coding sequence regions. Moreover, we also checked the 45S rDNA (ribosomal DNA) completeness in the assembly, as other species whose genome were built from second-generation sequencing; we also could not detect the complete 45S rDNA in the assembly³². This mostly came from the difficulty to assemble tandem repeat cluster from short reads³³.

Repeat annotation in the mink genome and comparison to related species

RepeatModeler³⁴ was used to construct the species-specific repeat library for mink and RepeatExplorer³⁵ was used to analyze the identified repeat families. RepeatExplorer identified three novel satellite repeats in mink. One of these satellites was similar to the Mustela putorius 1080 bp Bam HI repeat DNA (GenBank: x59440.1). Satellite repeats constituted around 2.26% of the total mink genome based on the result of RepeatExplorer (Supplementary Figure S1). RepeatModeler identified 285 repeat consensuses in mink. Among these, LINE (long interspersed nuclear elements) and LTR (long terminal repeat) had the highest number of family members (Supplementary Table S6). Combining the output of these two analyses and a repeat database of the dog genome³⁶, we built a comprehensive mink repeat database for RepeatMasker. At the end, 29% of the mink genome was found to be composed of repeat sequences (Table 2). Adding the low represented satellite repeats sequence from the RepeatMasker result, we had around 31% of mink genome sequence belonging to repeat sequence. Our assembly have 10% less sequence than estimated genome size and part of these sequences probably are repeat sequence, so the real repeat content of mink might be larger than we observed now. The dog genome and cat genome have 43% and 44% of repeat sequence (http://www.repeatmasker.org) according to the pre-analysis genome from RepeatMasker website. Since there are still some part of repeat elements that are missing in dog and cat assembly^37,38, the repeat content will be slightly higher than 43% for dog and 44% for cat. Therefore, we believe mink have less repeat content than dog and cat. The most abundant repeat family LINEs is shared by these three genomes: around 20% of the dog genome, 21% of the cat genome and 15% of the mink genome consist of LINEs. In all three species, the second most abundant repeat family is short interspersed nuclear elements (SINE). These constitute 11%, 11% and 7% of the genomes respectively. The dynamics of the repeat sequence is largely shaping the genome³⁹. Therefore, we computed the substitution level of repeat sequences in these three species by aligning all elements belonging to the same family to consensus sequences in each species. As shown in Fig. 1, the dog and cat genomes share one major peak of substitution level at 30–35. Dog has another small substitution level peak around 0–10, while cat has a small substitution level peak around 5–10. In contrast, the mink assembly has only one substitution level peak around 10–25. Moreover, the peaks for LINEs and SINEs are similar in dog and cat. In dog and cat, the peak for LINEs are at 5, 20 and 30–35. For SINEs, there is a consistence pattern with LINEs in these two species. However, the peak for LINEs and SINEs are different in mink genome. The LINEs in mink has a peak at around 20–25. The SINEs in mink has a peak at around 10–20. These results showed that more repeat sequences in dog and cat have high identity compared to repeat families’ consensus sequences. This implied that dog and cat have more recently active repeat sequences, especially LINEs and SINEs. This could partially explains why dog and cat genomes have higher amounts of repeat sequence than mink genome, because dog and cat genome have more recently inserted repeat sequences compared to the mink genome.

Table 2 The repeat sequence composition of mink genome.

Full size table

Alignment with the dog genome assembly

The mink and the ferret genome assemblies were aligned to the dog genome assembly and similar alignment patterns were observed (Fig. 2A for mink, Fig. 2B for ferret and Supplementary Figures S2–S38). A total of 1,525,079,306 bp of the mink genome assembly and 1,516,552,860 bp of the ferret genome assembly could be aligned to the dog genome. The distributions of synteny block sizes were similar for the two species (Table 3). Both the mink and the ferret assemblies shared a number of chromosomal rearrangements when compared to the dog genome (Table 4, Fig. 2 and Supplementary Figures S2–S38). For example, scaffold 66 (nn66) of mink and scaffold 37 (mf37) of ferret could be aligned to dog chromosome 1 (Fig. 2) and dog chromosome 12 (Supplementary Figure S12). Likewise part of nn4 and mf7 could be aligned to part of dog chromosome 14, whereas other part of nn4 and mf7 could be aligned to the reversed sequence of other part of dog chromosome 14 (Supplementary Figure S14). First, these findings demonstrated that the scaffold structure is consistent between the mink and the ferret, some of the same rearrangements were detected when comparing to the dog assembly. Second, they identified specific rearrangements that must have happened since the last common ancestor of the three species and either the current dog population or the last common ancestor of mink and ferret.

Table 3 The size distribution of synteny block.

Full size table

Table 4 Chromosome rearrangement of dog compared to mink and ferret.

Full size table

Gene annotation and orthologous analysis

Combining ab inito gene prediction, protein alignment and transcriptome assembly⁴⁰, we identified 21,053 protein-coding genes in the mink genome assembly. To create a detailed annotation of these genes, mink orthologous gene families were identified by scanning EggNOG⁴¹ mammalian database with HMMER3⁴². Moreover, two comparison analyses were performed by comparing mink orthologous gene families with three mammalian genomes namely human, mouse, dog, and with four Carnivora species namely dog, cat, panda and ferret. The mink proteome contained 14,066 orthologous gene families containing 17,052 genes. Among these, 11,477 gene families in mink were shared by all other three mammalian genomes compared here (Fig. 3A). Subsequently we checked the sequence similarity of mink genes identified here against genes from all species deposited in the OrthoMCL database⁴³. Out of 15,608 genes which could be identified in the OrthoMCL data, 7,645 genes were most similar to dog genes compared with other species (Supplementary File 2). Gene family comparison among five Carnivora species (Fig. 3B) showed that our mink genome assembly has 841 orthologous gene families (990 genes) not shared with these four Carnivora genomes. The overlap of mink unique orthologous groups in Fig. 3A and Fig. 3B was 355. We further checked the 4,001 genes that were not mapped to any mammalian gene families. 1,119 of them matched other animals’ genes (EggNOG animal collection). These data suggested either mink retains a large amount of genes that may have been lost in other Carnivora genomes and mammalian genomes or we have some putative pseudogenes that show similarity to proteins from other kingdoms.

Integrating the linkage map with the reference assembly and identification of fur quality genes

The linkage maps are useful tool for genetic research. We downloaded 103 microsatellite (SSR) marker primers or whole clone sequences¹². These SSR markers were mapped to the mink genome assembly and 71 of them were able to anchor in the assembly. A total of 51 scaffolds harbored a single marker. Seven scaffolds had two markers. Among these, twelve markers were mapped relatively close to each other validating both the genetic and the physical distances. Two scaffolds had three markers, and all of them were located close to each other in same order as in the linkage map (Supplementary Table S7). These results demonstrated the agreement of the draft genome assembly with the linkage map. We mapped 21 fur color and fur quality genes (Supplementary Table S8) to the mink genome assembly, and compared their physical locations on the genome assembly with the physical locations of SSR markers within previously associated QTLs¹³ for fur properties. Four genes of the 21 candidates were located in QTL intervals (Table 5). According to Uniprot⁴⁴, HLADRB1 (HLA-DR beta chain) is involved in membrane development and immune response. MITF (microphthalmia-associated transcription factor) may play a vital role in regulation of tyrosinase (TYR) and tyrosinase-related protein 1 (TYRP1) and also in the differentiation of various types of cell^45,46. AGRP (agouti-related protein) may be involve in feeding behavior⁴⁷. Finally PMEL (melanocyte protein) involved in the biogenesis of melanosomes⁴⁸. Even though we found these genes close to fur quality QTLs, we could not conclude that variation in these genes was responsible for the QTL affecting fur quality. However, it demonstrated that the draft genome serves as a platform for multiple research approaches.

Table 5 The genes close to fur quality QTLs.

Full size table

Discussion

With the development of next generation sequencing technology, genome assembly becomes feasible for non-model organisms. Even though the assembly process is still computationally challenging for large genome, reference genomes for more and more species have been published⁴⁹. Instead of following the conventional strategy, we assembled sequences from two individual minks, then some of the sequencing libraries were used to build a combined assembly and finally, simulated long insert reads from one individual were used to scaffolding combined assembly to build a consensus draft assembly. The reasons to adopt this strategy are the following: Firstly, ALLPATHS-LG²⁶ has a well-developed algorithm to assemble the consensus sequence for polymorphic regions. Secondly, the sequencing of human⁵⁰ and cattle⁵¹ reference genomes was also based on multiple individuals. Thirdly, the computation power to assemble combined dataset from both individuals exceeded the available 1TB server capacity. We also tried a different less memory-consuming assembler, however the result was smaller scaffold N50 and more scaffolds number. Finally, the available algorithms for merging assemblies were all not suitable for large genomes so we used the simulated MP data and additional scaffolding to achieve this.

American mink (Neovison vison) has an estimated genome size of 2.7 Gb²⁹ and among the assemblies of large genome species, we achieved a competitive number of N50. We assembled 7,175 scaffolds with an N50 of 6.3 Mb. Among published genome assemblies, this number ranges from 10 kb (two-toed sloth) to 47.0 Mb (horse)⁵². Alignment of three sequencing libraries and two external BAC-end libraries (Supplementary Table S3) confirmed the validity of our assembly. Moreover, compared with the sequence alignment of ferret (Supplementary Table S4) we have achieved a comparable assembly. The adverse results of properly paired reads of two BES libraries and the ferret 40 kb library mostly came from the pairs aligned to different scaffolds. The assessment using BUSCO V2 with a mammalian orthologous gene set showed our assembly represented genes well with 95.80% complete genes, 2.2% of genes fragmented and 2.0% of genes missing (Supplementary Table S5). All of above results showed we have a high quality mink genome. The quality of the assembly largely influences the following analysis, as we can see from the BUSCO V2 result, the draft assembly has 2% more complete mammalian single copy-gene detected compared with the Pearl-mink-assembly (Supplementary Table S5). In addition, the genome alignment also depends on high quality assembly. The less fragment assembly not only largely reduces the complexity of genome alignment but also helps to detect the genome rearrangement between species. To further improve our assembly, long reads like PacBio⁵³ can help to increase the contiguity of assembly⁵⁴ and to reduce the ambiguous sequence in the assembly⁵⁵. To solve further distance than the libraries we included in our assembly, we need Optical mapping data⁵⁶ or chromatin interaction data⁵⁷. With the help with long reads, we will have better genome annotation.

The mink genome showed less repeat sequence compared to the dog and cat genomes. However, all of these genomes have LINEs being the most abundant and SINEs being the second. The similarity of the genome repeats composition may be caused by the relatively recent separation of Carnivora species²⁰. However, from the substitution level of these three species, we can see the difference in the dynamics of repeat sequence. As shown in Fig. 1, the dog and cat genomes have more repeat sequence elements and showed high identity to consensus sequence judged from the low substitution rate. In addition, the newly inserted elements are easier to detect in assembly. This suggests that dog and cat genomes have more recently inserted repeat sequences compared to mink. We know eukaryotic genomes have different amount of repeat sequence and also different genome structure because of the differential propagation and deletion of these elements⁵⁸. Even within the closely related species like dog, cat and mink, the repeat sequence dynamics are different. By the genome alignment of mink and ferret to dog, we identified several specific rearrangements that must have happened since the last common ancestor of the three species and either the current dog population or the last common ancestor of mink and ferret. These findings will help us to better understand the evolution of Carnivora.

Genomic selection⁵⁹ has revolutionized breeding of several livestock animals. This constitutes a paradigmatic shift from the time before reference genomes were available⁶⁰. A reference genome assembly allows reliable identification of large numbers of markers and thereby facilitates application of genomic selection in practice. A reference genome assembly will also greatly improve breeding using genomics tools in the mink industry. With the reference, we can integrate available genetic research results within the genome and make them more sharable between different research and development groups working in mink breeding. For example, by combining the linkage map, the location of fur quality genes and fur quality QTLs¹³, we can search for genetic variants contributing to fur quality. The next step after the reference mink genome would be generating abundant markers covering the whole genome including SNPs which are the markers of choice today⁶¹. With this, the application of genomic selection and performing genome-wide association analysis will become feasible in mink.

Methods

Genomic data generation and genome assembly

The first mink sequenced was a male pearl American mink (N. vison) individual from the Aarhus University farm, Denmark. Genomic DNA was isolated and sequencing was performed by AROS (http://arosab.com). In order to use the ALLPATHS-LG pipeline⁶², we designed one overlapping pair-end library (165 bp, 100 PE) and two long insert size mate-pair library (3 kb and 5 kb, 100 PE). All data were generated by HiSeq. 2500 platform (Illumina Inc. San Diego, CA, USA). The total data were 163.2 Gb for pair-end sequence and 184.3 Gb for mate-pair sequence. The second animal sequenced was a brown mink female individual obtained from a private farm. For de novo assembly using ALLPATHS-LG sequencing libraries with different insert sizes were applied. Illumina libraries for the HiSeq. 2000 platform were generated following manufacturer’s protocol: two paired-end libraries with overlapping 150 bp reads and 600 bp inserts, respectively; three mate-pair libraries with 3 kb, 6 kb and 10 kb inserts. In addition, 8 kb, 14 kb and 32 kb insert libraries were acquired through Eurofins (www.eurofinsdna.com). Sequencing data of pearl mink is deposited on European Nucleotide Archive (ENA, http://www.ebi.ac.uk/ena) with accession number ERR1676595-ERR1676603 and sequencing data for brown mink is under project PRJEB16307. Due to the limitation of server memory, we performed three assemblies. The first was the Pearl-mink-assembly (PMA), which used all sequencing data from Pearl mink. The second one was the Brown-mink-assembly (BMA), which used part of Brown mink data (150 bp, 600 bp, 3 kb, 5 kb, 6 kb, 8 kb, part of 10 kb, 14 kb and 32 kb). The third assembly was hybrid-assembly (HA) with 165 bp PE library, 3 kb and 5 kb MP libraries from the Pearl mink and part of 600 bp PE library, 8 kb, part of 10 kb, 14 kb and 32 kb MP libraries from the Brown mink. To allow for genetic differences between the two individuals, we set PLOIDY = 2 and HAPLOIDIFY = True to ALLPATHS-LG, which helps to assemble the consensus sequence of polymorphic regions. Quality control and error correction were automated procedure within ALLPATHS-LG. Ultimately the hybrid-assembly and the Brown-mink-assembly were chose to construct the consensus draft assembly. In order to merge these two assemblies, we simulated 10k, 20k and 40k MP data from the second assembly with ART²⁸ followed by a re-scaffolding of the HA by SSPACE²⁷. The details of the libraries were listed in Supplementary Table 1S. The final assembly is deposited on ENA with accession ERZ337136.

Assembly assessment

To check the correctness of the assembly and additional scaffolding procedure, we mapped genome sequencing data of the 150 bp, 600 bp and 3 kb libraries to the assembly using BWA⁶³. In order to compare with the ferret assembly, ferret sequence libraries 180 bp (SRR085065), 3 kb (SRR085064), 6 kb –10 kb (SRR253162) and 40 kb (SRR253149) were downloaded from DNAnexus (https://www.dnanexus.com/) and aligned to the ferret genome with the same procedure. BWA men infers the reads orientation and the insert size during alignment and the properly paired reads were reported from SAMtools⁶⁴ flagstat. Two external BES (BAC-end sequence) libraries^5,30 were also used to check the correctness of the draft assembly. Firstly, the two BAC-ends of the same BAC clone were collected and renamed; BAC clones missing one of the end reads were removed. Then, the BAC-end sequences were aligned using BWA⁶³. If two ends of the same BES aligned to different scaffolds or only one end was mapped, the BAC was marked as not properly paired. Also, BUSCO v2³¹ was used to check the completeness of genome assembly using vertebrate core genes. We ran BUSCO with the mammalian orthologous gene set (4,104 genes), calculated the complete, fragment, and lost genes in the assembly. We also used RNAmmer⁶⁵ to detect the 45S ribosomal DNA cluster in our assembly.

Repeat sequence annotation

RepeatExplorer³⁵ and RepeatModeler³⁴ were used to perform de novo prediction of novel repeat sequences in the mink genome before running RepeatMasker. RepeatExplorer’s result was analyzed by aligned contigs against the NR database⁶⁶ using BLAST⁶⁷. Clusters containing non-repeat sequences were removed from the result. Consensus satellite repeat sequences were extracted by Tandem Repeat Finder⁶⁸. The RepeatModeler pipeline was used to obtain a consensus sequence for each repeat family. Finally, results from RepeatExplorer and RepeatModeler were combined with the dog repeat sequence database³⁶ to construct a repeat database for mink. We used this repeat database with RepeatMasker⁶⁹ to annotate repetitive sequences in the mink genome. The substitution level calculation and plots were done using calcDivergenceFromAlign.pl and createRepeatLandscape.pl scripts provided with RepeatMasker.

Genome alignment

The soft-masked genomic sequence of mink and ferret (MusPutFur 1.0) were aligned to soft-masked dog (CanFam 3.1) genome downloaded⁷⁰ from Ensembl using LASTZ⁷¹. Before alignment, we removed all the sequences named Unknown from dog genome. The pairwise genome alignment was chained according to their location in both genomes. The netting process chose for the reference species the best sub-chain in each region. A custom-made python script conducted the statistic of the block size. The genome ring figures were generated by Circos⁷².

Gene annotation

The whole procedure of annotation consisted of ab initio gene prediction, homology-based prediction and RNA-seq. The information was merged together by the EVM⁴⁰ weighted algorithm to build a consensus gene set. 1) AUGUSTUS⁷³ with human parameter settings was used to perform the ab initio gene prediction. 2) Protein alignment was performed by Exonerate⁷⁴ and Spaln⁷⁵ using the Uniprot⁴⁴ database. 3) An American mink transcriptome was available¹⁴. In order to improve the annotation, we re-analyzed this RNA-seq data set (PRJEB1260). Using Trinity⁷⁶, we performed both de novo transcriptome assembly and also genome-guided transcriptome assembly. Both assemblies were then used in the annotation pipeline. PASApipeline⁷⁷ was used to generate gene structures from the two transcriptome assemblies and build a comprehensive transcriptome database. To avoid false positives generated during transcriptome assembly, we did not use it directly as evidence. Instead, we used PASApipeline. PASApipeline will first align transcripts to the genome and perform a new assembly based on its alignment, so we will have a PASA_alignment and a PASA_assembly for evidences for annotation. Then all evidences were combined by EVM setting weights for AUGUSTUS to 1, for Exonerate to 4, for Spaln to 4, for PASA_alignment to 1 and for PASA_assembly to 10.

Orthologous gene families

The amino acid sequences from the mink genome were extracted from annotation and scanned against EggNOG⁴¹ mammalian database using HMMER3⁴². The best hit to the database for each gene was used to identify the orthologous groups. Mink orthologous groups were compared with human (GRCh37), mouse (NCBIM37) and dog (BROADD2). Similarity of mink genes with genes in other genomes was investigated using the OrthoMCL⁴³ online service. For each gene in the mink genome, the genome with the closest match was identified. Likewise all the orthologous groups containing dog (CanFam 3.1), cat (Felis_catus_6.2), panda (ailMel1) and ferret (MusPutFur 1.0) were extracted to perform the same comparison among Carnivora species. Finally, genes which could not be assigned to any mammalian gene family were scanned against the EggNOG⁴¹ animal database using HMMER3⁴².

Integration of the linkage map with assembly and identification of fur quality genes

The most recent version of mink linkage map¹² was obtained. The forward and reverse primer of each microsatellite (SSR) marker was extracted. For microsatellites where the primer information was not available, we mapped the whole clone sequence which was used to design primers to genome assembly. Totally, 103 markers out of 104 markers were kept for analysis. For primer mapping, both primers mapped and distances in the range 200 to 300 bp were considered correct; for clone sequence alignment, we required full alignment with few mismatches. Potential fur quality and color gene set⁵ was mapped to genome using Exonerate⁷⁴. We located the previously reported QTL¹³ interval by SSR markers and compared the scaffold location of these markers with the locations of genes. The genes located in the interval were extracted and annotated by Uniprot⁴⁴.

Availability of data

The datasets generated and/or analyzed during the current study are available on European Nucleotide Archive (ENA, http://www.ebi.ac.uk/ena) with accession number ERR1676595-ERR1676603 for Pearl mink; project PRJEB16307 for brown mink and ERZ337136 for the draft assembly.

References

Harding, L. E. & Smith, F. A. Mustela or Vison? Evidence for the taxonomic status of the American mink and a distinct biogeographic radiation of American weasels. Mol Phylogenet Evol 52, 632–642 (2009).
Article CAS PubMed Google Scholar
Abramov, A. A taxonomic review of the genus Mustela (Mammalia, Carnivora). Zoosystematica Rossica 8, 357 (2000).
Google Scholar
Kurose, N., Abramov, A. V. & Masuda, R. Molecular phylogeny and taxonomy of the genus Mustela (Mustelidae, Carnivora), inferred from mitochondrial DNA sequences: New perspectives on phylogenetic status of the back-striped weasel and American mink. Mammal Study 33, 25–33 (2008).
Article Google Scholar
Manville, R. H. The extinct sea mink, with taxonomic notes. (Smithsonian Institution, 1966).
Anistoroaei, R., ten Hallers, B., Nefedov, M., Christensen, K. & de Jong, P. Construction of an American mink bacterial artificial chromosome (BAC) library and sequencing candidate genes important for the fur industry. BMC Genomics 12, 354 (2011).
Article CAS PubMed PubMed Central Google Scholar
Nie, W. et al. Chromosomal rearrangements and karyotype evolution in carnivores revealed by chromosome painting. Heredity (Edinb) 108, 17–27 (2012).
Article CAS Google Scholar
Rubtsov, N. et al. Zoo-FISH with region-specific paints for mink chromosome 5q: delineation of inter-and intrachromosomal rearrangements in human, pig, and fox. Cytogenetic and Genome Research 90, 268–270 (2000).
Article CAS Google Scholar
Rettenberger, G. et al. ZOO-FISH analysis: cat and human karyotypes closely resemble the putative ancestral mammalian karyotype. Chromosome Research 3, 479–486 (1995).
Article CAS PubMed Google Scholar
Hameister, H. et al. Zoo-FISH analysis: the American mink (Mustela vison) closely resembles the cat karyotype. Chromosome Research 5, 5–11 (1997).
Article CAS PubMed Google Scholar
Anistoroaei, R., Menzorov, A., Serov, O., Farid, A. & Christensen, K. The first linkage map of the American mink (Mustela vison). Anim Genet 38, 384–388 (2007).
Article CAS PubMed Google Scholar
Anistoroaei, R. et al. An extended anchored linkage map and virtual mapping for the American mink genome based on homology to human and dog. Genomics 94, 204–210 (2009).
Article CAS PubMed Google Scholar
Anistoroaei, R. et al. A re-assigned American mink (Neovison vison) map optimal for genome-wide studies. Gene 511, 66–72 (2012).
Article CAS PubMed Google Scholar
Thirstrup, J. P. et al. Identifying QTL and genetic correlations between fur quality traits in mink (Neovison vison). Anim Genet 45, 105–110 (2014).
Article CAS PubMed Google Scholar
Christensen, K. & Anistoroaei, R. An American mink (Neovison vison) transcriptome. Anim Genet 45, 301–303 (2014).
Article CAS PubMed Google Scholar
Venter, J. C. et al. The sequence of the human genome. Science 291, 1304–1351 (2001).
Article CAS PubMed ADS Google Scholar
Schuster, S. C. Next-generation sequencing transforms today’s biology. Nature 200, 16–18 (2007).
Google Scholar
Lindblad-Toh, K. et al. Genome sequence, comparative analysis and haplotype structure of the domestic dog. Nature 438, 803–819 (2005).
Article CAS PubMed ADS Google Scholar
Montague, M. J. et al. Comparative analysis of the domestic cat genome reveals genetic signatures underlying feline biology and domestication. Proceedings of the National Academy of Sciences 111, 17230–17235 (2014).
Article CAS ADS Google Scholar
Peng, X. et al. The draft genome sequence of the ferret (Mustela putorius furo) facilitates study of human respiratory disease. Nat Biotechnol 32, 1250–1255 (2014).
Article CAS PubMed PubMed Central Google Scholar
Nyakatura, K. & Bininda-Emonds, O. R. Updating the evolutionary history of Carnivora (Mammalia): a new species-level supertree complete with divergence time estimates. BMC biology 10, 1 (2012).
Article Google Scholar
Kukekova, A. V. et al. Chromosomal mapping of canine-derived BAC clones to the red fox and American mink genomes. J Hered 100(Suppl 1), S42–53 (2009).
Article CAS PubMed PubMed Central Google Scholar
Trapnell, C. et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nature protocols 7, 562–578 (2012).
Article CAS PubMed PubMed Central Google Scholar
Park, P. J. ChIP-seq: advantages and challenges of a maturing technology. Nature reviews. Genetics 10, 669–680 (2009).
Article CAS PubMed PubMed Central Google Scholar
Daetwyler, H. D. et al. Whole-genome sequencing of 234 bulls facilitates mapping of monogenic and complex traits in cattle. Nature genetics 46, 858–865 (2014).
Article CAS PubMed Google Scholar
Hayes, B. & Goddard, M. Genome-wide association and genomic selection in animal breeding. Genome 53, 876–883 (2010).
Article CAS PubMed Google Scholar
Gnerre, S. et al. High-quality draft assemblies of mammalian genomes from massively parallel sequence data. Proc Natl Acad Sci USA 108, 1513–1518 (2011).
Article CAS PubMed ADS Google Scholar
Boetzer, M., Henkel, C. V., Jansen, H. J., Butler, D. & Pirovano, W. Scaffolding pre-assembled contigs using SSPACE. Bioinformatics 27, 578–579 (2011).
Article CAS PubMed Google Scholar
Huang, W., Li, L., Myers, J. R. & Marth, G. T. ART: a next-generation sequencing read simulator. Bioinformatics 28, 593–594 (2012).
Article PubMed CAS Google Scholar
Skorupski, J. 70 years of research on the American mink (Neovison vison schreb., 1777) genetics - where are we now? Genetika 47, 357–373 (2015).
Article Google Scholar
Benkel, B. F. et al. A comparative, BAC end sequence enabled map of the genome of the American mink (Neovison vison). Genes & Genomics 34, 83–91 (2012).
Article CAS Google Scholar
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
Article PubMed CAS Google Scholar
Seemann, S. E., Anthon, C., Palasca, O. & Gorodkin, J. Quality Assessment of Domesticated Animal Genome Assemblies. Bioinformatics and Biology insights 9, 49–58 (2015).
PubMed Google Scholar
Alkan, C., Sajjadian, S. & Eichler, E. E. Limitations of next-generation genome sequence assembly. Nature methods 8, 61–65 (2011).
Article CAS PubMed Google Scholar
Smit, A. & Hubley, R. RepeatModeler Open-1.0. Repeat Masker Website (2010).
Novák, P., Neumann, P., Pech, J., Steinhaisl, J. & Macas, J. RepeatExplorer: a Galaxy-based web server for genome-wide characterization of eukaryotic repetitive elements from next-generation sequence reads. Bioinformatics 29, 792–793 (2013).
Article PubMed CAS Google Scholar
Jurka, J. et al. Repbase Update, a database of eukaryotic repetitive elements. Cytogenetic and genome research 110, 462–467 (2005).
Article CAS PubMed Google Scholar
Hayden, K. E. & Willard, H. F. Composition and organization of active centromere sequences in complex genomes. BMC genomics 13, 324 (2012).
Article CAS PubMed PubMed Central Google Scholar
Pontius, J. U. & O’brien, S. J. Artifacts of the 1.9 × Feline Genome Assembly Derived from the Feline-Specific Satellite Sequence. Journal of Heredity 100, S14–S18 (2009).
Article CAS PubMed PubMed Central Google Scholar
Kazazian, H. H. Mobile elements: drivers of genome evolution. science 303, 1626–1632 (2004).
Article CAS PubMed ADS Google Scholar
Haas, B. J. et al. Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome Biol 9, R7 (2008).
Article PubMed PubMed Central CAS Google Scholar
Powell, S. et al. eggNOGv3. 0: orthologous groups covering 1133 organisms at 41 different taxonomic ranges. Nucleic acids research 40, D284–D289 (2012).
Article CAS PubMed Google Scholar
Eddy, S. R. A new generation of homology search tools based on probabilistic inference. in Genome Inform. 23, 205–211 (2009).
Google Scholar
Chen, F., Mackey, A. J., Stoeckert, C. J. & Roos, D. S. OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups. Nucleic acids research 34, D363–D368 (2006).
Article CAS PubMed Google Scholar
Consortium, U. Reorganizing the protein space at the Universal Protein Resource (UniProt). Nucleic acids research 40, D71–D75 (2011).
Article CAS Google Scholar
Takeda, K. et al. Ser298 of MITF, a mutation site in Waardenburg syndrome type 2, is a phosphorylation site with functional significance. Human molecular genetics 9, 125–132 (2000).
Article CAS PubMed Google Scholar
Genovese, G. et al. The tumor suppressor HINT1 regulates MITF and β-catenin transcriptional activity in melanoma cells. Cell Cycle 11, 2206–2215 (2012).
Article CAS PubMed Google Scholar
Yang, Y.-k et al. Characterization of Agouti-related protein binding to melanocortin receptors. Molecular Endocrinology 13, 148–155 (1999).
Article CAS PubMed Google Scholar
Berson, J. F., Harper, D. C., Tenza, D., Raposo, G. & Marks, M. S. Pmel17 initiates premelanosome morphogenesis within multivesicular bodies. Molecular biology of the cell 12, 3451–3464 (2001).
Article CAS PubMed PubMed Central Google Scholar
Koepfli, K. P., Paten, B., Genome, K. Co. S. & O’Brien, S. J. The Genome 10K Project: a way forward. Annu Rev Anim Biosci 3, 57–111 (2015).
Article CAS PubMed Google Scholar
Lander, E. S. et al. Initial sequencing and analysis of the human genome. Nature 409, 860–921 (2001).
Article CAS PubMed ADS Google Scholar
Elsik, C. G., Tellam, R. L. & Worley, K. C. The genome sequence of taurine cattle: a window to ruminant biology and evolution. Science 324, 522–528 (2009).
Article PubMed PubMed Central ADS CAS Google Scholar
Lindblad-Toh, K. et al. A high-resolution map of human evolutionary constraint using 29 mammals. Nature 478, 476–482 (2011).
Article CAS PubMed PubMed Central Google Scholar
Eid, J. et al. Real-time DNA sequencing from single polymerase molecules. Science 323, 133–138 (2009).
Article CAS PubMed ADS Google Scholar
Pendleton, M. et al. Assembly and diploid architecture of an individual human genome via single-molecule technologies. Nature methods 12, 780–786 (2015).
Article CAS PubMed PubMed Central Google Scholar
Bickhart, D. M. et al. Single-molecule sequencing and conformational capture enable de novo mammalian reference genomes. bioRxiv, 064352 (2016).
Latreille, P. et al. Optical mapping as a routine tool for bacterial genome sequence finishing. BMC genomics 8, 321 (2007).
Article PubMed PubMed Central CAS Google Scholar
Burton, J. N. et al. Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions. Nature biotechnology 31, 1119–1125 (2013).
Article CAS PubMed PubMed Central Google Scholar
Eichler, E. E. & Sankoff, D. Structural dynamics of eukaryotic chromosome evolution. science 301, 793–797 (2003).
Article CAS PubMed ADS Google Scholar
Hayes, B. & Goddard, M. Prediction of total genetic value using genome-wide dense marker maps. Genetics 157, 1819–1829 (2001).
PubMed PubMed Central Google Scholar
Hayes, B. J., Lewin, H. A. & Goddard, M. E. The future of livestock breeding: genomic selection for efficiency, reduced emissions intensity, and adaptation. Trends Genet 29, 206–214 (2013).
Article CAS PubMed Google Scholar
Garvin, M. R., Saitoh, K. & Gharrett, A. J. Application of single nucleotide polymorphisms to non-model species: a technical review. Mol Ecol Resour 10, 915–934 (2010).
Article CAS PubMed Google Scholar
Butler, J. et al. ALLPATHS: de novo assembly of whole-genome shotgun microreads. Genome research 18, 810–820 (2008).
Article CAS PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Article CAS PubMed PubMed Central Google Scholar
Li, H. et al. The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
Article PubMed PubMed Central CAS Google Scholar
Lagesen, K. et al. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic acids research 35, 3100–3108 (2007).
Article CAS PubMed PubMed Central Google Scholar
Pruitt, K. D., Tatusova, T. & Maglott, D. R. NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic acids research 35, D61–D65 (2007).
Article CAS PubMed Google Scholar
Johnson, M. et al. NCBI BLAST: a better web interface. Nucleic acids research 36, W5–W9 (2008).
Article CAS PubMed PubMed Central Google Scholar
Benson, G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic acids research 27, 573 (1999).
Article CAS PubMed PubMed Central Google Scholar
Tarailo‐Graovac, M. & Chen, N. Using RepeatMasker to identify repetitive elements in genomic sequences. Current Protocols in Bioinformatics, 4.10. 11-14.10. 14 (2009).
Flicek, P. et al. Ensembl 2013. Nucleic Acids Res 41, D48–55 (2013).
Article CAS PubMed Google Scholar
Harris, R. S. Improved pairwise alignment of genomic DNA. (ProQuest, 2007).
Krzywinski, M. et al. Circos: an information aesthetic for comparative genomics. Genome research 19, 1639–1645 (2009).
Article CAS PubMed PubMed Central Google Scholar
Stanke, M. et al. AUGUSTUS: ab initio prediction of alternative transcripts. Nucleic acids research 34, W435–W439 (2006).
Article CAS PubMed PubMed Central Google Scholar
Slater, G. S. & Birney, E. Automated generation of heuristics for biological sequence comparison. BMC Bioinformatics 6, 31 (2005).
Article PubMed PubMed Central CAS Google Scholar
Iwata, H. & Gotoh, O. Benchmarking spliced alignment programs including Spaln2, an extended version of Spaln that incorporates additional species-specific features. Nucleic Acids Res 40, e161 (2012).
Article CAS PubMed PubMed Central Google Scholar
Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nature biotechnology 29, 644–652 (2011).
Article CAS PubMed PubMed Central Google Scholar
Haas, B. J. et al. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic acids research 31, 5654–5666 (2003).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This research was supported by the Center for Genomic Selection in Animals and Plants (GenSAP) partially funded by Innovation Fund Denmark (grant 0603–00519B).

Author information

Bernt Guldbrandtsen and Frank Panitz contributed equally to this work.

Authors and Affiliations

Center for Quantitative Genetics and Genomics, Department of Molecular Biology and Genetics, Aarhus University, DK-8830, Tjele, Denmark
Zexi Cai, Goutam Sahana, Mogens Sandø Lund & Bernt Guldbrandtsen
DTU Bioinformatics, Department of Bio and Health Informatics, Technical University of Denmark, 2800 Kgs, Lyngby, Denmark
Bent Petersen
Section for Molecular Genetics and Systems Biology, Department of Molecular Biology and Genetics, Aarhus University, DK-8830, Tjele, Denmark
Lone B. Madsen, Knud Larsen, Bo Thomsen, Christian Bendixen & Frank Panitz
Centre of Excellence for Omics-Driven Computational Biodiscovery (COMBio), Faculty of Applied Sciences, AIMST University, Kedah, Malaysia
Bent Petersen

Authors

Zexi Cai
View author publications
You can also search for this author in PubMed Google Scholar
Bent Petersen
View author publications
You can also search for this author in PubMed Google Scholar
Goutam Sahana
View author publications
You can also search for this author in PubMed Google Scholar
Lone B. Madsen
View author publications
You can also search for this author in PubMed Google Scholar
Knud Larsen
View author publications
You can also search for this author in PubMed Google Scholar
Bo Thomsen
View author publications
You can also search for this author in PubMed Google Scholar
Christian Bendixen
View author publications
You can also search for this author in PubMed Google Scholar
Mogens Sandø Lund
View author publications
You can also search for this author in PubMed Google Scholar
Bernt Guldbrandtsen
View author publications
You can also search for this author in PubMed Google Scholar
Frank Panitz
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.S.L., B.G., G.S., B.T., F.G. and C.B. conceived and designed the study. Z.C., B.P. and F.P. performed the analyses. L.B.M. and K.L. performed experiments. M.S.L., B.G. and C.B. provided facilities. Z.C. wrote the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Zexi Cai.

Ethics declarations

Competing Interests

The authors declare that they have no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary Information

Supplementary Dataset

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Cai, Z., Petersen, B., Sahana, G. et al. The first draft reference genome of the American mink (Neovison vison). Sci Rep 7, 14564 (2017). https://doi.org/10.1038/s41598-017-15169-z

Download citation

Received: 09 May 2017
Accepted: 23 October 2017
Published: 06 November 2017
DOI: https://doi.org/10.1038/s41598-017-15169-z

This article is cited by

Strong selection signatures for Aleutian disease tolerance acting on novel candidate genes linked to immune and cellular responses in American mink (Neogale vison)
- Seyed Milad Vahedi
- Siavash Salek Ardestani
- K. Fraser Clark
Scientific Reports (2024)
A chromosome-level genome assembly reveals genomic characteristics of the American mink (Neogale vison)
- Karim Karimi
- Duy Ngoc Do
- Younes Miar
Communications Biology (2022)
Detection of selection signatures for response to Aleutian mink disease virus infection in American mink
- Karim Karimi
- A. Hossain Farid
- Younes Miar
Scientific Reports (2021)
Semiaquatic mammals might be intermediate hosts to spread avian influenza viruses from avian to human
- Ping Zhao
- Lingsha Sun
- Chenglong Xiong
Scientific Reports (2019)
SNP markers associated with body size and pelt length in American mink (Neovison vison)
- Zexi Cai
- Trine Michelle Villumsen
- Mogens Sandø Lund
BMC Genetics (2018)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Sequencing and assembly

Assembly assessment

Repeat annotation in the mink genome and comparison to related species

Alignment with the dog genome assembly

Gene annotation and orthologous analysis

Integrating the linkage map with the reference assembly and identification of fur quality genes

Discussion

Methods

Genomic data generation and genome assembly

Assembly assessment

Repeat sequence annotation

Genome alignment

Gene annotation

Orthologous gene families

Integration of the linkage map with assembly and identification of fur quality genes

Availability of data

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links