Abstract
Chinese sturgeon (Acipenser sinensis), a critically endangered Acipenseridae family member, is one of the largest anadromous, native fish in China. Numerous research programmes and protection agencies have focused on breeding and preserving this endangered species. However, available information is limited on the different stages of sex development, especially on the reproductive regulation of the hypothalamus-pituitary-gonad (HPG) axis of A. sinensis. To unravel the mechanism of gene interactions during sex differentiation and gonad development of A. sinensis, we performed transcriptome sequencing using HPG samples from male and female A. sinensis in two developmental stages. In this study, 271.19 Gb high-quality transcriptome data were obtained from 45 samples belonging to 15 individuals (six in stage I, six males and three females in stage II). These transcriptomic data will help us understand the reproductive regulation of the HPG axis in the development stages of A. sinensis and provide important reference data for genomic and genetic studies in A. sinensis and related species.
Design Type(s) | development or differentiation design • transcription profiling design • sequence assembly objective • sequence annotation objective |
Measurement Type(s) | transcription profiling assay |
Technology Type(s) | RNA sequencing |
Factor Type(s) | sex • developmental stage |
Sample Characteristic(s) | Acipenser sinensis • pituitary gland • Hypothalamus • Gonad |
Machine-accessible metadata file describing the reported data (ISA-Tab format)
Similar content being viewed by others
Background & Summary
The order Acipenseriformes comprises 27 extant species of sturgeons, which are considered “living fossils” with a fossil record dating back to the upper Cretaceous1,2. In terms of scientific value, these chondrostean species preserve many transition traits between Chondrichthyes and Osteichthyes, occupying a vital evolutionary position. Sturgeons include three degrees of polyploidy, including ~120, ~240 and ~360 chromosomes, which might result from multiple and independent duplication events3,4. Sturgeons are important species for studying vertebrate evolution and whole genome duplication. Regarding economic value, black caviar is very popular in the European market, and sturgeon meat is also an important food resource5. Unfortunately, most sturgeons are currently endangered6.
Chinese sturgeon (Acipenser sinensis) is a large anadromous fish present in the main stream of the Yangtze River and East China Sea in China5,7. At present, Chinese sturgeon is critically endangered, which is mainly attributed to the following reasons: (i) asynchronous and delayed sexual maturity between male and female Chinese sturgeon (8–18 years for male and 14–26 years for female)6; and (ii) human activities, such as overfishing, damming, shipping and pollution7. The sex ratio (female:male) of wild Chinese sturgeon has gradually increased from 1.1:1 (1981–1983) to 5.86:1 (2003–2004)8. Therefore, artificial propagation and selective release are considered efficient approaches to recover the wild population size and adjust the sex ratio of the wild population of Chinese sturgeon. However, the sexes of A. sinensis cannot be distinguished due to a lack of secondary sexual characteristics and molecular markers of sex identification. Moreover, whether sex differentiation of A. sinensis is determined by genetics, the environment or a combination of both remains unclear.
Sturgeon genomes are complicated due to large genome sizes (1.6–9.32 pg/C), many chromosomes (consisting of macro- and micro-chromosomes) and controversial ploidy4,9,10. To date, no sturgeon reference genome is available. Transcriptome sequencing is very useful for identifying novel genes, checking gene activity, revealing genic functions, and exploring the molecular mechanisms of development and sex differentiation of sturgeon species11,12. Transcriptome analyses regarding sex determination and development of several sturgeons, including A. naccarii13, A. schrenckii14,15,16,17,18, A. sinensis6,19,20, A. gueldenstaedtii21, A. fulvescens11, A. baerii22, and A. dabryanus23,24, have been reported. Transcriptomes between male and female A. sinensis gonads were analysed in a previous study6 where some differential genes between the ovary and testis were predicted, and potential gametogenesis-related genes were screened out. However, identifying the sex determination mechanism of A. sinensis only by analysing genes in gonad tissues is not possible. The regulation of multiple tissues, signals and genes must be considered to unveil the mechanism of sex differentiation and discover sexual markers.
The reproductive process in vertebrates relies on multiple regulatory factors of the hypothalamus-pituitary-gonad (HPG) axis25,26. Gonadal differentiation, development, and maturation in fish are also regulated by many kinds of hormones, such as gonadotropin-releasing hormones, gonadotropin hormones, steroid hormones and other relative hormones, in the HPG axis14,27,28. Recent studies on the HPG axis of A. schrenckii focused only on the genes in the HPG axis from brain tissue by kisspeptin treatment14. However, obtaining transcriptome profiles of whole HPG tissues in sturgeon is necessary to better understand the reproductive mechanisms.
Here, the HPG axis transcriptomes of A. sinensis were sequenced using Illumina HiSeq 2000 and 4000 platforms. A total of 271.19 Gb filtered data from 45 samples belonging to 15 individuals (six individuals in stage I, six males and three females in stage II) were obtained, and 74.50% of 121,952 assembled unigenes were annotated to the NCBI non-redundant protein database (NR), Swiss-Prot, Kyoto Encyclopedia of Genes and Genomes (KEGG), Cluster of Orthologous Groups of proteins (COG), InterPro, Gene Ontology (GO) and NT databases. Additionally, 78,770 unigenes contained complete open reading frames (ORFs). A total of 51,570 simple sequence repeats (SSRs) were detected. We identified 1,531 to 34,439 differentially expressed genes (DEGs) in 18 pairwise comparisons of HPG samples of male and female A. sinensis in two stages. We reported the first integrated transcriptome data from HPG samples of male and female A. sinensis in stage I and II of gonadal development. These data offer valuable resources for research into reproductive regulation and the HPG axis interaction in A. sinensis and other sturgeons.
Methods
Ethics statement
The experiments were performed in accordance with the guidelines and regulations of the National Institute of Health Guide for the Care and Use of Laboratory Animals and were approved by the Institutional Review Board on Bioethics and Biosafety of the Chinese Sturgeon Research Institute.
Sample preparation
Individuals of A. sinensis used here were selected from the artificial breeding population of the Chinese Sturgeon Research Institute, China Three Gorges Corporation (Yichang, China). The stages of gonadal development were classified following previously described methods29. The sex cannot be distinguished by a histochemical assay for gonads in stage I, while it can be easily distinguished in stage II in A. sinensis. Herein, 6 individuals (1 year old) in stage I and 9 individuals (6 males and 3 females over 4 years old) in stage II were selected. All experimental individuals were first euthanized with M222. Three kinds of tissues (hypothalamus (H), pituitary (P) and gonad (G)) were collected from each individual, flash frozen in liquid nitrogen and stored at −80 °C for RNA extraction. A total of 45 samples from 15 individuals (three tissues for each individual) were used for cDNA library construction.
Illumina sequencing and data processing
Total RNA was extracted with TRIzol reagent (Invitrogen) following the manufacturer’s instructions. The RNA purity, integrity and concentration were calculated and checked by an Agilent 2100 Bioanalyzer (Agilent, Santa Clara, CA), and high-quality RNA (RNA Integrity Number > 7.0) was used for Illumina sequencing. Total RNA samples (5 μg) were then subjected to cDNA construction following Illumina truSeq stranded mRNA sample preparation protocol. Next, 45 libraries were sequenced using Illumina HiSeqTM 2000/4000 with 2 × 90 or 100 bp paired-end sequencing. To obtain high-quality reads, the raw reads were filtered by SOAPnuke (v1.5.6)30 with the default parameters except “-l 20 -q 0.2 –M 3” according to the following criteria: (i) reads containing adaptors (adapters of more than 15 bases matched to reads with maximal 3-base mismatches allowed) were discarded, (ii) reads with a high proportion (> 5%) of unknown nucleotides (N) were removed, and (iii) reads with ≥ 20% bases Q ≤ 20 were removed. Here, the filtering parameters were stricter than the default parameters, and reads containing low quality bases or adapters were removed entirely, rather than being trimmed.
Transcriptome assembly and annotation
All data obtained from the 45 libraries were assembled by the Trinity program (version: release-2013-08-14), including Inchworm, Chrysalis and Butterfly modules, with default parameters except “–path_reinforcement_distance 85 –min_kmer_cov 3”31. To minimize redundancy, we clustered transcripts using TGICL32 and the non-redundant sequences of >200 bp were retained. Finally, the longest sequence was preserved and designated as a unigene. Following the filtering methods of the unigene set of Andrias davidianus33, we used a strict pipeline to filter these sequences with some modification to reduce the background and assembly errors. There are some differences compared with the published filtered cut-offs33. Here, if the lengths of unigenes were in the range of 200–500 bp with the fragments per kilobase per million mapped fragments (FPKM) < 50 in just one sample or FPKM < 1 in ten samples, they were removed. Several different public protein databases were used to validate and annotate the assembled unigenes for assigning gene names, coding sequences (CDS) and predicting protein annotations. The sequence-based alignments were mapped against the NCBI NR protein database, Swiss-Prot protein database, KEGG and COG using the BLASTx algorithm34,35 with an E-value threshold of 1e−5. The priority order of NR, Swiss-Prot, KEGG, and COG was set. The unigenes that did not align to any of the above databases were predicted as ORFs using ESTScan software36. To further evaluate the different sample mapping rates and the quality of the assembled unigenes, we aligned all tissue samples with high-quality reads to the unigenes using SOAP2.21, allowing up to 5 base mismatches37.
To annotate and categorize A. sinensis unigene function, we conducted NR annotation, and GO analysis. The NR animal protein sequences came from dozens of species; based on NR annotation, unigenes were assigned to GO classes using BLAST2GO38. With WEGO software39, the assigned GO terms were summarized into the three main GO categories, including biological process, molecular function and cellular component. Overall, all the unigenes were assigned COG classifications based on all the unigenes blasted against the COG database. The COG-annotated putative proteins were classified into 25 functional categories. To assess the quality of the transcriptome gene set, we evaluated the completeness of coding gene set using Benchmarking Universal Single-Copy Orthologs (BUSCO)40.
Differentially expressed gene analysis and RT-qPCR validation
To obtain the gene expression information, we mapped the high-quality reads of each sample to all transcripts using SOAP2 software41. After counting the number of mapped reads, estimated FPKM values were calculated through RSEM based on the mapping results. Principal component analysis (PCA) was performed for the 45 samples with the FPKM by the princomp function in the R package42. The 45 samples from the HPG in two stages were designed for 18 pairwise comparisons. The DEGs of these pairwise comparisons were evaluated by Noiseq software with probability no less than 0.843. The GO annotation methods are similar to those mentioned above in unigene annotations. Compared to the whole genome background, the DEG pathway enrichment analysis was identified by a significantly enriched pathway. After multiple testing for the p-value, pathways with a q-value ≤ 0.05 were defined as significantly enriched pathways.
To validate the DEGs, we performed RT-qPCR to evaluate the sequencing and data analysis. Total RNA was extracted using TRIzol reagent (Ambion) according to the manufacturer’s protocol. Approximately 1 μg of RNA was converted to cDNA. Then, the cDNA templates were reverse-transcribed with a PrimeScriptTM RT reagent kit with gDNA Eraser (Perfect Real Time) (TakaRa, Dalian, China). RT-qPCRs were performed on a My IQTM colour Real-time PCR Detection System (Bio-Rad, USA). Using β-actin as a reference gene normalized by the median expression, the relative expression levels of 10 target sex-related genes were calculated by the 2−ΔΔCt method44. RT-qPCR was performed on three biological replicates, and the data are presented as the mean ± SD. With Student's t-test, the difference was considered statistically significant at a p-value < 0.05.
Microsatellite markers
SSRs were detected in unigenes by MIcroSAtellite (MISA; http://pgrc.ipk-gatersleben.de/misa, version 1.0) software. The following types of SSRs were detected: mono-, di-, tri-, tetra-, penta- and hexa-nucleotide repeats, as well as compound SSRs [the sequence including more than one type of repeat units, e.g., (GA)n(TC)m]. Here, the standards of the repeat unit numbers were set as follows: mono-10, dimer-6, trimer-5, tetramer-5, pentamer-5, and hexamer-5.
Data Records
All raw reads of transcriptome sequences have been submitted to the NCBI Sequence Read Archive45. The assembled transcriptome data were deposited in NCBI’s GenBank46. The gene expression measurements were deposited in Gene Expression Omnibus (GEO)47. The transcriptome annotation of unigenes in the NR, NT, SwissProt, KEGG, COG, InterPro and GO database, differentially expressed genes in different comparisons, RT-qPCR test results and statistic of SSR motifs of the Acipenser sinensis unigenes can be found in our Figshare records48.
Technical Validation
A total of 3,255.19 M raw reads were generated by Illumina HiSeq 2000 and 4000 platforms. After filtering raw reads, 2,967.34 M high-quality reads (271.19 Gb high-quality data) were used for the transcriptome assembly, and all 45 samples were used for differential expression analysis. Sample sequencing statistics and high-quality data information are presented in Table 1. The statistics of the assembled unigene set of the 45 samples are listed in Online-only Table 1; the total number ranged from 40,374 to 155,858. The mean length and N50 length of the 121,952 unigenes were 1,384 bp and 2,208 bp, respectively. To annotate and categorize A. sinensis unigene function, we conducted NR annotation and GO analysis. A total of 76,268 unigenes matched the NR animal species dataset with BLASTx. Then, 53.86% of the unigenes were most similar to the genes of Lepisosteus oculatus, followed by Latimeria chalumnae (3.58%), Salmo salar (3.22%), Scleropages formosus (2.21%), Danio rerio (2.07%), Astyanax mexicanus (1.76%) and Clupea harengus (1.69%) (Fig. 1a).
A total of 74.5% of unigenes (90,853) were aligned to 7 databases (NR, Swiss-Prot, KEGG, COG, InterPro, GO and NT). A total of 76,268 unigenes were annotated to NR, 78,088 to NT, 67,964 to Swiss-Prot, 63,920 to KEGG, 30,194 to COG, 59,417 to InterPro and 53,038 to the GO database (Fig. 1b). For the functional annotation and classification analyses, with the annotation data based on the classification by GO, the cellular process, the cell and cell part category were the most frequent GO classification groups (Fig. 2). In COG analysis, 30,194 unigenes were annotated and classified into 25 functional categories (Supplementary Fig. 1). The largest cluster was “the general function prediction only”, followed by “Cell wall/membrane/envelope biogenesis” and “Signal transduction mechanisms”. Signal transduction; Cancers: Overview, Global and overview maps; Endocrine system; and Immune system were the top five annotated pathways according to KEGG classification analysis (Supplementary Fig. 2).
In BUSCO annotation analysis, the total number of Actinopterygii genes for evaluation was 4,584, and 85.7% of the total BUSCOs identified. In this evaluation data, the ‘Complete and single-copy BUSCOs’ was 42.5%, and the ‘Complete and duplicated BUSCOs’ was 43.2%.
Based on the gene expression data, PCA was performed for dimension reduction to extract a small number of representative features that can represent the effects of all genes. Herein, the PCA-based method on these data set succeeded in discriminating hypothalamus (H), pituitary (P) and gonad (G) from each other (Fig. 3a). However, the female sample displayed significant differences among replicates, especially for the female hypothalamus, possibly because the female samples came from different families. Then, Noiseq was used to calculate differential expression of two kinds of comparison purposes (18 pairwise comparisons including 9 in HPG and 9 in different sexes). Approximately 74.5–85.6% of the short reads for each individual were mapped to the final unigenes, indicating the good quality of the assembly results. In the female hypothalamus versus female pituitary (FH-VS-FP), 865 genes were upregulated, while 666 genes were downregulated. In the female pituitary versus female gonad (FP-VS-FG), 1,091 genes were upregulated, while 2,857 genes were downregulated. In the female hypothalamus versus female gonad (FH-VS-FG), 1,456 genes were upregulated, while 3,097 genes were downregulated. The number of DEGs greatly increased in the order of FH-VS-FP → FP-VS-FG → FH-VS-FG. A similar trend of gene expression in HPG also occurred in both males and individuals whose sexes could not be distinguished in stage I. Of the differentially expressed unigenes between males and females in the same kinds of tissues, 772 genes were upregulated, while 1,077 genes were downregulated in the unknown-sex hypothalamus versus male hypothalamus (UH-VS-MH); 3,696 genes were upregulated, while 13,685 genes were downregulated in the unknown-sex hypothalamus versus female hypothalamus (UH-VS-FH); and 4,981 genes were upregulated, while 17,211 genes were downregulated in the male hypothalamus versus female hypothalamus (MH-VS-FH). In addition to the comparisons above, the total results of pairwise comparisons of 9 kinds of tissues are described in Fig. 3b. To validate the repeatability and reproducibility of DEGs generated from annotated data, we re-amplified 10 sex-related genes by RT-PCR. Nine of the 10 genes had gene expression profiles similar to those from RNA-Seq, suggesting that the sex-related genes identified through RNA-Seq were highly accurate and reliable.
The unigene set generated a total of 51,570 SSRs, with the largest number of SSR motifs of mono-nucleotide repeats (18,162), followed by di-nucleotide repeats (17,628) and tri-nucleotide repeats (12,799) (Fig. 4).
Usage Notes
For the first time, we reported the transcriptome resources of the integrated HPG of stage I and stage II in A. sinensis. The novel de novo assembly unigenes will provide the following: (i) abundant functional genes for further research on molecular evolution, genetic breeding and fish disease prevention; and (ii) an abundance of SSR markers in A. sinensis. In addition, the DEGs will offer possibilities (i) to understand the gene interaction and regulations of gonad development from stage I to II in A. sinensis, (ii) to explore the gene co-expression and sex differentiation in A. sinensis, and (iii) to reveal gene or tissue interaction models of the HPG axis in A. sinensis. These findings are also valuable for research on developmental regulation or gene expression patterns of tissue interaction, the identification of HPG- and sex-related genes, and developing SSR markers in other sturgeon species.
References
Grande, L. & Hilton, E. J. An Exquisitely Preserved Skeleton Representing a Primitive Sturgeon from the Upper Cretaceous Judith River Formation of Montana (Acipenseriformes: Acipenseridae: N. Gen. and Sp.). Journal of Paleontology 80, 1–39 (2006).
Bemis, W. E., Findeis, E. K. & Grande, L. An overview of Acipenseriformes. Environmental Biology of Fishes 48, 25–71 (1997).
Ludwig, A., Belfiore, N. M., Pitra, C., Svirsky, V. & Jenneckens, I. Genome duplication events and functional reduction of ploidy levels in sturgeon (Acipenser, Huso and Scaphirhynchus). Genetics 158, 1203–1215 (2001).
Havelka, M., Kašpar, V., Hulák, M. & Flajšhans, M. Sturgeon genetics and cytogenetics: a review related to ploidy levels and interspecific hybridization. Folia Zoologica 60, 93–103 (2011).
Ludwig, A. Identification of Acipenseriformes species in trade. Journal of Applied Ichthyology 24, 2–19 (2008).
Yue, H., Li, C., Du, H., Zhang, S. & Wei, Q. Sequencing and De Novo Assembly of the Gonadal Transcriptome of the Endangered Chinese Sturgeon (Acipenser sinensis). PLoS ONE 10, e0127332 (2015).
Wei, Q. et al. Biology, fisheries, and conservation of sturgeons and paddlefish in China. Environmental Biology of Fishes 48, 241–255 (1997).
Wei, Q. et al. Variations in spawning stock structure of Acipenser sinensis within 24 years since damming of Gezhouba Dam. Journal of Fishery Sciences of China 12, 5 (2005).
Ryan, G. T. & Gregory T. R. Animal genome size database, http://www.genomesize.com (2011).
Yu, X., Zhou, T., Li, Y., Li, K. & Zhou, M. Chromosomes of Chinese fresh-water fishes. (Beijing (in Chinese): Science Publishing House, 1989).
Hale, M. C., McCormick, C. R., Jackson, J. R. & Dewoody, J. A. Next-generation pyrosequencing of gonad transcriptomes in the polyploid lake sturgeon (Acipenser fulvescens): the relative merits of normalization and rarefaction in gene discovery. BMC Genomics 10, 203 (2009).
Sun, F. et al. Male-biased genes in catfish as revealed by RNA-Seq analysis of the testis transcriptome. PloS ONE 8, e68452 (2013).
Vidotto, M. et al. Transcriptome sequencing and de novo annotation of the critically endangered Adriatic sturgeon. BMC Genomics 14, 407 (2013).
Jin, S. et al. Identification of genes in the hypothalamus-pituitary-gonad axis in the brain of Amur sturgeons (Acipenser schrenckii) by comparative transcriptome analysis in relation to kisspeptin treatment. Gene 595, 53–61 (2016).
Jin, S. B. et al. Comparative transcriptome analysis of testes and ovaries for the discovery of novel genes from Amur sturgeon (Acipenser schrenckii). Genetics and Molecular Research 14, 18913–18927 (2015).
Li, S. et al. Transcriptome profile of Amur sturgeon (Acipenser schrenckii) liver provides insights into immune modulation in response to Yersinia ruckeri infection. Aquaculture 492, 137–146 (2018).
Yuan, L., Zhang, X., Li, L., Jiang, H. & Chen, J. High-throughput sequencing of microRNA transcriptome and expression assay in the sturgeon, Acipenser schrenckii. PloS ONE 9, e115251 (2014).
Zhang, X. J., Jiang, H. Y., Li, L. M., Yuan, L. H. & Chen, J. P. Transcriptome analysis and de novo annotation of the critically endangered Amur sturgeon (Acipenser schrenckii). Genetics and Molecular Research 15(2), 1–13 (2016).
Zhu, R. et al. De novo annotation of the immune-enriched transcriptome provides insights into immune system genes of Chinese sturgeon (Acipenser sinensis). Fish Shellfish Immunol 55, 699–716 (2016).
Yang, J. et al. Transcriptome profiling reveals candidate cleft palate-related genes in cultured Chinese sturgeons (Acipenser sinensis). Gene 666, 1–8 (2018).
Chen, Y. et al. Discovery and identification of candidate sex-related genes based on transcriptome sequencing of Russian sturgeon (Acipenser gueldenstaedtii) gonads. Physiological Genomics 48, 464–476 (2016).
Song, W., Jiang, K., Zhang, F., Lin, Y. & Ma, L. Transcriptome Sequencing, De Novo Assembly and Differential Gene Expression Analysis of the Early Development of Acipenser baeri. PloS ONE 10, e0137450 (2015).
Zhang, S. et al. Characterization and expression analysis of g- and c-type lysozymes in Dabry's sturgeon (Acipenser dabryanus). Fish & Shellfish Immunology 76, 260–265 (2018).
Chen, Y. et al. Gonadal transcriptome sequencing of the critically endangered Acipenser dabryanus to discover candidate sex-related genes. PeerJ 6, e5389 (2018).
Sower, S. A., Freamat, M. & Kavanaugh, S. I. The origins of the vertebrate hypothalamic–pituitary–gonadal (HPG) and hypothalamic–pituitary–thyroid (HPT) endocrine systems: New insights from lampreys. General and Comparative Endocrinology 161, 20–29 (2009).
Shi, J. et al. Long-term effects of Bisphenol AF (BPAF) on hormonal balance and genes of hypothalamus-pituitary-gonad axis and liver of zebrafish (Danio rerio), and the impact on offspring. Chemosphere 128, 252–257 (2015).
Nagahama, Y. Molecular mechanisms of sex determination and gonadal sex differentiation in fish. Fish physiology and biochemistry 31, 105–109 (2005).
Kim, N. N., Shin, H. S., Choi, Y. J. & Choi, C. Y. Kisspeptin regulates the hypothalamus-pituitary-gonad axis gene expression during sexual maturation in the cinnamon clownfish, Amphiprion melanopus. Comparative biochemistry and physiology. Part B, Biochemistry & Molecular Biology 168, 19–32 (2014).
Chen, X., Wei, Q., Yang, D. & Zhu, Y. Observations on the formation and development of the primary germinal tissue of cultured Chinese sturgeon, Acipenser sinensis. Journal of Applied Ichthyology 22, 358–360 (2006).
Chen, Y. et al. SOAPnuke: a MapReduce acceleration-supported software for integrated quality control and preprocessing of high-throughput sequencing data. Gigascience 7, 1–6 (2018).
Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nature biotechnology 29, 644–652 (2011).
Pertea, G. et al. TIGR Gene Indices clustering tools (TGICL): a software system for fast clustering of large EST datasets. Bioinformatics 19, 651–652 (2003).
Geng, X. et al. A reference gene set construction using RNA-seq of multiple tissues of Chinese giant salamander, Andrias davidianus. GigaScience 6, 1–7 (2017).
Altschul, S. F., Madden, T. L., Schäffer, A. A., Zhang, J. & Zhang, Z. Webb Miller and David J. Lipman. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Research 25, 3389–3402 (1997).
Cameron, M., Williams, H. E. & Cannane, A. Improved Gapped Alignment in BLAST. Ieee Acm T Comput Bi 1, 116–129 (2004).
Iseli, C., Jongeneel, C. V. & Bucher, P. ESTScan: a program for detecting, evaluating, and reconstructing potential coding regions in EST sequences. Proceedings. International Conference on Intelligent Systems for Molecular Biology 99, 138–148 (1999).
Li, R., Li, Y., Kristiansen, K. & Wang, J. SOAP: short oligonucleotide alignment program. Bioinformatics 24, 713–714 (2008).
Conesa, A. et al. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 21, 3674–3676 (2005).
Ye, J. et al. WEGO: a web tool for plotting GO annotations. Nucleic Acids Reserch 34, W293–297 (2006).
Simao, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
Li, R. et al. SOAP2: an improved ultrafast tool for short read alignment. Bioinformatics 25, 1966–1967 (2009).
Sollner, J. F. et al. An RNA-Seq atlas of gene expression in mouse and rat normal tissues. Scientific Data 4, 170185 (2017).
Tarazona, S. et al. Data quality aware analysis of differential expression in RNA-seq with NOISeq R/Bioc package. Nucleic Acids Research 43, e140 (2015).
Van Hiel, M. B. et al. Identification and validation of housekeeping genes in brains of the desert locust Schistocerca gregaria under different developmental conditions. BMC Molecular Biology 10, 56 (2009).
NCBI Sequence Read Archive, https://identifiers.org/ncbi/insdc.sra:SRP084065 (2016).
GenBank, https://identifiers.org/ncbi/insdc:GGYF01000000 (2018).
Gene Expression Omnibus, https://identifiers.org/geo:GSE126616 (2019).
Du, H. et al. A. Hypothalamus-pituitary-gonad axis transcriptome profiling for sex differentiation in Acipenser sinensis. figshare, https://doi.org/10.6084/m9.figshare.c.4244492 (2019).
Acknowledgements
This work was supported by the Three Gorges Environmental Funds of China Three Gorges Corporation (XN270). We acknowledge the assistance of Juanjuan Liu, Yacheng Hu, Chun Tan, Qingkai Zeng and Xun Zhao in the project application and implementation, and Baifu Guo, Debin Shu, Jianming Zhang, Jianyi Wan, Hua Jiang, Jiayuan Tian, Jun Rao, Dezhi Zhang and other colleagues in sample preparation. We also acknowledge Mohamed Foda Ph.D., Yuan Lin Ph.D. and Zhihui Xiu Ph.D. for reviewing and revising the paper.
Author information
Authors and Affiliations
Contributions
H.D. and X.L. conceived and designed the experiments; H.D., Y.G. and L.C. directed the research project. X.L. and K.X. performed the experiments. J.C., J.J., B.W. and J.X. analysed the data and organized the datasets. The manuscript was written by B.W., J.J. and H.D. and revised by J.C. and J.Y. All authors discussed the results and commented on the manuscript.
Corresponding authors
Ethics declarations
Competing Interests
The authors declare no competing interests.
Additional information
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Online-only Table
ISA-Tab metadata file
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.
About this article
Cite this article
Du, H., Jian, J., Wang, B. et al. Hypothalamus-pituitary-gonad axis transcriptome profiling for sex differentiation in Acipenser sinensis. Sci Data 6, 87 (2019). https://doi.org/10.1038/s41597-019-0099-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-019-0099-1
This article is cited by
-
Genome-wide identification, phylogeny and expressional profile of the Dmrt gene family in Chinese sturgeon (Acipenser sinensis)
Scientific Reports (2024)
-
Transcript annotation of Chinese sturgeon (Acipenser sinensis) using Iso-seq and RNA-seq data
Scientific Data (2023)
-
Developing microsatellite duplex PCR reactions for sterlet (Acipenser ruthenus) and their application in parentage identification
Scientific Reports (2022)
-
Development and characterization of a duplex PCR assay in Chinese sturgeon (Acipenser sinensis) for genetic analysis
Scientific Reports (2020)