Abstract
As a fruit of the current revolution in sequencing technology, transcriptomes can now be analyzed at an unprecedented level of detail. These advances have been exploited for detecting differentially expressed genes across biological samples and for quantifying the abundances of various RNA transcripts within one gene. However, explicit strategies for detecting the hidden differential abundances of RNA transcripts in biological samples have not been defined. In this work, we present two novel statistical tests to address this issue: a 'gene structure sensitive' Poisson test for detecting differential expression when the transcript structure of the gene is known, and a kernel-based test called Maximum Mean Discrepancy (MMD) when it is unknown. We evaluated the proposed approaches on simulated read data for two artificial samples as well as on real reads generated by the Illumina Genome Analyzer for two C. elegans samples. Our analysis shows that the Poisson test identifies genes with differential transcript expression considerably better than previously proposed RNA transcript quantification approaches for this task. The MMD test is able to detect a large fraction (75%) of such differential cases without knowledge of the annotated transcripts. It is therefore well suited to analyzing RNA-Seq experiments when the genome annotations are incomplete or not available, a setting in which other approaches fail.
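To make the idea behind the kernel-based test concrete, the following is a minimal sketch of a generic two-sample MMD permutation test. It is not the authors' implementation. The representation of each read by a single numeric position within the gene, the Gaussian kernel, the `bandwidth` value, the biased estimator and the permutation-based p-value are all assumptions chosen for illustration, and the function names are hypothetical.

```python
import numpy as np


def rbf_kernel(x, y, bandwidth):
    """Gaussian kernel matrix between two 1-D arrays of read positions."""
    diff = x[:, None] - y[None, :]
    return np.exp(-diff ** 2 / (2.0 * bandwidth ** 2))


def mmd2_biased(x, y, bandwidth):
    """Biased estimate of the squared Maximum Mean Discrepancy."""
    kxx = rbf_kernel(x, x, bandwidth).mean()
    kyy = rbf_kernel(y, y, bandwidth).mean()
    kxy = rbf_kernel(x, y, bandwidth).mean()
    return kxx + kyy - 2.0 * kxy


def mmd_permutation_test(x, y, bandwidth=50.0, n_perm=500, seed=0):
    """Return (statistic, p_value) for the null that x and y share one distribution.

    x, y: read positions within one gene for sample 1 and sample 2
    (an assumed, simplified read representation).
    """
    rng = np.random.default_rng(seed)
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    observed = mmd2_biased(x, y, bandwidth)

    # Under the null, sample labels are exchangeable, so shuffle the pooled reads.
    pooled = np.concatenate([x, y])
    n = len(x)
    exceed = 0
    for _ in range(n_perm):
        perm = rng.permutation(pooled)
        if mmd2_biased(perm[:n], perm[n:], bandwidth) >= observed:
            exceed += 1

    # Add-one correction keeps the estimated p-value strictly positive.
    p_value = (exceed + 1) / (n_perm + 1)
    return observed, p_value
```

In this sketch a small p-value for a gene suggests that the read distributions along the gene differ between the two samples, which is the kind of signal one would expect from differential transcript abundance. No transcript annotation enters the computation, only the reads themselves.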
Cite this article
Stegle, O., Drewe, P., Bohnert, R. et al. Statistical Tests for Detecting Differential RNA-Transcript Expression from Read Counts. Nat Prec (2010). https://doi.org/10.1038/npre.2010.4437.1
DOI: https://doi.org/10.1038/npre.2010.4437.1