Expression profiling by high throughput sequencing
Summary
High-throughput RNA-sequencing has now become the gold standard method for whole-transcriptome gene expression analysis. It is widely used in a number of applications studying various transcriptomes of cells and tissues. It is also being increasingly considered for a number of clinical applications, including expression profiling for diagnostics or alternative transcripts detection. However, RNA sequencing can be challenging in some situations, for instance due to low input quantities or degraded RNA samples. Several protocols have been proposed to overcome some of these challenges, and many are available as commercial kits. Here we perform a comprehensive testing of three recent commercial technologies for RNA-seq library preparation (Truseq, Smarter and Smarter Ultra-Low) on human reference tissue preparations, for standard (1ug), low (100 and 10 ng) and ultra-low (< 1 ng) input quantities, and for mRNA and total RNA, stranded or unstranded. We analyze the results using read quality and alignments metrics, gene detection and differential gene expression metrics. Overall, we show that the Truseq kit performs well at 100 ng input quantity, while the Smarter kit shows degraded performances for 100 and 10 ng input quantities, and that the Smarter Ultra-Low kit performs quite well for input quantities < 1 ng. All the results are discussed in details, and we provide guidelines for the selection of a RNA-seq library preparation kits by biologists.
Overall design
RNA-seq profiles of Human Universal Reference Total RNA (Clontech) and Human Brain reference RNA (Ambion) were generated by deep sequencing, in duplicate, using Illumina HiSeq2000 or 2500 sequencers.