Nat Methods; co-auth.: group Reymond

Nat Methods. 2013 Nov 3. doi: 10.1038/nmeth.2714. [Epub ahead of print]

Assessment of transcript reconstruction methods for RNA-seq.

Steijger TAbril JFEngström PGKokocinski FThe RGASP ConsortiumAbril JFAkerman MAlioto TAmbrosini GAntonarakis SEBehr JBertone P,Bohnert RBucher PCloonan NDerrien TDjebali SDu JDudoit SEngström PGGerstein MGingeras TRGonzalez DGrimmond SMGuigó RHabegger LHarrow JHubbard TJIseli CJean GKahles AKokocinski FLagarde JLeng JLefebvre GLewis SMortazavi ANiermann PRätsch GReymond A,Ribeca PRichard HRougemont JRozowsky JSammeth MSboner ASchulz MHSearle SMSolorzano NDSolovyev VStanke MSteijger TStevenson BJStockinger HValsesia AWeese DWhite SWold BJWu JWu TDZeller GZerbino DZhang MQHubbard TJGuigó RHarrow JBertone P.

Source

European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, UK.

Abstract

We evaluated 25 protocol variants of 14 independent computational methods for exon identification, transcript reconstruction and expression-level quantification from RNA-seq data. Our results show that most algorithms are able to identify discrete transcript components with high success rates but that assembly of complete isoform structures poses a major challenge even when all constituent elements are identified. Expression-level estimates also varied widely across methods, even when based on similar transcript models. Consequently, the complexity of higher eukaryotic genomes imposes severe limitations on transcript recall and splice product discrimination that are likely to remain limiting factors for the analysis of current-generation RNA-seq data.

PMID:

 

24185837