What are the most influencing factors in reconstructing a reliable transcriptome assembly? Academic Article


  • Abstract: Reconstructing the genome and transcriptome for a new or extant species are essential steps in expanding our understanding of the organism's active RNA landscape and gene regulatory dynamics, as well as for developing therapeutic targets to fight disease. The advancement of sequencing technologies has paved the way to generate high-quality draft transcriptomes. With many possible approaches available to accomplish this task, there is a need for a closer investigation of the factors that influence the quality of the results. We carried out an extensive survey of a variety of elements that are important in transcriptome assembly. We utilized the human RNA-Seq data from the Sequencing Quality Control Consortium (SEQC) as a well-characterized and comprehensive resource with an available, well-studied human reference genome. Our results indicate that the quality of the library construction significantly impacts the quality of the assembly. Higher coverage of the genome is not as important as the quality of the input RNA-Seq data. Thus, once a certain coverage is attained, the quality of the assembly is mainly dependent on the base-calling accuracy of the input sequencing reads, and it is important to avoid saturating the assembler with extra coverage.

altmetric score

  • 16.8

author list (cited authors)

  • Ghaffari, N., Abante, J., Singh, R., Blood, P. D., Pipes, L., Mason, C., & Johnson, C. D.

citation count

  • 0

complete list of authors

  • Ghaffari, Noushin; Abante, Jordi; Singh, Raminder; Blood, Philip D; Pipes, Lenore; Mason, Christopher; Johnson, Charles D

publication date

  • November 2017