Explain why RNA-seq data is particularly valuable for refining eukaryotic gene predictions.
Think about your answer, then reveal below.
Model answer: RNA-seq captures the actual transcripts produced by cells, directly revealing which regions of the genome are transcribed and how exons are spliced together. Reads that span exon-exon junctions provide definitive evidence for intron locations and alternative splicing patterns. This experimental evidence resolves ambiguities that ab initio methods cannot — such as which of several possible splice sites is actually used, whether a predicted gene is truly expressed, and the full diversity of transcript isoforms.
Before RNA-seq, gene annotation relied heavily on ESTs (expressed sequence tags) and cDNA libraries, which had limited coverage and tissue representation. RNA-seq provides comprehensive, quantitative, tissue-specific transcript evidence that has dramatically improved annotation quality in every genome where it has been applied.