Question

Approach to support expression of an annotation from RNASeq

0

Entering edit mode

8.5 years ago

samuelmiver ▴ 440

I have a set of new annotation candidates to be considered in a genome using data from proteomics.

I would like to support them with data coming from RNA Sequencing data. My initial idea was to take all the trusted annotations and study its distribution in RPKM (population A) and the same for regions of the genome with no annotation (population B). With these two population I would calculate the probability of my candidate belonging to population A (expressed) or population B (not expressed). I see this approach excessively complex to just say if a region is being expressed or not.

Is there any tool or approach to assign a probability of being expressed per base/annotation? If not, does it make sense the approach suggested?

RNA-Seq expression statistics • 1.4k views

ADD COMMENT • link updated 8.5 years ago by natasha.sernova ★ 4.0k • written 8.5 years ago by samuelmiver ▴ 440

score 1 · Answer 1 · 2016-05-23

There are some recent paper, they may be useful to you.

A survey of best practices for RNA-seq data analysis

http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4728800/

Computational Methods for Transcriptome Annotation and
Quantification using RNA-Seq

http://www.broadinstitute.org/~mgarber/17436_1_merged_1302536134.pdf

Assessment of transcript reconstruction methods for RNA-seq

http://www.nature.com/nmeth/journal/v10/n12/full/nmeth.2714.html