Question

Which annotation should I use for RNA-seq: Ensembl, UCSC, or RefSeq?


8.5 years ago

bxia ▴ 180

Does it really matter?

I took a look at the annotation files, and they look very different.

RNA-Seq Ensembl RefSeq • 3.8k views

Updated 8.5 years ago by Denise CS ★ 5.2k • written 8.5 years ago by bxia ▴ 180


Which organism are you working with?

If they look very different in ways that go beyond the naming conventions for genes and chromosomes, don't you think it matters?
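To illustrate the naming-convention part: UCSC names primary chromosomes `chr1`, `chrX`, `chrM`, while Ensembl uses `1`, `X`, `MT`. Below is a minimal sketch for converting between the two styles. The function names are made up for this example, and it only handles primary chromosomes; unplaced scaffolds and alternate contigs follow other naming rules and would need a proper lookup table.

```python
def ucsc_to_ensembl(chrom: str) -> str:
    """Convert a UCSC-style primary chromosome name to Ensembl style."""
    if chrom == "chrM":          # the mitochondrial genome is named differently
        return "MT"
    # Strip the "chr" prefix if present; leave other names untouched.
    return chrom[3:] if chrom.startswith("chr") else chrom


def ensembl_to_ucsc(chrom: str) -> str:
    """Convert an Ensembl-style primary chromosome name to UCSC style."""
    if chrom == "MT":
        return "chrM"
    # Add the "chr" prefix unless the name already has it.
    return chrom if chrom.startswith("chr") else "chr" + chrom


for name in ["chr1", "chrX", "chrM"]:
    print(name, "->", ucsc_to_ensembl(name))
```

Differences that remain after harmonizing the names this way are the ones worth worrying about.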

Reply • 8.5 years ago by GouthamAtla 12k

score 2 · Answer 1 · 2016-06-04

It's not surprising that the annotation files differ. They come from independent projects that annotate the same assembly (e.g. GRCh38 in human, GRCm38 in mouse) but use different methodologies for calling genes and transcripts. In Ensembl, an annotation must be supported by biological evidence (mRNA, EST, protein, RNA-seq reads). Ab initio predictions are not listed in the annotation file, whereas the RefSeq set may include predicted transcripts (those based on XM or XP entries). For human and mouse, the Ensembl annotation is the GENCODE annotation: a merge of automatically annotated genes with genes manually annotated by HAVANA. HAVANA collaborates with labs and other groups to experimentally validate some of its transcripts (see Howald et al.) and to include pseudogene annotation from Yale (see these slides).
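As a rough way to see how much of a RefSeq set is predicted, accessions can be split by prefix: `NM_`, `NR_` and `NP_` mark known records, while `XM_`, `XR_` and `XP_` mark model (predicted) records. The sketch below assumes you already have a list of accession strings; the function name and the `XM_` accession in the demo are made up for illustration.

```python
from collections import Counter

KNOWN_PREFIXES = ("NM_", "NR_", "NP_")   # known RefSeq records
MODEL_PREFIXES = ("XM_", "XR_", "XP_")   # predicted (model) RefSeq records


def refseq_status(accession: str) -> str:
    """Classify an accession as 'known', 'model', or 'other' by its prefix."""
    if accession.startswith(KNOWN_PREFIXES):
        return "known"
    if accession.startswith(MODEL_PREFIXES):
        return "model"
    return "other"  # e.g. Ensembl IDs or anything that is not RefSeq


# The XM_ accession is a placeholder, not a real record.
accessions = ["NM_000546.6", "XM_000001.1", "ENST00000269305"]
print(Counter(refseq_status(a) for a in accessions))
```

Dropping the `model` class before quantification is one option if you want a RefSeq set closer in spirit to an evidence-only annotation, though whether that suits your analysis is a judgment call.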

score 0 · Answer 2 · 2016-06-02


8.5 years ago

Steven Lakin ★ 1.8k

Choose whichever is the most up to date for the organism you're studying. For the popular model organisms it won't matter, but for less commonly studied organisms it will depend on which database is the most current for that organism.


score 0 · Answer 3 · 2016-06-02


8.5 years ago

Antonio R. Franco ★ 5.2k

Yes, it matters, but only for the limited set of genes that are annotated differently across the databases.
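One rough way to find that set for your own data is to compare gene spans between two GTF files. The sketch below is only an illustration under several assumptions: both files carry a `gene_name` attribute on their `exon` records, both use the same chromosome naming, and gene names are comparable across the two sources. The function names are hypothetical.

```python
import re


def gene_spans(gtf_lines):
    """Map gene_name -> (chrom, start, end), built from exon records."""
    spans = {}
    for line in gtf_lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and header comments
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 9 or fields[2] != "exon":
            continue
        match = re.search(r'gene_name "([^"]+)"', fields[8])
        if match is None:
            continue
        name = match.group(1)
        chrom, start, end = fields[0], int(fields[3]), int(fields[4])
        if name in spans:
            # Extend the span to cover every exon seen for this gene.
            _, old_start, old_end = spans[name]
            start, end = min(start, old_start), max(end, old_end)
        spans[name] = (chrom, start, end)
    return spans


def compare_annotations(gtf_a, gtf_b):
    """Report genes unique to each annotation and shared genes whose spans differ."""
    a, b = gene_spans(gtf_a), gene_spans(gtf_b)
    shared = a.keys() & b.keys()
    return {
        "only_in_a": sorted(a.keys() - b.keys()),
        "only_in_b": sorted(b.keys() - a.keys()),
        "different_span": sorted(g for g in shared if a[g] != b[g]),
    }
```

With real files you could pass open file handles, e.g. `compare_annotations(open("a.gtf"), open("b.gtf"))`, where the file names are placeholders. Identical spans do not guarantee identical transcript models, so this only flags the coarsest differences.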

A paper about this was published recently, but I cannot find it right now. Sorry.



Would it be this one by Zhao & Zhang?

Reply • 8.5 years ago by Denise CS ★ 5.2k


Yes, that is the paper.

Reply • 8.5 years ago by Antonio R. Franco ★ 5.2k