How to build genome index file with ERCC (spike-in )
1
1
Entering edit mode
5.9 years ago
Mike ★ 1.9k

Hi all,

I have GRCh37 fasta sequence & GTF annotation file and ERCC fasta sequence file (but I don’t have ERCC annotation file (GTF)). I want to build genome index file including ERCC sequence using STAR. please help me how to build index file with ERCC?

Thanks

genome sequence annotation • 3.5k views
ADD COMMENT
6
Entering edit mode
5.9 years ago

Cat the fasta files together and then add an entry for each spike-in into the GTF file. Note that you could instead just map against the spike-ins and use samtools idxstats to get the counts for the spike-ins. The multimapping rate within the spike-ins and between the spike-ins and the human genome will be very low, so this will produce accurate enough results.

ADD COMMENT
0
Entering edit mode

Thanks Devon, Sorry, how to add entry for each spike-in into the GTF file?

ADD REPLY
0
Entering edit mode

It's a text file, open it in your editor of choice.

ADD REPLY
0
Entering edit mode

Thanks, Sorry for again, but I don't have annotation information for spike-ins. Can I add only spike-in names ?

ADD REPLY
1
Entering edit mode

The annotation is the name and entire length, which you can get from the fasta file.

ADD REPLY

Login before adding your answer.

Traffic: 1709 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6