whole genome fasta/gtf for cell lines
8.7 years ago

Hello,
I need to create a reference sequence index for TopHat. This is straightforward for the human chromosomes alone, but what if I need to index genomes from human cell lines that contain integrated viral sequences, for instance SiHa (containing human papillomavirus) and Akata (containing Epstein-Barr virus)? How can I create a reference for these cells? Is there a GTF file for cell lines?
Thank you,
L.

genome RNA-Seq Assembly • 2.0k views
8.7 years ago
michael.ante ★ 3.9k

Hi Luigi,

You can either create a combined index by concatenating the human and viral FASTA files (and likewise the human and viral GTF annotations), or create a separate index for the virus genomes and map to that first. The reads that remain unmapped can then be aligned against the human reference.
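
As a rough illustration of the first option, here is a minimal Python sketch that concatenates several FASTA files and several GTF files into one combined pair. The function names and the file names in the usage note are hypothetical, and the sketch assumes that the first column of each GTF uses exactly the same sequence names as the FASTA headers (for example `chr1` in both), which is worth checking before building the index.

```python
from pathlib import Path


def fasta_names(path):
    """Return the sequence names (first word of each '>' header) in a FASTA file."""
    names = []
    with open(path) as handle:
        for line in handle:
            if line.startswith(">"):
                fields = line[1:].split()
                if fields:
                    names.append(fields[0])
    return names


def combine_references(fasta_paths, gtf_paths, out_prefix):
    """Write <out_prefix>.fa and <out_prefix>.gtf from the given input files.

    Raises ValueError if two FASTA records share a name, or if a GTF line
    refers to a sequence that is not present in any of the FASTA files.
    """
    # Collect all sequence names first so clashes are caught before writing.
    seen = set()
    for path in fasta_paths:
        for name in fasta_names(path):
            if name in seen:
                raise ValueError(f"duplicate sequence name: {name}")
            seen.add(name)

    out_fa = Path(f"{out_prefix}.fa")
    out_gtf = Path(f"{out_prefix}.gtf")

    # Concatenate the FASTA files, making sure every line ends with a newline
    # so the last record of one file cannot run into the next header.
    with open(out_fa, "w") as out:
        for path in fasta_paths:
            with open(path) as handle:
                for line in handle:
                    if line.strip():
                        out.write(line.rstrip("\n") + "\n")

    # Concatenate the GTF files, checking the sequence name in column 1.
    with open(out_gtf, "w") as out:
        for path in gtf_paths:
            with open(path) as handle:
                for line in handle:
                    if not line.strip():
                        continue
                    if not line.startswith("#"):
                        seqname = line.split("\t", 1)[0]
                        if seqname not in seen:
                            raise ValueError(
                                f"GTF sequence not in FASTA: {seqname}"
                            )
                    out.write(line.rstrip("\n") + "\n")

    return out_fa, out_gtf
```

With hypothetical inputs, a call such as `combine_references(["human.fa", "hpv16.fa"], ["human.gtf", "hpv16.gtf"], "human_hpv16")` would write `human_hpv16.fa` and `human_hpv16.gtf`. The combined FASTA could then be indexed (for example with `bowtie2-build`) and the combined GTF passed to TopHat with `-G`.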

Cheers

Michael
