HeLa reference fasta
1
1
Entering edit mode
7.4 years ago

Dear all,

My first question is if there is / where to find a referece .fasta file for the HeLa cell line.

I am working with several DNA sequencing samples from HeLa. So far I have used UCSC's hg19 as a reference, however when aligning with bwa I can see a lot of R1 fwd alignments with a lot of soft clipping at the 5' in my particular regions of interest. I am starting to think whether this might be due to structural variation differences between hg19 and HeLa beginning around where I am looking into.

My second question is on how to generate a customised reference fasta file from a standard reference fasta file and variation data. Any publication/software describing how to do this?

There are several 2013 efforts studying variation in HeLa. I thought that perhaps combining hg19 with single nucleotide and structural variations files might do what I need.

Any ideas?

HeLa reference hg19 • 3.5k views
ADD COMMENT
1
Entering edit mode

According to the most comprehensive review of the HeLa genome I've seen, it's not a small change between it and hg19. It's a completely rearranged cell line. https://www.ncbi.nlm.nih.gov/pubmed/23550136

ADD REPLY
2
Entering edit mode
7.4 years ago
GenoMax 147k

I believe that you need to apply for access to HeLa cell data. More information here.

ADD COMMENT
2
Entering edit mode

Yes, you need to request access to the data from dbGAP. From what I've been told, requesting access is a pain and has to be done for each use of the data separately plus access authorization expires quickly and has to be renewed. My opinion is that if the HeLa genome sequence is of long term interest to several projects in your lab, you'd be better off sequencing the HeLa cells you're using. Code from the Landry et al. paper should be freely available.

ADD REPLY

Login before adding your answer.

Traffic: 2202 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6