Are variable lengths are a problem for TopHat2?
2
1
Entering edit mode
10.5 years ago
ktsuttlemyre ▴ 40

I have some fastq files that contain a sequence length distribution that are not uniform. Most sequences are trimmed at 75bp with ~3/8ths at 74bp ~1/4th at 73bp and trailing off all the way to the 34bp length. This closely resembles an exponential curve. Anyway, I am curious if the varying lengths will cause problems when attempting to use TopHat2 to align to a reference genome.

Tophat2 alignment miseq fastq • 2.2k views
ADD COMMENT
2
Entering edit mode
10.5 years ago

Tophat2 supports variable read-length. So using Tophat2 won't be an issue. Personally, I don't use reads with length less than 40 nt. I don't think reads with variable read lengths will cause any problem with the interpretation of results generated from Topaht2.

ADD COMMENT
2
Entering edit mode
10.5 years ago

That'll be fine. Variable lengths are pretty much the norm post-QC.

ADD COMMENT

Login before adding your answer.

Traffic: 2670 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6