Another way to identify of the library type your data
1
0
Entering edit mode
9.8 years ago
arronar ▴ 290

Hi.

If you don't know the library type of your sequenced data to use with tophat, tophat's manual says :

One possible way to figure out the correct library-type is to run TopHat with a small subset of the reads (e.g., 1M) as follows.

  1. run TopHat with fr-firststrand and count the number of junctions in junctions.bed (one of the output files from TopHat)
  2. run TopHat with fr-secondstrand and count the number of junctions in junctions.bed

Since the splice junction finding algorithm of TopHat makes use of library-type information (if provided), one of the two TopHat runs would result in many more splice junctions than the other one. You can then use the library type that gives more junctions. If this is not the case TopHat might not work well with your sequencing protocol. Please let us know more details about your protocol so we can add support for new library types.

Is there any other way to answer that question?

RNA-Seq • 2.0k views
ADD COMMENT
2
Entering edit mode
9.7 years ago

Another tool: http://rseqc.sourceforge.net/#infer-experiment-py

For non stranded libraries, the strandedness of reads (if they align to the forward or the reverse strand of the reference fasta) and the strandedness of transcripts (determined from their gtf annotation file) should be independent.

ADD COMMENT
0
Entering edit mode

Thank you very much. Now. I will compare those results with the way of the tophat manual where both runs returned me "JUNC0000106"

ADD REPLY

Login before adding your answer.

Traffic: 1576 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6