Question

Another way to identify of the library type your data

0

Entering edit mode

10.4 years ago

arronar ▴ 290

Hi.

If you don't know the library type of your sequenced data to use with tophat, tophat's manual says :

One possible way to figure out the correct library-type is to run TopHat with a small subset of the reads (e.g., 1M) as follows.

run TopHat with fr-firststrand and count the number of junctions in junctions.bed (one of the output files from TopHat)

run TopHat with fr-secondstrand and count the number of junctions in junctions.bed

Since the splice junction finding algorithm of TopHat makes use of library-type information (if provided), one of the two TopHat runs would result in many more splice junctions than the other one. You can then use the library type that gives more junctions. If this is not the case TopHat might not work well with your sequencing protocol. Please let us know more details about your protocol so we can add support for new library types.

Is there any other way to answer that question?

RNA-Seq • 2.2k views

ADD COMMENT • link updated 3.2 years ago by Ram 45k • written 10.4 years ago by arronar ▴ 290

Ram · Accepted Answer · 2015-03-31

2

Entering edit mode

10.4 years ago

Ashutosh Pandey 12k

Another tool: http://rseqc.sourceforge.net/#infer-experiment-py

For non stranded libraries, the strandedness of reads (if they align to the forward or the reverse strand of the reference fasta) and the strandedness of transcripts (determined from their gtf annotation file) should be independent.

ADD COMMENT • link updated 3.2 years ago by Ram 45k • written 10.4 years ago by Ashutosh Pandey 12k

0

Entering edit mode

Thank you very much. Now. I will compare those results with the way of the tophat manual where both runs returned me "JUNC0000106"

ADD REPLY • link updated 3.2 years ago by Ram 45k • written 10.4 years ago by arronar ▴ 290