junction.bed output files from tophat
0
1
Entering edit mode
6.6 years ago

im newbie to NGS and linux. I have used Tophat to determine the splice junction in my sequences. i want to know that how could i play with junctions.bed output file of tophat?

Actually I want to develop a PWM (position weight matrix) from this data. for which i need to extract 13 nucleotides (3 from exon and 10 from intron ) at 5' of splice junction and 17 nucleotides (14 intronic and 3 exonic nucleotides ) from 3' splice junction to buld PWM. can anyone help me how to extract these specific nucleotides in separate file? further is there any tool to deal with such questions???

next-gen • 1.8k views
ADD COMMENT
1
Entering edit mode

You should know that the old 'Tuxedo' pipeline of Tophat(2) and Cufflinks is no longer the "advisable" tool for RNA-seq analysis. The software is deprecated/ in low maintenance and should be replaced by HISAT2, StringTie and ballgown. See this paper: Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. There are also other alternatives, including alignment with STAR and bbmap, or pseudo-alignment using salmon.

In addition, I converted this post from a "Tool" to a "Question". "Tool" should only be used to announce new software.

ADD REPLY
0
Entering edit mode

thank you so much for the help...

ADD REPLY

Login before adding your answer.

Traffic: 2010 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6