Entering edit mode
5.0 years ago
markddesimone
▴
60
I have a series of files containing exon-exon junctions, essentially chr, start, end, strand of each intron. Is there a way to assemble these into transcripts without having the original bam files?
Thank you
Thank you, here is an example, they are bam.bed files, no header, the usual bed format. After adjusting start end as described here Tophat Junctions.Bed File the coordinates are the intron coordinates. Only junctions between exons are detected. I can use gencode annotation to determine which exons are likely either side of the intron but would like to assemble them into likely transcripts so I could get transcript abundance measures. :
Maybe RegTools could help: https://regtools.readthedocs.io/en/latest/commands/junctions-annotate/ I haven't tried this command but it seems that it can output transcripts and genes that overlap junctions if you have provided reference fasta and annotations gtf files.