This is a small subset of my BED file of exon coordinates. I want to extract all the exons that belong to a given gene. For example, say I have a gene on chr1 that spans chr1 11868 12227;
I want to pull out all the exons that fall between 11868 and 12227.
This is my small subset:
cat exon.bed | head -10
chr1 11868 12227 + exon
chr1 11871 12227 + exon
chr1 11873 12227 + exon
chr1 12009 12057 + exon
chr1 12178 12227 + exon
chr1 12594 12721 + exon
chr1 12612 12697 + exon
chr1 12612 12721 + exon
chr1 12612 12721 + exon
chr1 12974 13052 + exon
How do I pull these out? I mostly use R and a bit of shell scripting, but I'm not sure whether R is suitable here; maybe a few lines of Perl or shell script would solve my problem.
Any help or suggestions would be highly appreciated.
How about just using awk?
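The command that went with this answer is not preserved in the thread, so the following is only a minimal sketch of what an awk filter for one gene could look like, not necessarily what was originally posted. It assumes whitespace-separated columns in the order chromosome, start, end, strand, feature (as in the sample above) and keeps only exons that lie entirely inside the gene interval. The file names `exon_sample.bed` and `gene_exons.bed` are made up for the illustration.

```shell
# Recreate the sample from the question (exon_sample.bed is a hypothetical name).
cat > exon_sample.bed <<'EOF'
chr1 11868 12227 + exon
chr1 11871 12227 + exon
chr1 11873 12227 + exon
chr1 12009 12057 + exon
chr1 12178 12227 + exon
chr1 12594 12721 + exon
chr1 12612 12697 + exon
chr1 12612 12721 + exon
chr1 12612 12721 + exon
chr1 12974 13052 + exon
EOF

# Keep exons on the same chromosome whose start and end both fall
# inside the gene interval (assumption: "in between" means fully contained).
awk -v chr="chr1" -v start=11868 -v end=12227 \
    '$1 == chr && $2 >= start && $3 <= end' exon_sample.bed > gene_exons.bed

cat gene_exons.bed
```

On the ten sample lines this keeps the first five exons (those ending at or before 12227). To keep exons that merely overlap the gene instead, the condition could be changed to `$2 < end && $3 > start`.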
If you need a faster solution, query your file using tabix.
@Pierre thank you very much for the quick solution; it at least gives me a starting point. How would I do what you suggested for all the genes, each with its own coordinates? Some genes have a single exon and some have multiple exons. I hope I am making my problem clear.
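One possible way to extend the awk filter to every gene is to read a second file of gene coordinates first and then test each exon against all genes. The sketch below assumes a hypothetical `genes_sample.bed` with columns chromosome, start, end, gene name; the gene names and gene intervals are invented for the illustration and are not from the original data. It loops over all genes for each exon, which is fine for small files but slow for whole-genome annotations.

```shell
# Sample exons from the question (exon_sample.bed is a hypothetical name).
cat > exon_sample.bed <<'EOF'
chr1 11868 12227 + exon
chr1 11871 12227 + exon
chr1 11873 12227 + exon
chr1 12009 12057 + exon
chr1 12178 12227 + exon
chr1 12594 12721 + exon
chr1 12612 12697 + exon
chr1 12612 12721 + exon
chr1 12612 12721 + exon
chr1 12974 13052 + exon
EOF

# Hypothetical gene table: chromosome, start, end, gene name (invented values).
cat > genes_sample.bed <<'EOF'
chr1 11868 12227 geneA
chr1 12594 12721 geneB
chr1 12974 13052 geneC
EOF

# First file (NR == FNR): store every gene interval.
# Second file: print each exon once per gene that fully contains it.
awk '
NR == FNR {
    n++
    gchr[n] = $1; gstart[n] = $2 + 0; gend[n] = $3 + 0; gname[n] = $4
    next
}
{
    for (i = 1; i <= n; i++)
        if ($1 == gchr[i] && $2 + 0 >= gstart[i] && $3 + 0 <= gend[i])
            print gname[i], $1, $2, $3, $4
}' genes_sample.bed exon_sample.bed > exons_by_gene.txt

cat exons_by_gene.txt
```

Each output line starts with the gene name, so genes with one exon and genes with many exons are handled the same way; `grep '^geneA ' exons_by_gene.txt` would then list the exons of a single gene.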
You might also want to look at the TxDb packages in Bioconductor.
Take a look at the rtracklayer Bioconductor package and its import function. Then, after importing the BED file, look at the GenomicRanges %over% method. These are big hammers for a small problem, but if you use R and are doing genomics, GenomicRanges can quickly become your best friend.
Okay, that sounds really cool. Yes, I mostly use R for all my genomics work. I will try the library and let you know.
Hello krushnach80!
It appears that your post has been cross-posted to another site: https://bioinformatics.stackexchange.com/questions/3239/parse-out-exon-coordinates-from-bed-file-for-each-gene
This is typically not recommended as it runs the risk of annoying people in both communities.
@Pierre I'm sorry that I cross-posted. I had posted earlier, but as I didn't get any response I posted in both communities. I will keep in mind not to repeat it.
Oh you didn't get a response after 2 hours on a Sunday, that is indeed unreasonably long. Quite a lazy community indeed, next thing you know we'll have a personal life to take care of.
@WouterDeCoster I'm sorry about that. I was talking about this related question that I asked earlier, which was not very specific:
Parse out exon for divergent primer design