Entering edit mode
8.0 years ago
jt358
▴
10
Hi all,
Quick query, one can obviously blat/blast short sequences (125bp) to see if they align to a protein coding region - but for those that don't align, is there a way to identify the nearest gene to the inputted sequence?
Cheers all
Jamie
If you have an alignment and a bed file of the genes then you could use
bedtools closest
. Link for closest doc.Hey genomax2 - helpful as always
I can create a BED file in PLINK I assume, but perhaps a silly question, what does bedtools run short sequences against? I see you mention 'if I have an alignment' - does that mean I need an annotated indexed genome of my species?
Cheers
Jamie
Please use
ADD REPLY
to answer to earlier comments, as such this thread remains logically structured and easy to follow.Also note that a bed file from plink isn't the same as a bed file in the rest of bioinformatics. What genomax2 means is an interval file containing chromosome, begin and end of every gene. You can probably convert a gtf/gff file for that.