Dear all,
I was wondering if there is a possibility to filter DNA sequences of defined length (lets say about 30 bp) forthe presence of a specific base at a defined positin.
Something like: Give me only the sequences which contain a G at position 5 and an A at position 20.
Maybe this tool is so trivial that it was never implemented in galaxy or I am not seraching with the right key words.
does anybody have an idea?
Thank you very much,
Phil
What is an input file? FASTA?
dear all, thank you very much,
the input file would be a simple fasta and the length of the seuqences is always the same.
with my very limited understanding of programming (I have never done this before) I will try to get this working on a windows PC...
CHeers,
phil
Please if you can mark answer or up-vote any solution above.
philipp.rathert : Please use
ADD COMMENT/ADD REPLY
when responding to existing posts to keep threads logically organized.cpad0112 's solution may be the only one that will work natively on windows. Other two solution s will require python (@st.ph.n) and a virtual unix environment (@Pierre).