Entering edit mode
6.0 years ago
2nelly
▴
350
Hi all,
Can anybody suggest me a tool in which you can provide a DNA sequence in order to be scanned against known motifs?
Thank you in advance
A: Extracting the coordinates for a specific base from multifasta Instead of base, you can use pattern/motif while using seqkit.
This requires exact matching of the query to the subject, right? Not really suited for motif scanning where each position can have base substitutions while still belonging to the same motif, depending on the position frequency matrices.
Program supports regex and degenerate bases. An example data would have been better. @ 2nelly / Could you provide some example data?
something like this:
Actually what I am interested in is to: 1) Provide a PWM like the above and scan for known motifs:
2) Give a consensus motif directly and look for possible matches to known motifs
Please edit your top-level question to really point out what you need. In the question you say "a DNA sequence" and now it is a PWM that you want to compare against a given set of other PWMs. Please be precise so that people can understand what you want.
You asked this in the original question and got answers that fit that request.
Not describing the problem accurately leads to just noise in threads, which makes it hard to understand what is being asked and what answer goes with the actual request.
Please edit the original question and provide accurate complete information there.
For matters of completeness, if someone is interested in checking a single sequence against a collection of PFMs/PWMs, see my tutorial.
Dear ATpoint,
I already have the occurrences of some candidate motifs, came from thousands regions. What I am looking for is to check if this motif is found in a db like jaspar.
Thank you
Sorry genomax but I disagree,
The question is quite straightforward. My original goal was to provide a consensus DNA sequence to scan against known motifs, like i said as number 2 in my previous reply.
The number 1 came after the first reply.
So, if I get an answer to my original question, I will be happy. The second query was additional to the answers I received.
Adding comments like this, you increase the noise in threads for no reason. If you disagree, you could have sent me a dm.
Thank you
You are free to disagree, but if you manage to confuse a couple of our top contributors then you probably did not explain your question sufficiently well. Given that your original question is just one sentence suggests that you did not explain things enough. The fact that comments are going back and forward about what exactly you are looking for only confirm this.
And we don't have dm's on biostars.
My suggestion is that we abandon this thread, you think about what you want and clearly describe that in a new thread, providing a link to this thread for background information.
You guys keep adding noise to threads, by accusing me for not providing the correct query info.
Seems like it is easy to you to criticize people behind your keyboard instead of trying to get a second look at the question.
I think it s pointless to look for any solution in this thread.
P.S. The dm option would be a good idea, instead of destroying the questions of "some not so brilliant minds" like us.
Have a great day.
Hi,
Please do not be offended, none of the comments here are personal. Questions are expected to address the issue at hand in full, and be complete on creation. When information that foundationally changes a the problem at hand is added because contributors have to dig in and excavate the actual problem, discussions get out of hand really fast and the overall post plummets in value.
You're welcome to create another question and describe the problem with as much detail as you can, or I can clean up this thread and you can edit your question with the relevant details. What would you prefer?
There is a significant difference between your 'input data' being a PWM and an actual sequence. Nothing in your original question mentioned this matrix, so you got answers about detecting sequences.
Had you just stated the real format of your data in the first place, none of this 'noise' would be here. You didn't even use the word 'consensus' in your top level post, so how were we ever to know you had a PWM?
The onus is on you as the question asker, to make the task as simple as possible for the community to answer your question, it is not on us to guess/interpret what you meant all along. If we have to "take a second look at the question" at all, it wasn't a very well posed question.