I need to remove repeated nucleotide sequence from the sequences I am working with. The sequences are in FASTA format. Can i do this by using BLASTn ? How? Thank you.
I need to remove repeated nucleotide sequence from the sequences I am working with. The sequences are in FASTA format. Can i do this by using BLASTn ? How? Thank you.
It sounds like you are trying to create a set of non-redundant sequences. There are many methods to do this but the important thing to consider is how many sequences you have because not all methods are equal in this regard. Uclust may work just fine, though if you have a lot of sequences (e.g., one or more lanes of HiSeq) I don't think this will work out well, in my experience. I recommend you try Vmatch if you have access to it (there is a section in the manual on how to accomplish this task).
The proper name of what you are asking for is "repeat masking". So, you have to look for tools to "mask" low entropy sequences.
There are many tools to mask sequences:
Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Can you be a little more clear in what exactly you are asking? What is your research question? I don't understand why you would use blastn?
If you look in the comments to Giovanni's answer, it appears the goal is to remove repeat sequences (i.e., create a non-redundant set). There is a dearth of information about the data and question, so all we can really offer are some general guidelines/approaches. I don't think blastn has anything to do with this question, other than it is probably a familiar tool.
Question is unclear due to use of the term "repeated nucleotide sequence". You need to clarify whether you want to remove (1) entire sequences which are duplicates or (2) repeat regions within individual sequences.