NCBI Contamination Screen
4.5 years ago
Pegasus ▴ 120

Hi all,

I have Medusa scaffolding output containing 160 scaffolds, some of which include NNN bridges. The file is in FASTA format. I need to remove all scaffolds shorter than 200 bp and screen for possible vector contamination, which is why I am trying to use NCBI VecScreen. However, as a beginner, I do not know where to find the option for removing scaffolds <200 bp, or how to upload my FASTA data, since I have no accession number yet.

Please let me know if you are familiar with such a tool. Many thanks!

next-gen genome • 1.1k views

If you simply want to remove sequences shorter than 200 bp (no screening per se), you can use reformat.sh from BBTools:

reformat.sh in=your.fa out=filtered.fa minlength=200
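
If installing BBTools is not an option, the same length filter can be sketched in plain Python. This is a minimal illustration, not a reimplementation of reformat.sh: the function names and the script name `filter_fasta.py` are made up for this example, N characters are counted toward scaffold length, and scaffolds of exactly 200 bp are kept (matching "remove all scaffolds less than 200" in the question).

```python
import sys


def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, chunks = None, []
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line.strip())
    if header is not None:
        yield header, "".join(chunks)


def filter_min_length(records, min_length=200):
    """Keep records whose sequence has at least min_length characters.

    Assumption: N's (scaffold gaps) count toward the length.
    """
    for header, seq in records:
        if len(seq) >= min_length:
            yield header, seq


def write_fasta(records, handle, width=80):
    """Write records as FASTA, wrapping sequence lines at `width` characters."""
    for header, seq in records:
        handle.write(f">{header}\n")
        for i in range(0, len(seq), width):
            handle.write(seq[i:i + width] + "\n")


if __name__ == "__main__":
    # Hypothetical usage: python filter_fasta.py your.fa filtered.fa
    in_path, out_path = sys.argv[1], sys.argv[2]
    with open(in_path) as src, open(out_path, "w") as dst:
        write_fasta(filter_min_length(read_fasta(src)), dst)
```

Saved as `filter_fasta.py`, it would be run as `python filter_fasta.py your.fa filtered.fa`. It only filters by length; it does no contamination screening.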

Thanks, genomax. Can this command be run in the Mac terminal?


Yes. BBTools is written in Java, so you just need a Java runtime.


Hi biomax, please check the edited question. Thanks in advance for your valuable answers.


Thanks, it works well.
