How to remove redundant and poor quality ESTs
1
0
Entering edit mode
10.0 years ago
dinesh ▴ 50

Hi community, i have a question that, How to remove redundant and poor quality ESTs from whole data set through online .......?

genome blast alignment • 2.3k views
ADD COMMENT
1
Entering edit mode
10.0 years ago
Prakki Rama ★ 2.7k

These are not online, but can be helpful.

Redundant sequences

  1. Check this biostars post to remove exact duplicate sequences.
  2. If you want to remove sequences based on similarity cutoff, you can try cd-hit-est, uclust etc.

Poor quality sequences

  1. If you have reads, then you can map those reads using some mapping tools such as bowtie, BWA and check if the sequence has sufficient coverage or not. Those with insufficient must be of poor quality.
ADD COMMENT
0
Entering edit mode

1) sir, the default options in cd-hit-EST are enough to run the programme or i have to change the value(suggest me)..........

2) I found EGassembler online software. can i use this one for my work

ADD REPLY
0
Entering edit mode

It depends on your requirement. If suppose you want to collapse shorter sequences covering 80% of the longer sequence with 70% identity, you need to change the parameters accordingly. Look for options -aS and -c in cd-hit. I have not used EGassembler before, so I cannot comment.

ADD REPLY

Login before adding your answer.

Traffic: 1995 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6