Best Approach To Find Non-Human Sequences In Human Ests
2
5
Entering edit mode
14.5 years ago
Neilfws 49k

Several years ago, I was involved with a project to detect putative archaeal sequences in human sequence data. Unfortunately, as is often the case in academia, the database described in the publication was not maintained when I moved to a new job.

This type of study has perhaps been superseded by the Human Microbiome Project. However, I'm still interested in methods to detect potentially-interesting "contaminant" sequences in public databases of sequences that are (supposedly) from one organism.

Originally, we used BLAT to search the human EST database using archaeal genes (from complete genome sequences) as the query. My questions are:

  1. Was using these 2 databases the best approach, or are there better sources?
  2. Would you use BLAT today for this type of task, or a different tool?
sequence comparison database search est • 3.2k views
ADD COMMENT
0
Entering edit mode

@neilfws: Any luck finding an answer?

ADD REPLY
2
Entering edit mode
ADD COMMENT
0
Entering edit mode

Interesting. Although when I go to download at their website, it seems that there are the same number of "pre-cleansed" human ESTs as "cleansed".

ADD REPLY
0
Entering edit mode
13.5 years ago

I guess searching hgEST with genes from complete archaeal genomes is just fine. One could perhaps expand the number of archaeal genes by including genes from draft genomes or any archaeal genes from Genbank. Genes could be simple predictions from Glimmer or Prodigal. ESTs and genes could be pre-clustered before searching (see next):

For the second question I will suggest Uclust.

ADD COMMENT

Login before adding your answer.

Traffic: 2551 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6