Annotation using Pfam
1
0
Entering edit mode
7 months ago

Hi Community,

I have 17 metagenomically assembled genomes (MAGs). I want to annotate the protein of these genome using Pfam.

I want a step by step guideline for the annotations in Linux bash.

My idea is: I need to download the Pfam database but there are lot of files and I am confused which one should I download. Next I need to use HMM for the annotations.

Thank you so much!

annotation metagenomic Protein Pfam • 798 views
ADD COMMENT
4
Entering edit mode
7 months ago
Mensur Dlakic ★ 28k

This is not going to be step by step, but hopefully still useful. First download and unpack Pfam HMM database:

wget https://ftp.ebi.ac.uk/pub/databases/Pfam/current_release/Pfam-A.hmm.gz
gunzip Pfam-A.hmm.gz
hmmpress Pfam-A.hmm

Assuming your protein sequences are in proteome.faa, do the search:

hmmscan -o annotations.txt --tblout annotations.tbl -E 0.01 --cpu 4 Pfam-A.hmm proteome.faa

Programs from the HMMer package are described here.

ADD COMMENT
0
Entering edit mode

Thank you so much

ADD REPLY
0
Entering edit mode

Dear Prof/Dr/Sir,

May I kindly ask, is the E-value used usually is 0.01? Or we can be more stringent with going with 1e-5 which are taught by the academician as a usual standards (to expect 1 error in 100,000)?

Thank you very much

ADD REPLY
0
Entering edit mode

I think you already answered your own question. If you want to be more stringent and get strong hits without any manual checking, go with lower E-values. If you want to include more potential hits for later manual inspection, higher E-values are acceptable.

ADD REPLY

Login before adding your answer.

Traffic: 2535 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6