Entering edit mode
2.4 years ago
jscience
▴
10
I only have the accession codes of several proteins from the MGnify database. I would like to retrieve the full amino acid sequence data from the database, but I have not been able to find a way to do this using the web server or available API.
An example of the accession code is MGYP001242958860.
How can I get the protein sequence?
That does not look like a valid accession: https://www.ebi.ac.uk/metagenomics/search/studies?query=+MGYP001242958860
It does not return any hits when you do a text search, but it is a valid accession for protein data. See https://www.ebi.ac.uk/metagenomics/sequence-search/seq?seq_ac=MGYP000080499703&seq_id=21980190
Are these accessions private? It looks like most of the study accessions available on https://www.ebi.ac.uk/metagenomics/search/studies?page=175 are in the format
MGYPnnnnnnnn
(8 digit number). Examples you show above are much longer and not found in text search.