Hello community,
I would like to have an overview of all
formats a acc. no.
can have/ types of acc. no..
So far I have been working with acc. no. with 3 letters + 5 numerals
or 3 l. + 7 n.
or 3 l. + 9 n.
(eg. MAM88718, MBA1146895 or WP_088983797).
But now I have encountered some with only 1 numeral +3 letters
(eg. 5YFE). It looks like a pdb ID
to me rather than an acc. no. but I got that reference from a search in the non-redundant NCBI DB
...
So...
I would like to know where they come from.
Does anyone know what kind of acc. no.
this is?
And...
I remember some time ago I found a nice list that displayed an overview of all kinds of acc. no. like:
Nucleotide: 1 letter + 5 numerals OR 2 letters + 6 numerals
Protein: 3 letters + 5 numerals
WGS: 4 letters + 2 numerals for WGS assembly version + 6-8 numerals
MGA: 5 letters + 7 numerals
... just like here: https://letgen.org/GGGB%20Genetics%20&%20Bioinformatics/accession-number/ or here: https://academic.oup.com/nar/article/44/D1/D733/2502674 (but neither of those is the website I remember - it was nicer).
That is it! Thank you :)