I am writing regular error reports for plant sequences that are being annotated as fungi and vice versa to the GenBank email adress for annotation updates. About 3/4 of my email are being ignored and the wrong sequences are still online, although it is obvious they are not correctly annotated. I think this is highy concerning. How can I get GenBank to listen to me? I put a lot of work into trying to fix all the errors I can find.
Well if I am persistent enough and re-mail the same email a couple of times eventually somebody takes action. (I wait 2 weeks minimum between the reminders.) Here are two examples of sequences I "removed" from GenBank:
https://www.ncbi.nlm.nih.gov/nuccore/KC840062 https://www.ncbi.nlm.nih.gov/nuccore/JF409911
here are some more: KC662173 : KC662175 : KT201445 : KJ999405 : KP294358 : EU593548 : EU593547
I'll join genomax on the kudos part :)
Out of curiosity what kind of evidence do you provide (negative/non-specific blast searches)? One of the records that you link above says
You are obviously not the submitter of that record. Perhaps NCBI just uses a standard template to provide that message. Kudos for doing your part to purge incorrect information from GenBank.
At this point I was wondering how many of those Pan genome / comparative genome / genome announcement studies perform a contamination check before analyzing the data.
These ones were pretty obvious cases. I put a screenshot of the top ~30 BLAST results for each sequence in the email and if most of them are fungi and not plant sequences it's kinda obvious the uploader did not check the sequence before uploading.
I email the submitter first, because that is obviously the correct way to go. But sometimes the submitter has left academia/died/does not take action and then there is no other choice than to contact the database maintainer.
Here is a sequence I am working on right now and the submitter has answered so hopefully it will be gone soon. KT695200
But the whole process is so tiresome. I wish I could just click "report" in the sequence profile and it would trigger a manual investigation.