I'm looking to create a list of regular expressions that can distinguish between the IDs of various databases. I know that some will be ambiguous, but at least it could help narrow down which databases to check.
For example:
KEGG IDs: [A-Za-z]{0,3}\d+
Entrez IDs: \d+
RefSeq IDs: [A-Z]{2}_\d+\.\d+
Anyone have any to add? This might be a useful community resource.
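To show how a list like this might be used, here is a minimal Python sketch that returns every database whose pattern matches an ID in full. The patterns are tightened variants of the three above and are my own assumptions, not official format definitions; the names `ID_PATTERNS` and `candidate_databases` are made up for illustration.

```python
import re

# Assumed patterns, loosely based on the list above; not official ID formats.
ID_PATTERNS = {
    "KEGG": re.compile(r"[A-Za-z]{0,3}\d+"),     # up to 3 letters, then digits
    "Entrez": re.compile(r"\d+"),                # digits only
    "RefSeq": re.compile(r"[A-Z]{2}_\d+\.\d+"),  # e.g. NM_000546.5
}


def candidate_databases(identifier):
    """Return the names of all databases whose pattern matches the whole ID."""
    return [
        name
        for name, pattern in ID_PATTERNS.items()
        if pattern.fullmatch(identifier)  # fullmatch avoids partial hits
    ]


print(candidate_databases("NM_000546.5"))  # ['RefSeq']
print(candidate_databases("7157"))         # ['KEGG', 'Entrez'] (ambiguous)
print(candidate_databases("BRCA1"))        # []
```

Using `fullmatch` rather than `search` matters here, since otherwise `\d+` would hit any ID that merely contains a digit. Ambiguous IDs come back with several candidates, which is the "narrow down" behaviour described above.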
Very useful topic.
Changed to community wiki.
Yeah, I've been trying to convert a 'mixed bag' of IDs and I had trouble even placing some of them. Hopefully this will help out.