I am working on a second design for a sequence capture array (similar to a micro-array, but with genomic DNA) and I want to blast a few sequences that seem problematic in my design. The sequences are various ESTs or cDNA sequences that will help me design the probes. I am using blastplus to find the names of the genes corresponding to these sequences.
When I use blastplus with the nr database, with the format "6", I have these info, separated by tabs :
Query name, Gene accession number, % of identity, Length, Mismatch, etc...
I would like to have the same format, but with the names of the genes. It's more useful than the accession number.
Is it possible to modify the format #6 to have this information ?
Thank you very much... sometimes, it's hard to be a beginner in the world of bioinfo !
is it a really a NCBI gene ID or is it the accession number of the EST/cDNA ?
Yeah, I was looking to get the NCBI gene ID, but the only info I had was the accession number. But I have a homemade python script that can help me with that, it parses the info I need and write it in a new file... I didn't write the script though and I wanted to find another solution, which Marina gave me further down. Thanks !