Entering edit mode
9.1 years ago
reneebend
•
0
Dear Professors,
I am working on a project where I need to find both the longest transcript (and NM#) and the HGMD transcript (and NM#) for a list of over 100 genes. Then I need to see if these two match.
I have figured out a way I can do this clicking through many links on the UCSC genome browser page, but I'm wondering if there's an easier or quicker way to access this information.
Thank you!
NCBI-NOW! Student,
Renee Bend
You can use UCSC's MySQL access to get this information. You can fetch all mRNA sequence IDs (NM_*) for a gene and query the gbSeq table with them to get their size (or use HAVING to pick the one with the max size).
Hi, Thank you so much for your answer, but it seems like I may need an alternate route, at least for the time being, as I am not yet able to download MySQL.
Do you have any ideas for being able to do this fully online?
Thank you,
Renee
I have come up with a way to get some of the information I need, though I still haven't figured out how to get an amino acid count.
I am using BioMart.
Again, at this point I am clicking on individual
XP_ID
s to get the amino acid length for each protein, and note which one is the longest. Any tips for automating this part would be helpful. I still don't have authorization to download MySQL.Thank you!
Renee
See if Galaxy can help you run queries to UCSC MySQL