I have a set of results from Whippet, which lists deltaPsi values for segments of genes (which may or may not be exons). For the analysis I have the relevant Ensembl Gene IDs, but I need to identify which specific isoforms involve these segments that are flagged as significantly different between conditions.
Is there a quick programmatic way of extracting all transcripts for the particular gene which contain a particular segment coordinates?
You tagged the biomart, does that mean you have tried that but could not find what you are looking for? Asking this because what you describe as a problem is a kind of job that the biomart can help with an answer.
I tagged it biomart as I expect there probably is a solution using biomart but I'm not too au fait with it (or biomaRt, the R package) to extract what I need.
I'm afraid BioMart is not the best way of doing it as it's gene oriented. If you decide to output Ensembl transcript stable IDs (ENSTs) for a given genomic region, the the BioMart is going to look for a gene overlapping this region and print all of the gene's transcripts. You could, however, do it using the REST API and overlap endpoint described here: https://rest.ensembl.org/documentation/info/overlap_region Here's na example: https://rest.ensembl.org/overlap/region/human/17:27630005-27630969?feature=transcript;content-type=application/json