I have a list of target IDs (think like KEGG ids or Taxids from blast) for TFs identified ffrom our data using JASPAR core databases (vertebrate and insect). I want to make some nice tables using this information but I find no easy way to extract the IDs names and class information from JASPAR.
For example I have a list of ids like:
MA0052.4
MA06602.1
MA0497.1
I want to get their associated TF name, class, and even family information which would be something like:
MA0052.4 MEF2A MADS box factors Regulators of differentiation
MA06602.1 Arid5a ARID ARID-related
MA0497.1 MEF2C MADS box factors Regulators of differentiation
But there seems to be no way to extract this info from JASPAR itself by providing a list. I have to insert these ID names one by one to get them and I have a list of 100s of IDs to go through. Anyone figured out a workaround for this?
The second one was a typo on my part but this isn't giving the class or family information. The TRANSFAC files seem to have that info but I worry if they are differently annotated somehow.
Then take this as a template and modify it to work with the transfac files rather than the jaspar files. It's a great bash-fu training exercise, an essential skill for any analyst.