I have a big genomic file including coordinate as below.
Data:
chr10 42383177 42384128
chr10 42384129 42385080
chr10 42385081 42386032
chr10 42386033 42386984
chr10 42386985 42387936
chr10 42387937 42388888
chr10 42388889 42389840
chr10 42389841 42390793
I would like to extract some features from UCSC like:
- GC content
- conservation score (phastCons)
- repeat fraction (RepeatMasker)
is there any way to download the whole genomic table from UCSC or any script to get those scores for my given coordinates? How can I get those features from UCSC?
Many thanks in advance!
@Many Thanks Luis.
I have two more questions:
What is the difference between PhastCons100way and PhastConsElements100way? and which one should be used?
I could not find 'phastCons100way.valueAverage', there is only phastCons100way.value' that is for 100 specious when I check the output file, is there any option to give me the average of scores?