Question

Regions of extreme GC content

1

Entering edit mode

9.7 years ago

wanabi ▴ 60

Hello for CNV analysis QC, I am looking for a reliable bed file of regions with extreme (>90%, <10%) GC content.

Any idea were I can find such file? I tried UCSC but it gives it in bins of 5bp, which is not very convenient.

Thanks

gc-content • 3.1k views

ADD COMMENT • link updated 2.9 years ago by Ram 45k • written 9.7 years ago by wanabi ▴ 60

GenoMax · Answer 1 · 2015-10-19

5

Entering edit mode

9.7 years ago

dariober 15k

It's not too difficult to generate a bed file of GC content along the genome. You just need the reference fasta file and a genome file giving the length of the chromosomes. Then with bedtools, first create sliding windows along the genome and for each window calculate the %GC, then use e.g. awk to get rows where %CG is above/below a threshold, something like:

bedtools makewindows -g hg19.genome -w 1000 \
| nucBed -fi hg19.genome.fasta -bed \
| awk '$5 > 0.9 || $5 < 0.1'

See 'nucBed -h' for the output format, I think %GC is going to be in column 5, not sure though.

ADD COMMENT • link 9.7 years ago by dariober 15k

0

Entering edit mode

what is the bed file in this command?

ADD REPLY • link 8.5 years ago by fatima.m.zare ▴ 30

0

Entering edit mode

Did you notice the | in the command? Output of bedtools makewindows is directly being piped into nucBed (it is in the bed format).

ADD REPLY • link 8.5 years ago by GenoMax 152k

0

Entering edit mode

Actually I have an error:

"Less than the req'd two fields were encountered in the genome file (hg19.genome) at line 1."

and my hg19.genome is:

chr1    249250621
chr2    243199373
chr3    198022430
chr4    191154276
chr5    180915260
chr6    171115067
chr7    159138663
chrX    155270560
chr8    146364022
chr9    141213431
.
.
.

ADD REPLY • link updated 8.5 years ago by GenoMax 152k • written 8.5 years ago by fatima.m.zare ▴ 30

0

Entering edit mode

I can fix my error... thanks for your help.

ADD REPLY • link 8.5 years ago by fatima.m.zare ▴ 30

0

Entering edit mode

What turned out to be the problem?

ADD REPLY • link 8.5 years ago by GenoMax 152k

0

Entering edit mode

I changed "chr1" to "1" and .... . then my error was fixed.

ADD REPLY • link 8.5 years ago by fatima.m.zare ▴ 30

0

Entering edit mode

Do you know how can I compute mappability? I want to fix mappability bias and I need mappability like gc content.

ADD REPLY • link 8.5 years ago by fatima.m.zare ▴ 30