Question

HiC data - How to know the restriction enzymes that were used for generating HiC datasets


3.4 years ago

nkmalini97 • 0

Hello! Most tools for downstream analysis of HiC sequencing data need to know which restriction enzyme was used to generate the library. Is there any way to infer it from the sequencing data? Can I get this information from the fastq files?

restriction_enzyme HiC • 2.4k views



Figuring out HiC restriction enzyme

3.4 years ago by ATpoint 86k

score 1 · Answer 1 · 2021-08-26


3.3 years ago

mvk ▴ 10

Hi,

DpnII was used in my case: I had 'GATCGATC' ligation sites in my reads.

A good way to check this is to map your reads against a reference genome and see whether the mapped reads are split at GATC-GATC (or at the junction of another enzyme site). The overhangs at the enzyme cut sites are filled in and then ligated, which duplicates part of the recognition site at the junction, so the junction sequence tells you which enzyme was used. Or you can ask the supplier of the data, of course ;-)
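If you want a quick look before mapping anything, the same idea can be tried directly on the raw reads. What follows is only a rough sketch, not a validated method: it builds the junction expected after fill-in and blunt-end ligation for a few common 5' overhang enzymes and reports the fraction of reads containing each one. The candidate list, the function names and the `max_reads` cap are all assumptions made for illustration; extend the list with whatever enzymes you suspect.

```python
import gzip
from itertools import islice

# Candidate enzymes (an illustrative, incomplete list): recognition site and
# top-strand cut offset. All of these leave a 5' overhang.
CANDIDATES = {
    "DpnII/MboI": ("GATC", 0),   # ^GATC
    "HindIII": ("AAGCTT", 1),    # A^AGCTT
    "NcoI": ("CCATGG", 1),       # C^CATGG
}


def ligation_junction(site, cut_offset):
    """Junction expected after fill-in and blunt-end ligation of a 5' overhang."""
    return site[:len(site) - cut_offset] + site[cut_offset:]


def read_sequences(fastq_path, max_reads=100_000):
    """Yield up to max_reads sequences from a plain or gzipped FASTQ file."""
    opener = gzip.open if str(fastq_path).endswith(".gz") else open
    with opener(fastq_path, "rt") as handle:
        # The sequence is the 2nd line of every 4-line FASTQ record.
        for line in islice(handle, 1, max_reads * 4, 4):
            yield line.strip().upper()


def junction_fractions(sequences):
    """Return {enzyme: fraction of reads that contain its ligation junction}."""
    motifs = {name: ligation_junction(*spec) for name, spec in CANDIDATES.items()}
    counts = dict.fromkeys(motifs, 0)
    total = 0
    for seq in sequences:
        total += 1
        for name, motif in motifs.items():
            if motif in seq:
                counts[name] += 1
    return {name: (counts[name] / total if total else 0.0) for name in counts}
```

Usage would look like `junction_fractions(read_sequences("sample_R1.fastq.gz"))`, where the file name is a placeholder. Keep in mind that an 8-base junction turns up by chance more often than a 10-base one, so the fractions are not directly comparable between enzymes; one motif standing far above the others is a hint to confirm, not proof.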



@mvk Thank you so much for clarifying. Is there any way I can get it directly from the read sequences in the fastq files? The enzyme generally has to be known from the very start of a HiC analysis. Some tools also need the ligation site as a parameter. I create a BED file of the reference genome digested with the enzyme that was used, then pass it along with the data to a tool that does the mapping and reports the valid pairs. So I am interested to know whether I can get any idea about the enzyme from the fastq files themselves. Also, which mapping tool can I use if I follow the approach you mentioned?
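For reference, the digestion step described above can be sketched roughly as follows, assuming a single enzyme and a chromosome sequence already loaded as a string. The function name `digest_to_bed` and its parameters are made up for this illustration; many HiC pipelines ship their own digestion utility, and that is the safer choice for real analyses.

```python
import re


def digest_to_bed(chrom, sequence, site="GATC", cut_offset=0):
    """Return BED-style (chrom, start, end) fragments from cutting at every site.

    cut_offset is the position of the top-strand cut inside the recognition
    site, e.g. 0 for ^GATC (DpnII/MboI) and 1 for A^AGCTT (HindIII).
    """
    sequence = sequence.upper()
    # A lookahead is used so that overlapping occurrences are all reported.
    pattern = re.compile(f"(?={re.escape(site)})")
    cuts = [m.start() + cut_offset for m in pattern.finditer(sequence)]
    bounds = [0] + cuts + [len(sequence)]
    # Drop empty fragments, e.g. when the sequence starts with a cut site.
    return [(chrom, start, end)
            for start, end in zip(bounds, bounds[1:]) if end > start]


# Toy example: two GATC sites in a 14 bp sequence.
for chrom, start, end in digest_to_bed("chr1", "AAGATCTTGATCAA"):
    print(f"{chrom}\t{start}\t{end}")
```

Coordinates are 0-based and half-open, as in BED. Some tools expect extra columns (fragment name, score, strand), which this sketch does not write.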

3.3 years ago by nkmalini97 • 0