Read groups information?
0
0
Entering edit mode
4.6 years ago
Researcher ▴ 20

I am doing sequence analysis on the genomic data of E. Coli. My sequencing comes from Illumina sequencing. As I don't have the detailed information, I set the arguments of AddOrReplaceReadGroups method in Picard tools as follows:

RGID=SampleName RGLB=SampleName RGPU=illumina RGPL=illumina RGLB=SampleName RGSM=SampleName

Is it fine or should I look for the exact value for these fields?

sequencing sequence picard next-gen picard tools • 1.2k views
ADD COMMENT
0
Entering edit mode

There is no formal definition of what is a read group,

This was the first sentence of the tutorial that @Pierre had linked on the last question you had asked about this topic. Are you trying to use GATK for bacteria?

ADD REPLY
0
Entering edit mode

Dear @genomax

yeah, I am using GATK, but, I skipped some steps like "Base quality score recalibration (BQSR)" for bacteria. So, now my question is for which steps of GATK, the read group information is needed?

ADD REPLY
0
Entering edit mode

Any answer from @genomax , @ATpoint or anyone else?

ADD REPLY
0
Entering edit mode

Any answer from @genomax , @ATpoint or anyone else?

ADD REPLY
0
Entering edit mode

I am not a regular GATK user but all GATK/Picard tools likely require RG.

That said snippy was a simpler tool recommended today for bacterial SNP calling.

ADD REPLY
0
Entering edit mode

Do you plan to use any tools that rely on exact information for this like some of the Broad Institute/Picard/GATK tools family? if not then just leave it as is, I personally never care about the inconsistently-defined concept of RGs.

ADD REPLY
0
Entering edit mode
ADD REPLY

Login before adding your answer.

Traffic: 2715 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6