Hello Friends,
I have Illumina Miseq amplicon sequences that I need to demultiplex for use in qiime2. My library prep used a dual indexed method that allowed me to multiplex 4 separate gene markers (ie. 16S, 18S...) per sample. Each individual sample number has a tag that I can easily demultiplex, but I am having trouble demultiplexing the separate genes out of each sample. I figure I can use the primer pairs that were used in the original amplification step to parse out the 4 genes within each sample. Unfortunately, Qiime does not offer this in their demultiplexing tools. Can anyone point me in the right direction or have any tips on the best way to demultiplex my samples by both sample name and then by gene.
Thanks, Danny
could you post an example of what your data looks like?
I created a subset of one of my sample files that was created after demultiplexing using the illumina indexs. Within the forward and reverse files are amplicons of 16S, 18S, UPA, and rbcL. I want to use the primers that I amplified each of these genes with to parse them out. Below are forward reads taken from the same sample, the first two are 18S reads that you can see the forward primer (GTACACACCGCCCGTC). The third read is from the UPA amplicon and starts with a separate primer that was used to amplify them (GGACAGAAAGACCCTATGAA). Does this make more sense?
BBCDBFFFFFFCGGGGGGGGGGHGGGGGGGGHGHGGGGGGHHGGGGGHGHHHHHGGGHHGHHGHGGGGGGGHGGGGGHHHHHHHHHHHGGGGGHHHHHGGGGGGGHGHHGGGGGGGGHHHHGGGA;CGHHHHHHHHHHHHHGGGGFFGGGGGGGGGGGGGGCGGGFGGGGGFFFFFFFFFFFFFF.DFFFFFFFFFFFFFFFFFFFFFFFFFFFDFFFFFFFFFFFFFFFFFFFFFFFFFFFFDFFFFFFF @M00348:26:000000000-D2NFB:1:1101:16189:1725 1:N:0:GGAGCTAC+ACTGCATA GTACACACCGCCCGTCGCTACTACCGAGTGAATTATGTCATGATGCCCTGGGACTGGACGTTGAACGGGTGTCAAAGCCTGTTCGATGCTAGAATCAGCGTAAAATGGCGCAATTTCGAGGAAGTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCAGAAGGATCAA + ABBBBFFFFFFFFGGGGFGGGGHHGGGGGGGGGHHHGGGGHHHHHGGGHHHHHHHHGGGGHHHGGGGGGHHGHGGGGDHHGHHHHHHHHHGGHHHHHHHHGGGGGHHHHHHHHGGGGGHHHGHHHGHHHHHFHCCGHHHHHHHHHHHGDFGGGGFFGGGGGGGGGGGGAFEGFFGFFFFFFFFFFFFFFFFFFFFEFFFEDFFFFFFEFFFFFFFAFFFB@@CFBFFFBBFFFFFFFFFFFFFFDFFFBA. @M00348:26:000000000-D2NFB:1:1101:12525:1946 1:N:0:GGAGCTAC+ACTGCATA GTACACACCGCCCGTCGCTACTACCGATTGAATGAATTAGTGAGCTTCAGAGATCGAGCTGTTTCGGGCAACCGGGTCAGTTTGAGAACCGAATCAAACTTGCTCATTTAGAGGAAGTAAAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCAGAAGGATCAT + EEDEEFFFFFFEGGGGGGGGGGHGGGGGEFGHHHHGGGGGHHGGGGGHHHHHHHHGGGGHHHGGGGGGHHHHGGGGGGHHHHHHHHHHHHHHHHHHHHHGHGGGGHGHHGGEFGGHGHHFGGG.CGHHHHHHHHGGGHHGGGHHHHGGGGGGGGGGGFGGGGGGGGGGGDFFFFHFFFFFFFHFHFFFFFFFHFFFFFFFFFFFFFFFFFDFFFFFFFFFFFFFFFHFFFFFFFFFFFFFFFFFFFFFFFF @M00348:27:000000000-D2TDJ:1:1101:8422:19699 1:N:0:GGAGCTAC+ACTGCATA GGACAGAAAGACCCTATGAAGCTTTACTATAGCCTGGAATTGTGTTCGGGCTTCGCTTACGCAGGATAGGTGGGAGGCTGTGAAGTTCTGCTTGTGGGCAGGATGGAGCCAACGGTGAGATACCACTTTAGCGAGGCTAGAATTCTAACCCCTGCCCGTCATCCGGGAGGGAGACAGTTTCAGGGGGGTAGTTTGACTGGGGCGGTCGCCTCCTAAAAGGTAACGGAGGCGCGCAAAGGTTCCCTCAGG