How to extract sequences by barcode?
5.8 years ago
suvratha ▴ 70

Hello,

https://www.ncbi.nlm.nih.gov/sra/SRX2791702[accn]

The design section of the above link lists barcode sequences. How do I get all the reads carrying each particular barcode?

I tried running grep '^<barcode sequence>' on the fastq file. However, the last column in the link gives the number of sequences per barcode, and the count I get with grep does not match the number they provide.

Am I using grep incorrectly? At what position in the read are these barcode sequences located?
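
For reference, the exact-match count that the grep attempt is aiming for can be sketched in Python so that only the sequence line of each record is tested. This is a minimal sketch, assuming a plain 4-line-per-record FASTQ and that the barcode sits at the very start of the read; the function name `count_barcode_prefix` and the barcode `ACGT` are made up for illustration.

```python
from itertools import islice

def count_barcode_prefix(fastq_lines, barcode):
    """Count reads whose sequence starts with `barcode` (exact match only)."""
    count = 0
    # In a 4-line FASTQ record the sequence is the 2nd line (indices 1, 5, 9, ...).
    for seq in islice(fastq_lines, 1, None, 4):
        if seq.startswith(barcode):
            count += 1
    return count

# Tiny in-memory example; the reads and the barcode are invented.
example = [
    "@read1", "ACGTTTGACC", "+", "IIIIIIIIII",
    "@read2", "TTACGTGACC", "+", "IIIIIIIIII",
    "@read3", "ACGTAAGACC", "+", "ACGTIIIIII",  # quality line is never tested
]
print(count_barcode_prefix(example, "ACGT"))  # 2
```

With a real file, an open file handle can be passed in place of the list, e.g. `count_barcode_prefix(open("reads.fastq"), "ACGT")`, where `reads.fastq` is a placeholder name.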

Thanks!

sequencing fastq illumina barcode • 2.2k views

Use GBS tools such as GBSX to extract reads with defined barcodes.


This helped, thanks!

5.8 years ago
Ido Tamir 5.2k

It would help to state precisely how your read counts differ from the stated numbers. If the stated number is larger in all samples, that is because demultiplexing is often, and by default, done allowing 1 mismatch, which grep cannot do.
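
To illustrate what matching with 1 mismatch means, here is a rough Python sketch that assigns each read to a barcode when the start of the read differs from that barcode in at most one position. It assumes the barcode is at the start of the read and that the FASTQ has 4 lines per record; the helper names (`hamming`, `assign_barcode`, `count_per_barcode`) and the barcodes are invented for the example, and real demultiplexers may handle ambiguous matches differently.

```python
from collections import Counter
from itertools import islice

def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))

def assign_barcode(seq, barcodes, max_mismatches=1):
    """Return the one barcode matching the start of `seq`, else None.

    Reads matching no barcode, or more than one, are left unassigned.
    """
    hits = [bc for bc in barcodes
            if len(seq) >= len(bc)
            and hamming(seq[:len(bc)], bc) <= max_mismatches]
    return hits[0] if len(hits) == 1 else None

def count_per_barcode(fastq_lines, barcodes, max_mismatches=1):
    """Count assigned reads per barcode in a 4-line-per-record FASTQ."""
    counts = Counter()
    for seq in islice(fastq_lines, 1, None, 4):  # sequence lines only
        bc = assign_barcode(seq.strip(), barcodes, max_mismatches)
        if bc is not None:
            counts[bc] += 1
    return counts

# Invented reads and barcodes, for illustration only.
reads = [
    "@r1", "ACGTTTGACC", "+", "IIIIIIIIII",  # exact match to ACGT
    "@r2", "ACCTTTGACC", "+", "IIIIIIIIII",  # one mismatch to ACGT
    "@r3", "TGCATTGACC", "+", "IIIIIIIIII",  # exact match to TGCA
    "@r4", "GGGGTTGACC", "+", "IIIIIIIIII",  # matches neither barcode
]
print(sorted(count_per_barcode(reads, ["ACGT", "TGCA"]).items()))
# [('ACGT', 2), ('TGCA', 1)]
```

Setting `max_mismatches=0` reproduces exact matching, which makes it easy to see how many reads are gained by allowing one mismatch.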


grep gives more than the stated number. For example, one of the barcodes is listed with about 6.1k sequences, but grep gives me 13.5k.
