7.7 years ago
steven.davis
Is there a way to extract instrument and flowcell from a read name like this:
>gnl|SRA|SRR1840614.1.1 FCC1KPRACXX:1:1101:1291:2172
FCC1KPRACXX : what is this?
1 = lane?
1101 = ?
1291 = x
2172 = y
I need to extract the flowcell, if possible, so that it can be assigned to reads downstream in a SAM file read-group tag.
Pierre's description of the numbers is correct. Additionally, I suspect it is theoretically possible to convert "FCC1KPRACXX" into an instrument, but I'm not aware of a tool that does that. Illumina puts that string at the beginning of sequence identifiers, and it certainly has meaning to them, so I suggest you contact Illumina and ask how to translate it into a specific instrument. If you get a useful response, I'd encourage you to post it here.
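For the downstream read-group step, here is a minimal Python sketch of how the colon-separated part of the read name could be split and turned into an @RG header line. It assumes the layout guessed at in the question (identifier:lane:tile:x:y) and treats the first field as the flowcell-style identifier without verifying that. The names `ReadOrigin`, `parse_read_name` and `read_group_line` are made up for illustration, and the `SM` and `PL` values are placeholders you would set yourself.

```python
from typing import NamedTuple


class ReadOrigin(NamedTuple):
    """Fields taken from the colon-separated part of the read name."""
    flowcell: str  # assumed: flowcell-style identifier, kept verbatim
    lane: int
    tile: int
    x: int
    y: int


def parse_read_name(header: str) -> ReadOrigin:
    """Split a header like '>gnl|SRA|SRR1840614.1.1 FCC1KPRACXX:1:1101:1291:2172'.

    Assumes the last whitespace-separated token has exactly five
    colon-separated fields: identifier, lane, tile, x, y.
    """
    token = header.strip().split()[-1]
    fields = token.split(":")
    if len(fields) != 5:
        raise ValueError(f"expected 5 colon-separated fields, got {len(fields)}: {token!r}")
    flowcell, lane, tile, x, y = fields
    return ReadOrigin(flowcell, int(lane), int(tile), int(x), int(y))


def read_group_line(origin: ReadOrigin, sample: str, platform: str = "ILLUMINA") -> str:
    """Build a tab-separated @RG line, using '<identifier>.<lane>' for ID and PU.

    'sample' and 'platform' are supplied by the caller; they cannot be
    recovered from the read name.
    """
    unit = f"{origin.flowcell}.{origin.lane}"
    return "\t".join(["@RG", f"ID:{unit}", f"PU:{unit}", f"SM:{sample}", f"PL:{platform}"])


if __name__ == "__main__":
    origin = parse_read_name(">gnl|SRA|SRR1840614.1.1 FCC1KPRACXX:1:1101:1291:2172")
    print(origin)
    print(read_group_line(origin, sample="SRR1840614"))
```

Using identifier.lane for the PU field follows the example given in the SAM specification for Illumina data. Whether the leading "FC" should be stripped from the identifier is left open here, so the string is passed through unchanged.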