Genome Sequencing - what is the 'identifier'?
0
0
Entering edit mode
3.3 years ago
pollyyhjo ▴ 10

What is the 'identifier' of the first read in both files? Here is the code I get. Also, what does this identifier of both reads tell us? enter image description here

enter image description here

coding sequence identifier • 996 views
ADD COMMENT
0
0
Entering edit mode

Specifically: https://en.wikipedia.org/wiki/FASTQ_format#Illumina_sequence_identifiers

So there is no identifier (as far as a sample ID goes) inside an Illumina file. You would normally have that information in the name of the file. If someone "coded" the names to be generic (like what you have) then you had better have a key/metadata file that links the index sequence you see in header (GGACTCCT+CTCCTTAC) with a sample_ID/file names.

ADD REPLY
0
Entering edit mode

So we cannot tell what identifier from the code above?

ADD REPLY
0
Entering edit mode

Identifier for? If for sample, then no.

But if you wanted to know what sequencer the sample ran on then you get the serial number NB551191. Flow cell serial number is HM5WHBGX5. Data is from lane 1.

ADD REPLY

Login before adding your answer.

Traffic: 2034 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6