I am testing for a o pipeline of wheat fastq files to do variant calling on them I have a couple of raw and trimmed (cutadapt) files but they are too big for simple testing where cna I get shorter files or shorten the ones that I got in order to be able to test faster?
would that be ok shoudl I take the headers and everything how should I do this?
Fastq files have no headers. It is a plain text file. Please google how to get subsets of text files with Unix tools such as
head
. If the file is compressed, decompress first in a pipe such aszcat your.fastq.gz | head -n 400000 > subset.fq
. This would get you the first 100.000 reads (factor 4 because one read consists of 4 lines, check fastq format specifications on why that is).