Hello !
I'm attempting to utilize the Sarek pipeline for analyzing Ion Torrent data, but I'm facing challenges with configuring single-end files. Unlike Illumina, the Ion Torrent data I have comes in the form of a single FASTQ file per sample.
My issue lies in adapting the sample sheet to accommodate these single-end data. According to the documentation, the sample sheet structure is designed for paired-end data, whereas in my case, I only have one FASTQ file per sample.
Here's an example of my data structure:
patient,sample,lane,fastq_1
patient1,test_sample,lane_1,test.fastq.gz
I've created this simplified configuration for my single FASTQ file, but I'm wondering if there's a specific method to correctly adapt the sample sheet for Ion Torrent single-end data.
Additionally, are there specific parameters or configurations within the Sarek pipeline that need adjustment to support this single-end data type from Ion Torrent?
Any help or guidance in correctly adapting the nf-core/sarek configuration for Ion Torrent single-end data would be greatly appreciated!
Thank you very much for your assistance!
Best regards,
Nour-Eddine
Thank you for your clear and helpful response regarding Sarek and single-end data. I appreciate your help!
I'm considering converting my Ion Torrent data into R1 and R2 files to suit Sarek. Do you think this conversion would be suitable in this context, or are there other considerations I should keep in mind?
I would definitely not recommend trying to convert single end fastq files into pseudo paired end data. In my opinion it would be better to use/adapt a different pipeline or simply write your own.