Because many journals request to have all the sequences in a public database, I wanted to submit my data to ENA or GenBank, but now I have some problems. The sequencing was done in 2011 with Illumina HiSeq200 of 16S V6 region. After barcode and primer removal the average read length was about 80 bp. My first problem is that GenBank accepts over 200 and ENA over 100 bp long reads, so do I have to find another database that accepts <100 bp reads?
And my second question is do you have to submit raw reads or after quality check? Because ENA requests fastq files, but after denoising I only have fasta files and unable to merge it with quality file. Although it makes more sense to me to upload denoised data and not raw reads as there is not much useful information in the low quality reads.
I totally agree here. Demultiplex. Do not clean. Submit to SRA. GenBank is an incorrect database to submit Illumina reads to.