I got the good quality genome IDs for 54000 genomes like below:
#genome
G001281285
G000014725
G000775715
G000254175
G001380675
G900057405
G001076295
and I also got all 74000 genome sequence files compressed in fna folder like below
cd fna/
G001284865.fna.bz2 G002910165.fna.bz2 G009390615.fna.bz2
G001284885.fna.bz2 G002910195.fna.bz2 G009390655.fna.bz2
Now could you please help how I can extract the 54000 genome sequence files based on above genome IDs from fna/ folder?
or are you looking for a 'bash script' to process all files automatically? (if so, this is not clear from your post)
Thanks lieven, I have updated the post, I want to extract 54000 genome files based on genome ID from fna folders which contains 74000 individual genome files in compressed form.
Many thanks
thanks lieven, Yes, it will be great if I can have bash script to uncompress all files automatically.
all compress files is in filtered/ folder and I am thinking to use below loop but sure if it correct?
or can I used just
Many thanks
the latter should normally work indeed. (simplest to use this approach)
the bash loop will not work as it is, change it to: