I am working with RNA Seq data analysis. I have to download MSU7 rice genome. somehow i downloaded from (http://rice.plantbiology.msu.edu/pub/data/Eukaryotic_Projects/o_sativa/annotation_dbs/pseudomolecules/version_7.0/all.dir/) But i am not able to find Annotation (.gtf) file and reference genome (.fa) file . so how can i find and how can i download. Another question is how to define (--frag-len-mean 262 --frag-len-std-dev 80 ) these two things for oryza sativa (how much i need to give mean and std-dev). help me
In Ensembl Plants, you can download the gtf and the fasta files from their FTP. The assembly there is a "a unified assembly of the 12 rice pseudomolecules of Oryza sativa Japonica Group cv. Nipponbare by scientists from the MSU Rice Genome Annotation Project (MSU) and the International Rice Genome Sequencing Project (IRGSP) / Rice Annotation Project Database (RAP-DB).