Hello,
I would like to use ORFik to map Ribo-reads to different ORFs in the maize genome. The latest version of the genome is Zm-B73-REFERENCE-NAM-5.0.fa. The annotation file is a GFF3. I have the genome fasta file, the fasta fai file, and the GFF3 file. The ORFik package uses GTF instead of GFF3, so I used gffread to convert my GFF3 to a GTF. Below you can see my code:
# Import packages ----
library("ORFik", lib = "~/Rlibs") # Loads the package
library("GenomicRanges", lib = "~/Rlibs")
library("GenomicFeatures", lib = "~/Rlibs")
# Specify files locations
where_to_save_config <- "~/Documents/R/Ribo-seq/ORFik_config.csv"
parent_folder <- "~/Documents/R/Ribo-seq"
fastq.dir <- file.path(parent_folder, "raw_data")
bam.dir <- file.path(parent_folder, "processed_data")
reference.dir <- file.path(parent_folder, "references")
config.save(where_to_save_config,
fastq.dir, bam.dir, reference.dir)
# Check ORFik config structure
config()
# Folder structure for uORF experiment
conf <- config.exper(experiment = "uORF_maize", # short experiment info
assembly = "Zm-B73-REFERENCE-NAM-5.0", # In reference folder
type = c("Ribo-seq")) # fastq and bam type
conf
# Assign local annotation files
gtf <- "/home/R/Ribo-seq/references/Zea_mays.Zm-B73-REFERENCE-NAM-5.0.55.gtf"
genome <- "/home/R/Ribo-seq/references/Zm-B73-REFERENCE-NAM-5.0.fa"
makeTxdbFromGenome(gtf, genome, organism = "Zea mays")
When I run makeTxdbFromGenome(gtf, genome, organism = "Zea mays")
I get the following output:
Making txdb of GTF
Import genomic features from the file as a GRanges object ... Error in .tidy_seqinfo(gr, circ_seqs, chrominfo) :
'chrominfo' must describe at least all the chromosomes of the genomic features imported from the file. Chromosomes missing
from 'chrominfo': 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
Could the conversion have messed up the GTF? I see the chromosome info in both the GFF3 and the GTF files.
Thanks ahead for any guidance.
Thanks again.