Hi there , I am working on transcriptome project on Solanum lycopersicum. I have download reference genome and GFF files from https://www.ncbi.nlm.nih.gov/genome/7?genome_assembly_id=1878360
While checking the GFF file I got know to there is ? symbol in the strand column
NC_007898.3 RefSeq mRNA 227097 142099 . ? . ID=rna-LyesC2p008;Parent=gene-LyesC2p008;Dbxref=GeneID:3950473;gbkey=mRNA;gene=rps12;locus_tag=LyesC2p008
Generally the strand column has either + or - value
Can anyone help me how we can interpret this?
Ref 1.http://mblab.wustl.edu/GTF2.html 2.https://asia.ensembl.org/info/website/upload/gff.html#fieldshttps://www.ncbi.nlm.nih.gov/genome/7?genome_assembly_id=1878360
3.Reference genome : https://www.ncbi.nlm.nih.gov/genome/7?genome_assembly_id=1878360
Hi!
I have taken a quick look to the CDS annotations belonging to the given transcript ID with "?" strand. If you look at them, some of the CDSs are annotated with
+
and some with-
, see:My guess is that this is why the transcript is annotated with "?".