hello everyone i retrieved raw unmapped data from GEO and mapped them using python wrapper kb for kallisto bustools and it shows this message :
WARNING [count] 22588 gene IDs do not have corresponding valid gene names. These genes will use their gene IDs instead.(how to deal with this, i read their paper and they didn’t address dealing with nameless gene ids)
this number varies whether i used the default reference and index or i created my own.
these are the commands i used : mapping and generating the matrix command :
time kb count --h5ad -i index.idx -g t2g.txt -x 10xv3 -o output --filter bustools -t 8 -m 7G SRR28245473_1.fastq SRR28245473_2.fastq
using default index :
kb ref -d human -i index.idx -g t2g.txt -f1 transcriptome.fasta
creating ur index :
kb ref -i transcriptome.idx -g transcripts_to_genes.txt -f1 cdna.fa Homo_sapiens.GRCh38.dna_sm.primary_assembly.fa Homo_sapiens.GRCh38.112.gtf
thank you so much but how to deal with them in downstream analysis ?
Just use them as if they are any other gene (they are the same as every other gene, we just don't have official names for them).