I have a BAM file which I have filtered to contain only mapped pairs of duplicates. I want to find out the gene names that my reads are covering. What should the next step in my workflow be?
As you said in the comment, you got a hg38 reference genome, I can suggest you to download the gtf or gff of this genome ( https://www.gencodegenes.org/releases/current.html ). It will be usefull for the last part.
Do you have a genome annotation (generally in the GTF or GFF3 formats)? If you do, you can count reads mapping to each gene using featureCounts.
Thank you. I mapped against hg38. Will this work?
Yes, that should be fine. Just download a GTF (preferably from the same source as your genome fasta) for featureCounts.
Thank you very much. I will give this a try.
Is your BAM file mapped against a reference? Is it one of the commonly available genomes?
I mapped it against hg38.