I am interested in detecting the type of virus infection I may have in my human samples using available gene expression data from the same samples. Like; You have this signature, suggesting that your sample may be infected with virus X.
Does anyone have an idea how I best could do this?
I am afraid that what I have been doing so far merely suggests that I may have an infection, but doesn’t say much on the type.
Thank you for your input!
I only have the human expression data....
do u mean u have only expression values or sequencing data ? When you say "type of virus infection I may have in my human samples", you will have some viral sequences in your data, which could be detected by aligning them to viral genomes.
The expression data i have is not the raw data, it has already been mapped to the human genome. I have only the expression values for human genes.
Yes, but you sequenced the mRNA within that sample. Depending on a number of factors, present viral material (mRNA, genomes, and so on) might have been picked up during sequencing. So you might have both human and viral reads, to find them you'll want to BLAST your reads against a viral genome database, as Goutham said.
With that said, depending on the sequencing approach, sample and what virus, there is large variability in how much and how well you'll detect virus. Huge. You also have to consider the difference between a pathogen and something present in the host microbiome. I would explore using PCR in your human sample to confirm, there are other methods 'wet' methods you might want to explore.