Filtering mitochondrial genes in DESEq results
1
0
Entering edit mode
15 months ago

Hi all!

I took bulk RNAseq data done on leukaemia cells into R and I carried out deseq. I have three questions:

  1. I am confused about the limits people place on differentially expressed genes (DEGs). So far, I have only filtered for the adjusted p-value to isolate significantly differentiated genes. However, I don't know if I should impose a cut-off for the log2fold change?
  2. How do I map the ENSEMBL ids to the gene name?
  3. Is it possible to filter the DEGs to isolate mitochondrial genes? For example, taking a list such as a list of mitochondrial genes from MitoCarta, is it possible to parse that through R and isolate the results that correspond to the list?

I'm new to bioinformatics, so any help is appreciated!

DESEq2 RNA-seq differentially-expressed-genes • 1.1k views
ADD COMMENT
0
Entering edit mode
  1. Try a Volcano plot. You need thresholds on both p.adj and fold-change if you're looking for genes that are significantly differentially expressed AND you want to be confident in your results. p.adj will give you the latter but not the former.
  2. Search the forum and the web, there are many ways to do this. Your best bet is biomaRt - the R package.
  3. It should theoretically be possible. Again, biomaRt is your best bet. Look for a way to map ENS IDs to contig, all MT genes will map to chrM/MT contig.
ADD REPLY
0
Entering edit mode

1) I am confused about the limits people place on differentially expressed genes (DEGs). So far, I have only filtered for the adjusted p-value to isolate significantly differentiated genes. However, I don't know if I should impose a cut-off for the log2fold change?

It is better to set cut off/threshold for log2FC beside adjusted p-values to say your genes are DE in your test compared to control samples. People usually set log2FC value from -1/-2 to 1/2, but it is not universal criteria. You can chose threshold what is the best for your data.

2)How do I map the ENSEMBL ids to the gene name?

Please check in this link below. You can use biomart an R package for this.

How to use biomart on R to convert Ensembl Gene IDs to Symbols?

3)Is it possible to filter the DEGs to isolate mitochondrial genes? For example, taking a list such as a list of mitochondrial genes from MitoCarta, is it possible to parse that through R and isolate the results that correspond to the list?

Yes it is very easy in R. There are different ways of doing it. In one way you can make a list of mitochondrial genes and merge with your DESeq2 results and get your result.

ADD REPLY
0
Entering edit mode
15 months ago
bthom ▴ 10

1) I am confused about the limits people place on differentially expressed genes (DEGs). So far, I have only filtered for the adjusted p-value to isolate significantly differentiated genes. However, I don't know if I should impose a cut-off for the log2fold change?

There is no "rule" per se, however, most people will use an adjusted p-value of < 0.05 and a log2FC > |1|. The log2FC cut-off is applied to help reduce the DEG list to only the most biologically relevant --- this is similar to the cut-off that is applied to other experiments such as western blots or RT-qPCR.

3)Is it possible to filter the DEGs to isolate mitochondrial genes? For example, taking a list such as a list of mitochondrial genes from MitoCarta, is it possible to parse that through R and isolate the results that correspond to the list?

Yes, as @Ram said you can leverage the chrM/MT suffix and identify mitochondrial genes it with a filter(grepl()) command.

ADD COMMENT
0
Entering edit mode

chrM/MT suffix

What are you talking about?

identify mitochondrial genes it with a filter(grepl()) command.

What on earth are you going on about?

Also, two posts addressing all 3 points are added as comments. How does your post that addresses 2/3 of the questions become an answer?

ADD REPLY

Login before adding your answer.

Traffic: 1931 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6