Hello everyone,
Initially I express that I am not very expert in bioinformatics analysis.
I have the TMP from RNAseq data. These data come from Arabidopsis seeds infected with a fungal inoculum. I select the data by calculating the zscore. Thanks to the tutorials and the forums I have managed to run the code. However, I suspect that there is a problem that I need to solve and I don't know how to do it.
datExpr <- read.csv("TPM_Zscore_Ready.csv", header=TRUE, sep=',')
colnames(datExpr)
row.names(datExpr) = datExpr$Genes
datExpr$Genes = NULL
datExpr = as.data.frame(t(datExpr))
dim(datExpr)
my data (imput) looks like this
Genes
AT1G04990.6
AT3G24180.1
AT4G35510.1
AT2G24420.1
AT5G08450.2
AT5G07970.1
AT5G11530.1
AT4G29380.1
AT5G41350.1
AT3G14270.2
So, I don't know how to properly remove the numbers after the 'dot'. I understand that this is a product of splicing and I assume that it is not just removing them arbitrarily. I was reading about gsub function but I am don't understand very well...
For do the enrichGO analysis, the developers of the clusterprofiler package helped me,
# creating universe: Universe are all expressed genes from datExpr. They will be the background
df <- read.csv("TPM_Zscore_Ready.csv", header=TRUE, sep=',')
colnames(df)
universe = df [1]
write.csv(universe, file = "GO_enrichment/universe.csv")
# My universe is ready to use in ORA
universe<-as.character(universe[,1])
universe <- gsub("\\..*", "", universe)
universe <- sort(universe, decreasing = TRUE)
# loading DEGs
#After, in excel I will create a file with all the genes of the selected modules and then I will load them for the GO analysis
DEGs <- read.delim("GO_enrichment/DEGs.txt", header = TRUE)
deg_list <- vector("list", ncol(DEGs))
names(deg_list) <- colnames(DEGs)
for (i in 1:ncol(DEGs)) {
gene <- unique(DEGs[, i])
gene <- gsub("\\..*", "", gene)
deg_list[[i]] <- gene
}
however for the networks to export in cytoescape and for the gene filter the problem persists. I really appreciate if someone can guide me towards the correction.
Thank you very much
Tutorial:How To Ask Good Questions On Technical And Scientific Forums
you have the
gsub
line that removes the dot, so what problem still persists?So sorry Istvan Albert I did not clearly describe the problem. The gsub in 'univero' had no problems, I was just trying to give an example but it caused confusion.
Thank you very much!