Entering edit mode
3 days ago
Naila
•
0
Hi what I wanted to do is have the different Wong conditions (14) along the top and then inset the log fold values underneath. I also have joined the GS(gene symbols) from both data sets. Its not working and I don't know why, please can anyone help?
library(readr)
library(dplyr)
library(readxl)
# 1. Load the Reshaped CSV (renamed)
file_path <- "C:/Users/ns01288/Documents/tarney_wong_joined_logfc_renamed.csv"
tryCatch({
tarney_df <- read_csv(file_path)
}, error = function(e) {
stop(paste("Error reading renamed CSV file:", e))
})
# 2. Load the Wong Excel Data
excel_path <- "C:/Users/ns01288/OneDrive - University of Surrey/data using disertation/Total_Proteome_wong_et_al_Tarney_et_al.xlsx"
tryCatch({
wong_df <- read_excel(excel_path, sheet = "Wong_et_al_Total_Proteome")
}, error = function(e) {
stop(paste("Error reading Wong Excel file:", e))
})
# 3. Join Based on Gene (GS)
combined_df <- tarney_df %>%
left_join(wong_df, by = "GS", suffix = c(".tarney", ".wong"))
# 4. Replace Tarney "n/a" with Wong Data
for (col in colnames(wong_df)[-1]) { # Iterate over Wong condition columns
combined_df$TP_Tarney_et_al_logFC_LGSOCvsTube <- ifelse(
combined_df$TP_Tarney_et_al_logFC_LGSOCvsTube == "n/a",
combined_df[[paste0(col, ".wong")]],
combined_df$TP_Tarney_et_al_logFC_LGSOCvsTube
)
}
# 5. Select the desired columns, keeping Tarney data separate, and adding Wong data as extra columns.
final_df <- combined_df %>%
select(GS, TP_Tarney_et_al_logFC_LGSOCvsTube, colnames(wong_df)[-1])
# 6. Write to CSV
new_file_path <- "C:/Users/ns01288/Documents/tarney_wong_joined_logfc_combined_renamed.csv"
tryCatch({
write_csv(final_df, new_file_path)
print(paste("Combined renamed data saved to:", new_file_path))
}, error = function(e) {
stop(paste("Error writing CSV file:", e))
})
This what the csv file look like