Hello everyone,
Recently, I downloaded this table:
https://journals.plos.org/plosgenetics/article?id=10.1371/journal.pgen.1001154#s5
But I realized there are 115/18,931 gene symbols which are duplicated (or repeated several times some of them).
I was wondering what is the best way to proceed.
Thank you in advance.
Francisco Requena
How are the chromosomal locations of those repetitive gene symbols (same or different location) ?
And for your statement "I was wondering what is the best way to proceed", it is impossible to answer unless you explain what you like to do with those genes.
Hello! Thank you for your fast reply. I have checked their locations and they are distributed across the genome. This score (along with others) will be displayed in a software tool for clinician use. Since there are genes duplicated, if the user searches for any of those genes, it will be displayed two rows (with the same information but the HI score different)
I think that EagleEye was asking if, given any duplicate pair of gene symbols, do they have the same genomic co-ordinates? Also, can you provide an example of such a gene symbol pair?
First, you didn't link to a table but to the list of supplementary material of the paper. Second, there are two tables there and both have fewer than 18000 lines (so presumably fewer gene symbols) and don't appear to have duplicated gene symbols. Could it be that you're talking about another data set or paper?