Question

Gene copies vs. gene paralogs - what's the difference

1

Entering edit mode

7.8 years ago

liorglic ★ 1.5k

Hi all,

I'm trying to get into the theory and practice of gene copy number variation (CNV) analysis, but there is something basic confusing me, which I couldn't yet figure out. Sorry if this is a dumb/trivial question - would appreciate your help anyway.

My confusion is regarding the terms 'gene copy' and 'paralogs'. As far as I understand, paralogs are created when a gene undergoes duplication (never mind by what molecular mechanism), and then starts accumulating mutations as evolution proceeds. So, if gene X was duplicated to create another X, and then X changed to become X', are X and X' considered copies of the same gene, or are they paralogs? Is it a matter of applying some threshold on the sequence similarity between X and X', so they are considered copies up to the point where they diversify enough? Or maybe gene copies are expected to be perfect duplicates? If so, I'd guess that finding such gene pairs is very rare... Maybe it's a matter of function, so once X' gets a different function from X (neo-functionalization), it is considered a paralog? This is a rather complex and difficult to measure definition...
To make things more clear, I'm interested in CNV analysis in the context of whole genome sequence data (not older technologies such as CGH), if that matters.

Could anyone clarify this point for me, or refer me to relevant literature? Thanks a lot!

CNV paralogs WGS • 3.7k views

ADD COMMENT • link updated 7.7 years ago by Emily 24k • written 7.8 years ago by liorglic ★ 1.5k

0

Entering edit mode

Maybe look at biology SE?

ADD REPLY • link 7.8 years ago by Ram 45k

0

Entering edit mode

In very simple terms, a gene copy is still ‘the same gene’, it is just simply, a copy. A paralog may no longer be considered the same gene however, if it has sufficiently drifted since duplication.

In your nomenclature, you could perhaps think of it as Gene X copies and then there is Gene X1 and X2. Eventually X2 might turn in to X’ which has gone on to acquire a new function. I would personally say this is no longer a copy.

ADD REPLY • link 7.8 years ago by Joe 22k

score 2 · Answer 1 · 2018-01-10

2

Entering edit mode

7.7 years ago

Emily 24k

I would say that a paralogue pair is found in that copy number in most individuals of that species. A copy is found in different copy numbers in different individuals.

ADD COMMENT • link 7.7 years ago by Emily 24k

score 0 · Answer 2 · 2018-01-07

I'd say that you are mostly right, and having a gene copied can lead to a paralog gene to be stably included somewhere in the genome. But for dose-sensitive genes having an extra copy is often not tolerated and will cause disease, a pathogenic copy number variant. Paralogs can only become a stable part of the genome if they're tolerated. A "fresh" copy number event is most often a perfect duplicate but will acquire mutations. A paralog which acquired mutations might lose function and become a pseudogene, or get a new function. In the latter case, it's typically no longer considered a copy of gene X, but having the same ancestral gene. Or something like that.