Hi everyone, I'm new to the field of comparative analysis. May I know the difference between comparative genome analysis and comparative genomics? Are they similar?
Apart from that, I learnt that researchers sometimes will do multiple sequence alignment (MSA) on the sequences and find difference between the sequences. May I know what techniques or methods the researchers used to identify the differences among the aligned sequences? I tried to find the methods from the published articles but many similar articles never stated on how they find out the differences, any experts/professionals that knew bout this can explain to me about this? actually I'm kinda lost bout this *sob sob*
Thank you in advanced for all the explanations and I will appreciate all the responses. :)))
First of all this is a very general question or approach. It is a huge area. you can compare many part of genomes. briefly, the two statements intend to find similar/dissimilar part of genomes. For instance, you can compare; 1. genome structures: coding/non-coding part, length, same elements, size etc. 2. gene structure: number and size of genes, introns, axons. 3. evolution of some part of genomes: gene duplications, gene family expansion/contractions or gain/lost
@techniques or methods the researchers used to identify the differences among the aligned sequences? ; They do , as you say, multiple sequence alignment by using some tools such as mafft, muscle. They select, for instance, a gene which is found in genomes of species that they are interested in, and they use those tools whether any conserved sites, indels (insertion/deletions) in the gene. Lets say several amino acids of the gene may have been evolved in a species, and the evolved part (different amino acids) may be involved in a specific motif that is unique to that gene, meaning the key part of the gene for its function. If you want to answer a scientific question which is related to or based on evolution of a gene, you may further need to do selection analyses (positive selection or relaxed selection) that may have occurred on some specific amino acid sites of the gene.
Oh I see. Thank you for the explanations. :) May I know which tools they will be using to search for conserved sites and indels? Because this is the thing I'm searching for my project, and it's quite hard to find as some articles did not stated about this at all (tools or techniques they used to search the conserved sites and indels).
A short description:
https://en.wikipedia.org/wiki/Conserved_sequence
several papers describing/using tools for indel detection.
https://bmcbioinformatics.biomedcentral.com/articles/10.1186/1471-2105-14-S11-S2
https://bmcbioinformatics.biomedcentral.com/articles/10.1186/1471-2105-10-S11-S7
http://meme-suite.org
https://softgenetics.com/NextGENe_01.php
https://europepmc.org/article/pmc/pmc6735753
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3530679/
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0218862#sec002
https://bmcgenomics.biomedcentral.com/articles/10.1186/s12864-018-4671-4
https://academic.oup.com/nar/article/27/19/3899/1057228
Thank you so much for sharing these links and all of them are informative, really appreciate it a lot!