T-Cell Receptor Sequencing Clonotype Information
9.3 years ago

Hi,

I have T-cell receptor RNA sequencing data from several patients, collected before and after immunotherapy. The sequencing was done by a vendor, who provided us with one table of data per patient. Each row of these tables is a clonotype (as I understand it, a CDR3 sequence with a unique combination of V, D, and J segments) along with its count. Each row gives the CDR3 amino acid sequence and the V, D, and J names. The D segments are missing, which I believe should be fine. However, I found that some of the amino acid sequences are duplicated, and so are some of the V-J segment combinations. This confuses me, because each row should contain a unique clonotype. I am new to this type of data and may be misunderstanding it. Can someone please help me understand this data set?

Thank you,

- Pankaj

TCR-Sequencing-Clonotype • 17k views
8.8 years ago

My 5 cents:

[Figure: vdj]

  • The number of V-J pairs is limited; most TCR diversity comes from randomly added N-nucleotides.

  • Several nucleotide variants can code for the same amino acid sequence. This is called "convergent recombination". It is common for clonotypes with few or no N-nucleotides. Such clonotypes are close to germline, are frequently public (shared across many individuals), and are often specific to commonly encountered pathogens (CMV, EBV, etc.).

  • Clonotype should not be confused with "clone". The former typically refers to a single chain (TCR beta, IGH, ...), while the latter refers to the antigen receptor heterodimer (TCR alpha-beta, IG heavy-light chain).

    • Because TCR alpha is recombined after TCR beta and is less diverse, distinct TCR beta clonotypes are likely to correspond to distinct clones. The number of unique TCR beta nucleotide sequences is therefore a good measure of TCR repertoire diversity.
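
To make the convergent recombination point concrete, here is a minimal Python sketch. The nucleotide sequences and counts are made up for illustration and are far shorter than a real CDR3; a real table would be read from the vendor file. It translates each nucleotide sequence with the standard genetic code and compares the number of unique nucleotide clonotypes with the number of unique amino acid sequences.

```python
from collections import defaultdict
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt):
    """Translate an in-frame nucleotide string; trailing partial codons are ignored."""
    usable = len(nt) - len(nt) % 3
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


# Hypothetical (CDR3 nucleotide sequence, read count) rows, invented for this example.
rows = [
    ("TGTGCCAGCAGT", 10),
    ("TGCGCTAGCTCC", 3),
    ("TGTGCCAGCAGA", 5),
]

# Group the distinct nucleotide sequences under the amino acid sequence they encode.
by_aa = defaultdict(set)
for nt, _count in rows:
    by_aa[translate(nt)].add(nt)

n_nt = len({nt for nt, _ in rows})  # unique nucleotide clonotypes
n_aa = len(by_aa)                   # unique amino acid sequences

# Amino acid sequences encoded by more than one nucleotide variant.
convergent = {aa: sorted(nts) for aa, nts in by_aa.items() if len(nts) > 1}

print(n_nt, n_aa)
print(convergent)
```

In this toy input, two different nucleotide sequences translate to the same amino acid sequence, so counting at the amino acid level gives a smaller number than counting at the nucleotide level.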

This was very helpful. I have a couple of follow-ups: 1. Is there a reference for the figure and for the second bullet point? I would like to cite them in a paper we are working on. 2. How variable is the N region in terms of the number of nucleotides?


Hello!

9.3 years ago

The rows do contain distinct clonotypes; it's just that different clonotypes are not always different at the protein level. You're thinking in terms of amino acids, but the clonotypes are defined by their nucleic acid sequences, which is where the confusion comes from.
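
A quick way to check this on a table like yours is to count how often each amino acid sequence, each V-J pair, and each full (CDR3 amino acids, V, J) combination appears. The sketch below is only an illustration: the column names (`cdr3_aa`, `v`, `j`, `count`) and the rows are assumptions made up for the example, not your vendor's actual format.

```python
from collections import Counter

# Hypothetical rows; column names and values are invented for illustration.
rows = [
    {"cdr3_aa": "CASSLGQYF", "v": "TRBV7-9", "j": "TRBJ2-7", "count": 120},
    {"cdr3_aa": "CASSLGQYF", "v": "TRBV5-1", "j": "TRBJ2-7", "count": 8},
    {"cdr3_aa": "CASSPDRGYEQYF", "v": "TRBV7-9", "j": "TRBJ2-7", "count": 15},
    {"cdr3_aa": "CASSLGQYF", "v": "TRBV7-9", "j": "TRBJ2-7", "count": 2},
]

# How many rows share each amino acid sequence, V-J pair, or full combination.
aa_counts = Counter(r["cdr3_aa"] for r in rows)
vj_counts = Counter((r["v"], r["j"]) for r in rows)
key_counts = Counter((r["cdr3_aa"], r["v"], r["j"]) for r in rows)

# Combinations that still repeat after V and J are taken into account.
repeated_keys = {key: n for key, n in key_counts.items() if n > 1}

print(aa_counts.most_common(3))
print(vj_counts.most_common(3))
print(repeated_keys)
```

Repeated amino acid sequences or repeated V-J pairs on their own are expected. Rows that repeat even on the full combination would, under the definition above, be clonotypes that differ only in their nucleotide sequence, which you can confirm if the vendor also provides the CDR3 nucleotide column.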

Edit: If you haven't done so, read this paper.
