How to check if a sample has been mixed with another sample based on NGS sequencing data
1
1
Entering edit mode
4.9 years ago
katjanjarosz ▴ 10

Hi. I suspect that the sample has been contaminated with another sample. Do you know how to check coverage between one sequenced data with another and see if one is in the other? Thanks

genome next-gen sequencing alignment sequence • 1.5k views
ADD COMMENT
1
Entering edit mode

how to check coverage between one sequenced data with another and see if one is in the other

Why the coverage? Unless you are expecting contamination from another species, the only way is to look at called variants.

ADD REPLY
0
Entering edit mode

Are the samples from the same genome or different? If they are from the same genome it would be impossible to check contamination/coverage unless you had UMI (unique molecular indexes) on your reads. If samples are from different genomes then how different are they (e.g. yeast/human)?

ADD REPLY
0
Entering edit mode

I have sequencing data from multiple strains of Saccharomyces cerevisiae. We think that one strain contaminated another (both strains were sequenced).

ADD REPLY
0
Entering edit mode

Then you need to see if there are variants from your suspected contaminant showing up with low allele frequencies in the sample which you suspect is contaminated.

ADD REPLY
0
Entering edit mode

I would take the experiment and PCA plot it, if this one sample looks weird I'll remove it.

ADD REPLY
1
Entering edit mode
4.9 years ago
tshtatland ▴ 190

Use variant allele fractions to detect contamination. See, for example:
Same-Species Contamination Detection with Variant Calling Information from Next Generation Sequencing. Tao Jiang, Martin Buchkovich, Alison Motsinger-Reif, bioRxiv 531558; doi: https://doi.org/10.1101/531558
Posted January 26, 2019.
https://www.biorxiv.org/content/10.1101/531558v1

ADD COMMENT

Login before adding your answer.

Traffic: 1865 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6