Question

Conclusions to draw from PCA analysis regarding mapping

1

Entering edit mode

9.2 years ago

Lisa Ha ▴ 120

Hey everyone,

I have performed a RNAseq experiment with two closely related species, for one of which I have an assembled transcriptome from previous sequencing. I tried various options for mapping both species to the existing transcriptome, once with CLC Genomics Workbench with looser parameters for cross-species mapping so that the mapping percentages were roughly equal (around 80%), and once in protein space with blat and default parameter for both species. I started exploring the datasets with the DESeq2 package in R and noticed that the PCA plots differ massively when performed for both species in regard to the mapping procedure. However, there is very little difference when looking at the species separately. When I plot the normalized values of random contigs I can also detect very few differences between the two procedures, so what exactly is happening here? And can I draw a conclusion from the PCA analysis in regard to which mapping procedure is better or is that not possible at all?

NB: the sequencing output was 50bp single-end reads, which I fear is not appropriate for blat mapping. All of the papers that use this method had at least 100 bp reads.

Thank you!

RNA-Seq R • 2.4k views

ADD COMMENT • link 9.2 years ago by Lisa Ha ▴ 120

score 0 · Answer 1 · 2015-09-25

What you're noticing is that there are likely a few genes that are highly divergent between the two species and therefore are only getting alignments appropriately with blat.

You can't make any alignment quality judgements from a PCA plot.

In general, I would expect blat to give a bit better results than NGS aligners, people just don't use it in this context because it's extremely slow in comparison. But of course if you've already spent the time to get things aligned with blat then I would probably use that, though it never hurts to find a few reads aligned differently (or only by blat) and see if they seem reasonable to you.

score 0 · Answer 2 · 2015-09-28

0

Entering edit mode

9.2 years ago

Lisa Ha ▴ 120

Thank you, Devon.

I will double-check the differently aligned reads to see what I can make of it. So you don't see a problem with using blat on short reads?

ADD COMMENT • link 9.2 years ago by Lisa Ha ▴ 120

0

Entering edit mode

I don't see any issues, but I've not done any benchmarking to be absolutely sure.

ADD REPLY • link 9.2 years ago by Devon Ryan 105k