RNA-seq of COVID-19

4.2 years ago

yoshifumimiya ▴ 50

I am a beginner in RNA-seq.

I am studying it because I want to perform RNA-seq analysis of COVID-19.

I downloaded the SRA file. Then the complementary sequence of the genomic sequence was created with Biopython. Using it as the reference, indexing and quant were performed with salmon.

Then, the mapping rate was 0.222043%, which was very low. What is the problem? I know I still have a lot to learn, but any comments would be helpful.

RNA-Seq • 1.1k views

"I downloaded the SRA file."

Which one?

"And then, the complementary sequence of the genomic sequence was created with biopython"

Why didn't you use the genome and annotation from NCBI?

4.2 years ago by JC 13k

For the mapping, you can just use the normal +RNA genome of SARS-CoV-2 without any further manipulation of the reference data. It's not clear from your question whether you included the human transcripts in your salmon index. This could explain the low mapping rates.

Edit: Or maybe, if the paper used Vero cells (not human) and you used the human transcriptome in the reference, this low mapping rate could also happen.
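As a rough illustration of the point about what goes into the index, here is a minimal sketch that concatenates host transcripts and the unmodified viral genome into one reference FASTA and composes the matching salmon commands. The file names and the helper names `concat_fasta` and `salmon_commands` are hypothetical, and paired-end reads are an assumption, since the thread does not say how the original index was built. The options used (`salmon index -t/-i`, `salmon quant -i/-l/-1/-2/-o`, with `-l A` to let salmon infer the library type) are standard salmon options.

```python
def concat_fasta(inputs, output):
    """Concatenate FASTA files as-is (no complementing) into one reference."""
    with open(output, "w") as out:
        for path in inputs:
            last = "\n"
            with open(path) as src:
                for line in src:
                    out.write(line)
                    last = line
            # Make sure the next file's header starts on its own line.
            if not last.endswith("\n"):
                out.write("\n")
    return output


def salmon_commands(reference, index_dir, reads_1, reads_2, out_dir):
    """Build (but do not run) salmon index/quant commands for paired-end reads."""
    index_cmd = ["salmon", "index", "-t", str(reference), "-i", str(index_dir)]
    quant_cmd = [
        "salmon", "quant", "-i", str(index_dir), "-l", "A",
        "-1", str(reads_1), "-2", str(reads_2), "-o", str(out_dir),
    ]
    return index_cmd, quant_cmd


# Hypothetical usage (file names are placeholders, not from the thread):
# ref = concat_fasta(["host_transcripts.fa", "NC_045512.2.fa"], "combined.fa")
# index_cmd, quant_cmd = salmon_commands(ref, "combined_index",
#                                        "reads_1.fastq.gz", "reads_2.fastq.gz",
#                                        "quant_out")
# import subprocess; subprocess.run(index_cmd, check=True)
```

Which host transcriptome to include depends on the cells used in the experiment, so it should match the species of the sample.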

4.2 years ago by caggtaagtat ★ 1.9k

Thanks for your valuable comments. As advised, I'll try using the normal +RNA genome of SARS-CoV-2.

4.2 years ago by yoshifumimiya ▴ 50

Thanks for your kind comments.

I downloaded the SRA file from https://trace.ddbj.nig.ac.jp/DRASearch/study?acc=SRP262058. I didn't understand the reference to NCBI, so I will look it up there.

4.2 years ago by yoshifumimiya ▴ 50

That data is not COVID; it's from human cells. Check the full description at https://www.ncbi.nlm.nih.gov/gds/?term=SRP262058

The reference genome I mean is https://www.ncbi.nlm.nih.gov/nuccore/NC_045512.2
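For anyone who prefers to fetch that record from a script, the sketch below builds an NCBI E-utilities `efetch` URL for the accession and saves the FASTA. The helper names `efetch_fasta_url` and `download_fasta` and the output file name are hypothetical; the download itself needs network access, so it is left as a commented-out call.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

EFETCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def efetch_fasta_url(accession):
    """Return the efetch URL that serves a nucleotide record as FASTA text."""
    params = {"db": "nuccore", "id": accession, "rettype": "fasta", "retmode": "text"}
    return EFETCH + "?" + urlencode(params)


def download_fasta(accession, out_path):
    """Download the FASTA record to out_path (requires network access)."""
    with urlopen(efetch_fasta_url(accession)) as resp, open(out_path, "wb") as out:
        out.write(resp.read())
    return out_path


# Hypothetical usage:
# download_fasta("NC_045512.2", "NC_045512.2.fa")
```

The downloaded sequence can be used as the reference directly, without complementing it first.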

4.2 years ago by JC 13k

Thanks for your comment. I'll try to analyze with that data.

4.2 years ago by yoshifumimiya ▴ 50

"Mapping rate was 0.222043%, which was very low."

If host sequences are present in the file (very likely), then the mapping rate to the SARS-CoV-2 genome may be very low (0.2% sounds extreme).
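To make the number concrete: the mapping rate is the mapped fragments divided by the processed fragments, times 100. The sketch below computes it and, as an assumption about salmon's output layout, reads the counts from `aux_info/meta_info.json` using the keys `num_mapped` and `num_processed`. Check those names against the salmon version in use; the helper names are hypothetical.

```python
import json
from pathlib import Path


def mapping_rate(num_mapped, num_processed):
    """Percentage of processed fragments that were mapped."""
    if num_processed <= 0:
        raise ValueError("num_processed must be positive")
    return 100.0 * num_mapped / num_processed


def rate_from_salmon(out_dir):
    """Read counts from a salmon output directory (assumed file and key names)."""
    meta_path = Path(out_dir) / "aux_info" / "meta_info.json"
    meta = json.loads(meta_path.read_text())
    return mapping_rate(meta["num_mapped"], meta["num_processed"])


# Made-up counts for illustration only: 22,204 mapped out of 10,000,000
# processed fragments gives a rate of about 0.22%.
example_rate = mapping_rate(22_204, 10_000_000)
```

If most reads come from the host, only the small viral fraction can map to a virus-only reference, so a low percentage does not by itself mean the alignment failed.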

4.2 years ago by GenoMax 147k

Thanks for your kind comment.

You pointed out that host sequences may be included. I will verify with different SRA data.

4.2 years ago by yoshifumimiya ▴ 50
