Metatranscriptome pipeline: taxonomy and functional recovery

11 months ago

l.gallucci ▴ 20

Hi all,

I'm currently trying to build a pipeline for microbial transcriptome data (Illumina 2x150 bp).

So far my work has consisted of standard QC and trimming with Trimmomatic.

Then I moved on to SortMeRNA to separate rRNA from non-rRNA reads. I'm a bit confused about how to proceed from here to get taxonomy and functional genes. I ran phyloFlash (on the trimmed reads, before SortMeRNA), but that step is not really clear to me either. I get a SPAdes assembly and a taxonomy, but I'm a bit lost on how to extract the taxonomy for plotting outside the phyloFlash HTML report.

metatranscriptome pipeline phyloflash • 459 views

updated 11 months ago by jeevii • written 11 months ago by l.gallucci

Hello,

I would suggest aligning your trimmed reads to a non-coding RNA database such as Rfam, keeping the reads that do not map, assembling those, and then running gene prediction on the assembly.
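As a rough illustration of the "get the unmapped reads" step, here is a minimal Python sketch. It assumes the alignment against the non-coding database was written as SAM text by whatever aligner you use, and that the reads are paired. The function name `unmapped_pair_names` is made up for this example. It relies only on the standard SAM FLAG bits (0x4 = read unmapped, 0x8 = mate unmapped) and keeps a read name only when both mates failed to align.

```python
def unmapped_pair_names(sam_lines):
    """Return the names of read pairs in which neither mate aligned.

    sam_lines: iterable of SAM text lines (header lines start with '@').
    Hypothetical helper for illustration, not part of any tool.
    """
    READ_UNMAPPED = 0x4   # SAM FLAG bit: this read is unmapped
    MATE_UNMAPPED = 0x8   # SAM FLAG bit: the mate is unmapped

    names = set()
    for line in sam_lines:
        if line.startswith("@"):
            continue  # skip header lines
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 2:
            continue  # skip blank or malformed lines
        flag = int(fields[1])  # column 2 of a SAM record is the FLAG
        if flag & READ_UNMAPPED and flag & MATE_UNMAPPED:
            names.add(fields[0])  # column 1 is the read (query) name
    return names
```

The returned names can then be used to pull the matching records out of the trimmed FASTQ files before assembly. If your aligner can write unaligned reads directly, that is usually simpler than post-filtering.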

For taxonomic classification, you can classify the predicted gene sequences against the NT database using Kraken2 or Centrifuge. For functional annotation, you can search them against UniProt protein sequences or the NR database.
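For plotting the taxonomy outside of any HTML report, one option is to turn a classifier report into a plain table. The sketch below assumes a standard six-column, tab-separated Kraken2 report (percent, clade reads, direct reads, rank code, taxid, indented name), i.e. one produced without the extra minimizer columns. The function names and the choice of genus level (`"G"`) are just assumptions for the example.

```python
import csv


def kraken_report_to_rows(report_lines, rank_code="G"):
    """Collect one dict per taxon at the requested rank from a Kraken2 report.

    Assumes the standard six tab-separated columns; other lines are skipped.
    """
    rows = []
    for line in report_lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) != 6:
            continue  # not a standard report line
        percent, clade_reads, _direct_reads, rank, taxid, name = fields
        if rank == rank_code:
            rows.append({
                "taxid": int(taxid),
                "name": name.strip(),  # names are indented by tree depth
                "clade_reads": int(clade_reads),
                "percent": float(percent),
            })
    return rows


def write_rows_csv(rows, handle):
    """Write the rows to an open text handle as CSV with a header line."""
    writer = csv.DictWriter(
        handle, fieldnames=["taxid", "name", "clade_reads", "percent"]
    )
    writer.writeheader()
    writer.writerows(rows)
```

The resulting CSV can be loaded into R, pandas, or a spreadsheet for bar plots. Changing `rank_code` to `"P"` or `"S"` would give phylum or species level instead.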

Hope it helps!

written 11 months ago by jeevii
