Question

Forum:Premade RNA-seq pipeline or custom

0

Entering edit mode

6 months ago

noodlejackson ▴ 40

Hi everyone,

In this great video by OMGenomics linked below, Maria said when writing CVs that if you say you made a custom RNA-seq pipeline, say why you made one and didn't use a ready-made pipeline from elsewhere.

I made a custom one because that's how I learned. However, is it common to use a pre-made one, are there common ones that are used? Where would you find them?

Thanks!

rna-seq pipeline • 1.3k views

ADD COMMENT • link updated 6 months ago by i.sudbery 21k • written 6 months ago by noodlejackson ▴ 40

1

Entering edit mode

Basically all I know today about bash, nextflow and containers I learned by writing my own pipelines. Ask yourself whether you can afford it in terms of spending time on writing pipelines versus getting other things done. It's a valuable learning experience.

ADD REPLY • link 6 months ago by ATpoint 88k

1

Entering edit mode

I learnt a lot of what I know the same way. Its always been my philosophy to do things the hard, but self improving way, if thats possible.

ADD REPLY • link 6 months ago by i.sudbery 21k

3

Entering edit mode

6 months ago

Pierre Lindenbaum 166k

However, is it common to use a pre-made one, are there common ones that are used?

https://nf-co.re/rnaseq/3.14.0/

ADD COMMENT • link 6 months ago by Pierre Lindenbaum 166k

3

Entering edit mode

6 months ago

dsull ★ 7.6k

I like writing my own CUSTOM pipelines so I can CUSTOMize it haha.

There are too many things I need to do with the data I have that can't be accomplished with pre-made pipelines.

If it's just doing some standard analysis, a pre-made pipeline works great.

ADD COMMENT • link 6 months ago by dsull ★ 7.6k

1

Entering edit mode

6 months ago

colindaven 7.7k

nf-core rnaseq is the most popular nf-core pipeline https://nf-co.re/pipelines/ so I would say it is common to use a pre-built one. The wheel has been rebuilt many times though. You can always play with settings and the tools within to learn more about the whole process. If you have the time and interest, sure, build your own, but it is unlikely to be as good or well optimised as well maintained community pipelines.

ADD COMMENT • link 6 months ago by colindaven 7.7k

1

Entering edit mode

6 months ago

Istvan Albert 102k

When it comes to skills that go into a CV:

You should learn how to run various pipeline tools even if you do not use them yourself.

In your CV, you should mention that you can do both: create a custom pipeline and run existing workflow engines.

Remember, when you apply for a job, you are marketing your skills; both are valuable skills.

ADD COMMENT • link 6 months ago by Istvan Albert 102k

score 3 · Accepted Answer · 2025-01-07

I think that RNA-seq pipelines for your bog-standard bulk experiment that are aimed at differential gene expression are so simple, that a) there is no great advantage in using a pre-built pipeline, and b) its no great demonstration of skill to say you've built your own.

Our standard RNA-seq pipeline is FastQC -> Cutadapt/Trimmomatic (depending on FastQC output) -> Salmon. Pretty much anyone can implement that, and I'm not sure what you gain from using NF-core in that situation. The skill in these simple RNAseq experiments is 1. interpreting QC metrics 2. Doing the downstream statistical analysis. Neither of which is really automatable.

If things are more complex, a prebuilt pipeline can be more helpful if there is one that does what you want, but I'd guess that despite all the complexities and options, when it actaully came to doing something out ot the ordinary, even something like the NF-core RNAseq pipeline wouldn't do quite precisely what you need.

For my first 4 years as a bioinformatician I only ran mapping through using a very complex, pre-built pipeline (built by my supervisor in this case). That meant that when I left, I didn't actaully know how to run STAR or Bowtie without it, which I think is a bad thing.

Note, that I'm not saying pipelines, built using proper workflow engines are a bad thing, or that you shouldn't use them or build them. Particularly if you have substantial number of samples. Just that you don't gain any advantage useing a complex all singing all dancing prebuilt one like NF-core.