Forum:Premade RNA-seq pipeline or custom
5
0
Entering edit mode
2 days ago

Hi everyone,

In this great video by OMGenomics linked below, Maria said when writing CVs that if you say you made a custom RNA-seq pipeline, say why you made one and didn't use a ready-made pipeline from elsewhere.

I made a custom one because that's how I learned. However, is it common to use a pre-made one, are there common ones that are used? Where would you find them?

Thanks!

rna-seq pipeline • 386 views
ADD COMMENT
0
Entering edit mode

Basically all I know today about bash, nextflow and containers I learned by writing my own pipelines. Ask yourself whether you can afford it in terms of spending time on writing pipelines versus getting other things done. It's a valuable learning experience.

ADD REPLY
0
Entering edit mode

I learnt a lot of what I know the same way. Its always been my philosophy to do things the hard, but self improving way, if thats possible.

ADD REPLY
3
Entering edit mode
2 days ago

I think that RNA-seq pipelines for your bog-standard bulk experiment that are aimed at differential gene expression are so simple, that a) there is no great advantage in using a pre-built pipeline, and b) its no great demonstration of skill to say you've built your own.

Our standard RNA-seq pipeline is FastQC -> Cutadapt/Trimmomatic (depending on FastQC output) -> Salmon. Pretty much anyone can implement that, and I'm not sure what you gain from using NF-core in that situation. The skill in these simple RNAseq experiments is 1. interpreting QC metrics 2. Doing the downstream statistical analysis. Neither of which is really automatable.

If things are more complex, a prebuilt pipeline can be more helpful if there is one that does what you want, but I'd guess that despite all the complexities and options, when it actaully came to doing something out ot the ordinary, even something like the NF-core RNAseq pipeline wouldn't do quite precisely what you need.

For my first 4 years as a bioinformatician I only ran mapping through using a very complex, pre-built pipeline (built by my supervisor in this case). That meant that when I left, I didn't actaully know how to run STAR or Bowtie without it, which I think is a bad thing.

Note, that I'm not saying pipelines, built using proper workflow engines are a bad thing, or that you shouldn't use them or build them. Particularly if you have substantial number of samples. Just that you don't gain any advantage useing a complex all singing all dancing prebuilt one like NF-core.

ADD COMMENT
3
Entering edit mode
2 days ago

However, is it common to use a pre-made one, are there common ones that are used?

https://nf-co.re/rnaseq/3.14.0/

ADD COMMENT
3
Entering edit mode
2 days ago
dsull ★ 7.1k

I like writing my own CUSTOM pipelines so I can CUSTOMize it haha.

There are too many things I need to do with the data I have that can't be accomplished with pre-made pipelines.

If it's just doing some standard analysis, a pre-made pipeline works great.

ADD COMMENT
1
Entering edit mode
2 days ago

nf-core rnaseq is the most popular nf-core pipeline https://nf-co.re/pipelines/ so I would say it is common to use a pre-built one. The wheel has been rebuilt many times though. You can always play with settings and the tools within to learn more about the whole process. If you have the time and interest, sure, build your own, but it is unlikely to be as good or well optimised as well maintained community pipelines.

ADD COMMENT
1
Entering edit mode
1 day ago

When it comes to skills that go into a CV:

You should learn how to run various pipeline tools even if you do not use them yourself.

In your CV, you should mention that you can do both: create a custom pipeline and run existing workflow engines.

Remember, when you apply for a job, you are marketing your skills; both are valuable skills.

ADD COMMENT

Login before adding your answer.

Traffic: 2809 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6