Suggestions for RNA seq pipeline
2
3
Entering edit mode
9.6 years ago
sangita_b ▴ 90

Hi

I am currently analysing RNA-SEQ data (differential expression). I have 20 samples in total: 5 conditions in 2 donors and each donors- each donor has 2 repeats (cell type- human bronchial epithelial cells).

I have 24fastq files per sample- these have been QC'd in fastqc and run through tophat to produce 1 BAM file per sample.

The BAM files were then run through cufflinks for transcript assembly (gtf files).

I have merged all gtf files in cuffmerge and am not sure whether to use cuffdiff or cuffnorm.

Could I ask if the pipeline I have used so far is correct and also ask for advice how to calculate differentail expression.

Thanks
Sangita

RNA-Seq • 3.9k views
ADD COMMENT
0
Entering edit mode

Do you have annotations available for organism of your interest ?

ADD REPLY
0
Entering edit mode

Hi, yes i do.

ADD REPLY
0
Entering edit mode

Solution given by Tom below would work best.

ADD REPLY
5
Entering edit mode
9.6 years ago
Tom ▴ 240

I do have bad experience using cuffdiff. For human samples and standard differential expression, I personally would recommend you

  • Download reference genomes, gtfs from iGenome project
  • Align with STAR
  • Summarization with htseq-count
  • Differentially gene expression with DESeq2 or edgeR
ADD COMMENT
3
Entering edit mode
9.6 years ago

You have many different choices

There is an international consortium that has published a paper comparing several RNA-Seq packages. It run into my attention that they recommend one named BitSeq running under R. It seems to be giving better output that some other packages.

So I downloaded the package, and gave a look to the vignette (a PDF document that can be found in the bioconductor web page containing instructions, tutorials, etc)

It turn out that BitSeq runs using SAM files. With samtools view you can easily convert your BAM files to SAM files. And I believe you don't need to extract the counts.

Now.. If you have the BAMs, have R, download BitSeq and follow the vignette instructions, you can with a little help or without that help to do your RNA-Seq analysis with the most recommended package according the consortium

ADD COMMENT
0
Entering edit mode

Hi Antonio.

Have you used BitSeq? Do you know if it is possible to do DE analysis with a paired-sample design? I am considering using BitSeq and had a quick look at the vignette but I am not sure I can use this kind of design.

Thanks,
Maria

ADD REPLY
0
Entering edit mode

Look for details in this publication about BitSeq in Bioinformatics

In the method section you can see you can use single-end or paired-end reads

ADD REPLY
0
Entering edit mode

Thanks for the link. Sorry if it was a bit confusing, but what I meant was that I have paired/matched samples: patient 1 control and patient 1 treatment; patient 2 control and patient 2 treatment... and so on.

In edge R I can specify in the design matrix this factor, but couldn't find if or how BitSeq can incorporate this in the test.

ADD REPLY

Login before adding your answer.

Traffic: 1812 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6