How Do I Get Started Working With RNA-Seq Data?
6
10.8 years ago
HNK ▴ 150

Hey everyone, I have to start working on RNA-seq and I am totally new to this approach. I have to work on data provided by the neurological department. The data set has 96 samples (reads in FASTQ files), derived from formalin-fixed paraffin-embedded (FFPE) tissue. I have to determine somatic variation, gene expression, SNVs and fusion genes between subgroups from the RNA-seq data. Can anyone help me out with how I should start my work? How do I analyse the RNA-seq and cancer genome data?

rna-seq gene-expression • 18k views
3

Welcome to Biostar! This is not a great question, as there are many guides to RNA sequence analysis online just a search away. I'd recommend finding one, starting to follow it, and if you get stuck, then come back and ask specific questions. You're more likely to get useful responses that way. Look at Section 6 here for more details. (Do your homework before posting) http://www.ploscollections.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1002202;jsessionid=A2C2B677241104800E044DA36AFB577B

2

I recently found this book, which I think does a good job of giving an overview of the RNA-Seq tools available and what they do. It also gives code snippets to explain how to execute them.

As slides: http://www.slideshare.net/binal000/rna-seq-data-analysis-a-practical-approach-chapman-hallcrc-mathematical-and-computational-biology-1-e-2015-pdf-unitedvrg

On Amazon:

http://www.amazon.com/RNA-seq-Data-Analysis-Mathematical-Computational/dp/1466595000

6
9.7 years ago
Michele Busby ★ 2.2k

We have a blog post here that goes over basic concepts in RNA Seq:

http://michelebusby.tumblr.com/post/26913184737/thinking-about-designing-rna-seq-experiments-to

I have to edit it (I've been told by complaining readers) to add something on normalization and I also want to add stuff on biases and complexity.

Since those are FFPE samples, some are likely crappy, so you will have a lot of biases, etc. That means just running the data through an existing pipeline may not be optimal, though it may be a good first step. For example, you may need to do something like principal component analysis to see what your confounders are. It's not trivial, but others have done it.
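A minimal sketch of the kind of principal component analysis check described above, assuming you already have a genes x samples matrix of raw read counts. The toy matrix, the function name pca_samples and the log2 counts-per-million transform are illustrative assumptions, not part of any particular pipeline:

```python
import numpy as np

def pca_samples(counts, n_components=2):
    """PCA of samples from a genes x samples matrix of raw read counts.

    Returns per-sample PC scores and the fraction of variance explained.
    """
    counts = np.asarray(counts, dtype=float)
    # Scale each sample by its library size (counts per million), then log2
    # with a pseudocount so highly expressed genes do not dominate.
    lib_sizes = counts.sum(axis=0)
    log_cpm = np.log2(counts / lib_sizes * 1e6 + 1.0)
    # Put samples in rows and center every gene at zero.
    x = log_cpm.T
    x = x - x.mean(axis=0)
    # SVD of the centered matrix gives the principal components.
    u, s, _ = np.linalg.svd(x, full_matrices=False)
    scores = u[:, :n_components] * s[:n_components]
    explained = (s ** 2) / np.sum(s ** 2)
    return scores, explained[:n_components]

# Hypothetical toy data: 6 genes x 4 samples, where samples 1-2 and
# samples 3-4 differ systematically (e.g. two FFPE blocks or batches).
toy_counts = [
    [100, 110,  10,  12],
    [ 50,  55, 500, 480],
    [200, 190, 210, 205],
    [ 30,  28, 300, 310],
    [ 80,  85,   8,   9],
    [ 60,  62,  61,  59],
]
scores, explained = pca_samples(toy_counts)
for i, (pc1, pc2) in enumerate(scores, start=1):
    print(f"sample{i}\tPC1={pc1:+.2f}\tPC2={pc2:+.2f}")
print("variance explained:", np.round(explained, 3))
```

If the samples separate along PC1 or PC2 by something other than your subgroups (block age, extraction date, sequencing lane), that variable is a candidate confounder to account for downstream.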

5
8.8 years ago
dnaseiseq ▴ 220

Hi

Just published: http://www.genomebiology.com/2016/17/1/13

A survey of best practices for RNA-seq data analysis

Genome Biology 2016, 17:13 doi:10.1186/s13059-016-0881-8

4
10.8 years ago
Carlos Borroto ★ 2.1k

I would start by reading this paper from the authors of the Tuxedo pipeline.

Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks.

1

This is a really good point; I always tell people the same thing. There is probably no better way to get started: just work through a paper or two.

0

Thank you so much. I have started reading this paper.

4
9.2 years ago

We make available open-access RNA-seq tutorials that cover cloud computing, tool installation, relevant file formats, reference genomes, transcriptome annotations, quality-control strategies, expression, differential expression, and alternative splicing analysis methods. These tutorials and additional training resources are accompanied by complete analysis pipelines and test datasets made available without encumbrance at http://www.rnaseq.wiki/.

This material was released alongside this publication:

Malachi Griffith, Jason R. Walker, Nicholas C. Spies, Benjamin J. Ainscough, Obi L. Griffith. 2015. Informatics for RNA-seq: A web resource for analysis on the cloud. PLoS Computational Biology 11(8):e1004393.

The Supplementary Information for this publication includes an extensive review of RNA-seq wet lab and analysis concepts, existing tools, common questions, etc.

All materials associated with this publication, including high resolution and original figure files, supplementary tables, etc. are available here: https://github.com/griffithlab/rnaseq_tutorial

This publication was inspired by workshops that we have taught at CBW, CSHL, and NYGC over the last few years. These workshops are ongoing and we hope to maintain and expand the content in the coming years.

2
10.8 years ago

Somatic variant calling is really meant for DNA-Seq data. Although you can look for RNA-editing events with paired DNA-Seq and RNA-Seq data, I think you will have a hard time distinguishing true variants from tumor-specific RNA-editing events if you are comparing two RNA-Seq samples (or calling SNVs in an RNA-Seq sample against a reference genome).

For gene expression, I've included some benchmarks here (which I ran using paired tumor-normal RNA-Seq data):

http://cdwscience.blogspot.com/2013/11/rna-seq-differential-expression.html

http://bioinfo.aizeonpublishers.net/content/2013/6/285-292.html
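To make the paired tumor-normal idea concrete, here is a rough sketch of a per-gene log2 fold change averaged over patient pairs. It is not the method used in the benchmarks linked above; the function names, the counts-per-million scaling, the pseudocount and the toy counts are all assumptions for illustration, and a real analysis would use a dedicated differential expression package with proper statistics:

```python
import math

def log2_cpm(counts, pseudocount=1.0):
    """Convert one sample's raw gene counts to log2 counts per million."""
    total = sum(counts)
    return [math.log2(c / total * 1e6 + pseudocount) for c in counts]

def mean_paired_log2_fc(pairs):
    """Average per-gene log2 fold change (tumor minus normal) over pairs.

    pairs: list of (tumor_counts, normal_counts) tuples, one per patient,
    with genes in the same order in every sample.
    """
    n_genes = len(pairs[0][0])
    sums = [0.0] * n_genes
    for tumor, normal in pairs:
        t, n = log2_cpm(tumor), log2_cpm(normal)
        for g in range(n_genes):
            # Difference within a patient, so patient effects cancel out.
            sums[g] += t[g] - n[g]
    return [s / len(pairs) for s in sums]

# Hypothetical counts for three genes in two patients.
genes = ["geneA", "geneB", "geneC"]
pairs = [
    ([400, 100, 500], [100, 100, 800]),     # patient 1: (tumor, normal)
    ([800, 200, 1000], [200, 200, 1600]),   # patient 2: (tumor, normal)
]
lfc = dict(zip(genes, mean_paired_log2_fc(pairs)))
for gene, value in lfc.items():
    print(f"{gene}\t{value:+.2f}")
```

Taking the difference within each patient before averaging is what makes the comparison paired; an unpaired comparison would pool all tumors against all normals and lose that control.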

I don't think there is a gold standard for gene fusion events, but I've liked chimerascan the best. TopHat-fusion is probably the most popular option.

2
9.7 years ago
Czh3 ▴ 190

RNA-seq pipeline: https://github.com/Czh3/NGSTools

