Tutorial:High Throughput Sequence Analysis With R And Bioconductor
1
2
Entering edit mode
11.8 years ago

Visit the HT Sequence Analysis with R and Bioconductor

Synopsis: High throughput sequencing (HT-Seq or HTS), also known as next generation sequencing (NGS), presents a wide spectrum of opportunities for genome research. Unfortunately, many existing bioinformatic tools do not scale well to large datasets consisting of tens of millions of sequences generated by technologies like Illumina/Solexa, Roche/454, ABI/SOLiD and Helicos. The Bioconductor project fills this gap by providing a rapidly growing suite of well designed R packages for analyzing traditional and HT-Seq datasets. These 'BioC-Seq' packages allow to analyze these sequences with impressive speed performance. Their accelerations are achieved by using memory efficient string containers and performing the time consuming computations with calls to external programs that are implemented in compiled languages (e.g. C/C++). Together these packages form a novel framework that allows researchers to develop efficient pipelines by performing complex data analysis in a high level data analysis and programming environment.

bioconductor • 5.7k views
ADD COMMENT
0
Entering edit mode

Hi, Can you please check the link. It says "Page not found".

ADD REPLY
0
Entering edit mode

Aaah! Got it. "q" was missing at the end of the link.

ADD REPLY
4
Entering edit mode
11.8 years ago

And I would add the functionalities of seqinR (I am one of the developers but it still is a great tool :-)) :

  1. to query major databases from within R
  2. to manipulate fasta files and alignments files
  3. to compute all sorts of statistics on sequences

See here for the manual (which I also find quite entertaining for a software manual!).

ADD COMMENT

Login before adding your answer.

Traffic: 1978 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6