Alternative Splicing And Transcript Diversity Databases
1
2
Entering edit mode
12.2 years ago
Pierre ▴ 130

Hey guys,

EBI has this very nice database for fasta format sequences of all exons and transcripts. Unforuntanetely, these data are available currently only for human, mouse and rat.

I would like to have the same data for other species. Does anyone know if there are such databases - perhaps manually curated for a specific organism?

Thanks

splicing transcript exon database • 2.4k views
ADD COMMENT
2
Entering edit mode
12.2 years ago

There are countless ways to obtain such information:

  • From Ensembl you can get fasta files for many species from their FTP Download section. A GTF defining the structure of each transcript is also provided there.
  • You can get FASTA sequences for each exon from Ensembl BioMart. Select database, then desired species, then under 'attributes' select 'sequences', expand the sequences section and then select 'exon sequences'. You can retrieve the result in fasta format.
  • It is also possible to get exon sequences via the Ensembl API
  • You could use the tool gffread from Cufflinks along with a GTF file and reference genome sequence to produce these files using a command like this:

    gffread -w transcripts.fa -g /path/to/genome.fa transcripts.gtf

  • You could use the BEDTools utility fastaFromBed to produce these files. You would need a BED file of all exon coordinates. That could be obtained from the UCSC table browser among other options.

Choose your poison...

ADD COMMENT
0
Entering edit mode

Thanks a lot Malachi - I'll give a shot with BioMart.

ADD REPLY

Login before adding your answer.

Traffic: 2258 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6