Transform smallRNA SRA (Illumina) sequences to FASTA
9.9 years ago
juanma_lace ▴ 20

Hi all,

I'm running a small RNA analysis and want to use some Illumina libraries from the NCBI SRA.

The problem is that all the reads are 35 nt long (I understand they still contain some kind of adapter).

I want to know how to obtain the real FASTA sequences programmatically (I want to use them in a pipeline).

Thank you in advance

Sequence files:

http://www.ncbi.nlm.nih.gov/biosample?Db=biosample&DbFrom=bioproject&Cmd=Link&LinkName=bioproject_biosample&LinkReadableName=BioSample&ordinalpos=1&IdsFromResult=241290

sra adaptor illumina fasta smallRNA • 2.8k views
9.9 years ago
Ram 44k

The SRA toolkit should help you: http://www.ncbi.nlm.nih.gov/sites/books/NBK158900/


I see that illumina-dump creates a lot of files. How can I get the FASTA from those files?


Here you go. But seriously, this stuff is harder to find than it needs to be:

fastq-dump SRR1196045 --split-spot --fasta

Thank you for the sarcasm, but it does not answer my question. My question is about removing the adapters, not just converting.


I'm sorry for the sarcasm - I think it's the first time I've used that site, and I agree now that it seems overly harsh. RamRS's link suggests cutadapt, which should trim any adapter sequence you specify. The hard part is finding the correct adapter sequence to trim, and FastQC might help you with that.
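
Since the question asks for something that fits in a pipeline, here is a minimal sketch of chaining the two steps from Python. It reuses the fastq-dump call from this thread and the basic cutadapt form (-a for the 3' adapter, -o for the output file). The function names, the ".trimmed.fasta" output name, and the assumption that fastq-dump writes "<accession>.fasta" into the current directory are illustrative, so check them against your installed versions.

```python
import subprocess


def build_commands(accession, adapter):
    """Return the two command lines (dump, trim) as argument lists.

    Assumption: fastq-dump with --fasta writes <accession>.fasta into the
    current working directory. The trimmed file name is a hypothetical choice.
    """
    raw = f"{accession}.fasta"
    trimmed = f"{accession}.trimmed.fasta"
    # Same call as posted earlier in this thread.
    dump = ["fastq-dump", accession, "--split-spot", "--fasta"]
    # cutadapt: -a gives the 3' adapter, -o the output file, input file last.
    trim = ["cutadapt", "-a", adapter, "-o", trimmed, raw]
    return dump, trim


def run_pipeline(accession, adapter):
    """Run both steps in order; raises CalledProcessError if either fails."""
    for cmd in build_commands(accession, adapter):
        subprocess.run(cmd, check=True)
```

Calling run_pipeline("SRR1196045", adapter) with the adapter sequence you settle on would dump the run and then trim it. Both tools need to be on your PATH.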


cutadapt and Trimmomatic can trim adapters, and FastQC might help with identifying them.

EDIT: Lorena Pantano has given a much better, detailed response here: Transform smallRNA SRA (Illumina) sequences to FASTA

And yes, lmgtfy can be a bit too condescending at times. I think the cultural difference amplifies the effect, unfortunately.


Some adapter-removal tools are specific to mRNA and not small RNA. I would use cutadapt. The adapter in small RNA is always the same, and it is enough to detect 8 nucleotides of it. The adapter should be something like AGATCGGAAGAGCAC, or the same without the first A if it is the standard protocol. FastQC was not working well the last time I used it (1/2 years ago, and the author admitted it was not prepared for small RNA).
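
To make the "detect 8 nucleotides" idea concrete, here is a rough pure-Python sketch of 3' adapter trimming. It is not how cutadapt works internally (cutadapt tolerates mismatches, this only does exact matching), and the function name and example read are made up for illustration. With 35 nt reads the adapter is usually cut off at the end of the read, so the sketch accepts a partial adapter as long as at least min_overlap bases of it are present.

```python
def trim_3prime_adapter(read, adapter, min_overlap=8):
    """Remove a 3' adapter (exact match only) and everything after it.

    The adapter may be truncated by the end of the read; it is still
    recognised if at least `min_overlap` of its leading bases are present.
    Reads without such a match are returned unchanged.
    """
    for start in range(len(read) - min_overlap + 1):
        # Part of the read that would line up with the start of the adapter.
        tail = read[start:start + len(adapter)]
        if len(tail) >= min_overlap and adapter.startswith(tail):
            return read[:start]
    return read


# Hypothetical 35 nt read: a 22 nt insert followed by 13 nt of the adapter.
adapter = "AGATCGGAAGAGCAC"
read = "TAGCTTATCAGACTGATGTTGA" + "AGATCGGAAGAGC"
print(trim_3prime_adapter(read, adapter))  # TAGCTTATCAGACTGATGTTGA
```

In a real pipeline I would still let cutadapt do this, since it handles sequencing errors in the adapter. The sketch is only meant to show what the minimum-overlap setting controls.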


Thank you for the detailed response. It is better for the OP to hear from someone who shares their research domain.
