Entering edit mode
6.5 years ago
aditigrawal
•
0
Hi All, I have 4 RNA-seq libraries ( control and treated in replicates). I merge them together to assemble them. Now, after assembly, I have got only single fasta file. How can I fetch sequences of individual libraries. Thanks Aditi Agrawal
Hallo aditigrawal,
some questions about this:
fin swimmer
If you have generated Denovo assembly one possible way could be, map raw reads individually on merged assembly using aligner like a
bowtie, bowtie2
etc. It will result in the individual alignment file (.sam; subsequently you can convert it into .bam usingsamtools view
utility). Then you can simply find out read count (i.e. the number of raw reads supported to a particular transcript) usingsamtools idxstat
. As your mapping each sample separately you will get separate read count. Then you can use the transcript ids obtained in read count table you can get sequences for individual samples using some programs likefaSomeRecord
developed under UCSC toolkit.By the way why you want sequences for each sample separately?