Question

RNA-seq alignment in UNIX

2.6 years ago

Hien • 0

Hello everyone,

I am new to bioinformatics and would like to ask for your advice, help, or comments on an RNA-seq analysis for a class. The question: I have 4 FASTA sequencing files from 4 different cell lines, prefiltered to contain only the sequencing reads from human chr21. They are VCap.fasta, LnCap.fasta, EP.fasta, and PrEC.fasta. Nothing else. In addition, I have received the reference sequence, i.e. the full sequence of chr21 in FASTA format (chr21.fa). Note that all reference sequences and annotations here are from the hg19 reference genome.

Use suitable software to align all four RNA-sequenced samples.

Normally, I use hisat2 or Kallisto for RNA-seq. I usually have R1.fastq and R2.fastq for each sample, so I can align them against the reference and go from FASTQ to SAM to BAM. However, here I have only one FASTA file per sample, and it is already prefiltered to contain only the sequencing reads from chr21. Could anyone help?

index sequencing hisat2 RNA • 883 views

updated 2.6 years ago by ATpoint 85k • written 2.6 years ago by Hien • 0

I think you can use hisat2 with FASTA files as well as FASTQ files. The first thing I would do is check whether those FASTA files contain paired-end or single-end reads. If paired, I would separate them into 2 FASTA files, one with R1 and one with R2, and run hisat2 with chr21.fa as the reference (after indexing it with hisat2-build). If single-end, you can run hisat2 in single-end mode (I don't remember the exact argument for this, but there is an option).
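A quick way to do that check is to look at the read names. Below is a minimal Python sketch, assuming the mates are marked with the common `/1` and `/2` name suffixes; that convention is an assumption and your files may use a different one or none at all, so treat the result as a hint only. The function names `read_names` and `guess_layout` are made up for this example.

```python
def read_names(fasta_lines):
    """Yield the first whitespace-delimited token of every FASTA header line."""
    for line in fasta_lines:
        if line.startswith(">"):
            fields = line[1:].split()
            if fields:
                yield fields[0]


def guess_layout(fasta_lines):
    """Guess 'paired' or 'single' from read names.

    Assumption: paired mates share a base name and end in /1 and /2.
    Files that mark mates differently will be reported as 'single'.
    """
    mates = {}
    for name in read_names(fasta_lines):
        if name.endswith("/1") or name.endswith("/2"):
            # Group mate numbers under the name without its suffix.
            mates.setdefault(name[:-2], set()).add(name[-1])
    has_pairs = any(found == {"1", "2"} for found in mates.values())
    return "paired" if has_pairs else "single"
```

To try it on one sample, open the file and pass the handle, for example `with open("VCap.fasta") as fh: print(guess_layout(fh))`.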

2.6 years ago by iraun 6.2k

Thank you for your suggestion. I have checked, and I can use hisat2's -U option for single-end alignment.
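For anyone finding this later, here is a rough sketch of what the whole run could look like for the four samples, written as a small Python wrapper that only builds the command lines. It assumes single-end reads, and that hisat2 and hisat2-build are on the PATH. The index prefix `chr21_index` and the `.sam` output names are placeholders I picked for illustration. Note the `-f` flag, which tells hisat2 that the reads are FASTA rather than FASTQ.

```python
import subprocess

# Sample names taken from the question; each is expected as <name>.fasta.
SAMPLES = ["VCap", "LnCap", "EP", "PrEC"]


def build_commands(reference="chr21.fa", index_prefix="chr21_index", samples=SAMPLES):
    """Return the hisat2-build call followed by one hisat2 call per sample."""
    commands = [["hisat2-build", reference, index_prefix]]
    for sample in samples:
        commands.append([
            "hisat2",
            "-f",                     # input reads are FASTA, not FASTQ
            "-x", index_prefix,       # index built from chr21.fa
            "-U", f"{sample}.fasta",  # unpaired (single-end) reads
            "-S", f"{sample}.sam",    # SAM output (placeholder name)
        ])
    return commands


def run_all(commands):
    """Run the commands in order, stopping at the first failure."""
    for command in commands:
        subprocess.run(command, check=True)
```

Calling `run_all(build_commands())` would index the reference and then align each sample. Printing the lists first is a cheap way to check the commands before running anything.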

2.6 years ago by Hien • 0

You can do it the cheap way and convert them to FASTQ by giving every read a dummy quality line.

If you have

>readname
TAGCACGTACGTAGCGAC

You can turn it into

@readname
TAGCACGTACGTAGCGAC
+
AAAAAAAAAAAAAAAAAA

Either put something together in awk or use seqtk: https://github.com/lh3/seqtk
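If you would rather not use awk, the same conversion can be sketched in a few lines of Python. This is only an illustration and not what seqtk does internally. The function name `fasta_to_fastq` and the default quality character are my own choices; `A` corresponds to Phred 32 in the usual Phred+33 encoding. Sequences wrapped over several lines are joined into one.

```python
def format_record(name, chunks, quality_char):
    """Build one four-line FASTQ record with a constant dummy quality string."""
    sequence = "".join(chunks)
    return f"@{name}\n{sequence}\n+\n{quality_char * len(sequence)}\n"


def fasta_to_fastq(fasta_lines, quality_char="A"):
    """Yield FASTQ records for the FASTA records found in fasta_lines."""
    name, chunks = None, []
    for line in fasta_lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if name is not None:
                yield format_record(name, chunks, quality_char)
            name, chunks = line[1:], []
        elif name is not None:
            chunks.append(line)
    if name is not None:
        yield format_record(name, chunks, quality_char)
```

To convert a file, read from one handle and write to another, for example `out.writelines(fasta_to_fastq(fh))` with `fh` opened on `VCap.fasta` and `out` opened for writing on a `.fastq` file of your choice.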

2.6 years ago by ATpoint 85k

Thank you so much for the suggestion!

2.6 years ago by Hien • 0