How to Check Per Base Sequence Content of a Fasta file
0
0
Entering edit mode
4.9 years ago
xiaoleiusc ▴ 140

Hi All,

What tool or program should I run to check "Per Base Sequence Content " of a fasta format file from NGS sequencing ? I know FastQC could check "Per Base Sequence Content " of a fastq file, but unfortunately FastQC does not take a fasta format as input.

Thanks ahead,

Xiao

Image link of an example of "Per Base Sequence Content " of a fastq format: https://photos.app.goo.gl/DuDEiXRUvT9x8Gkn6

rna-seq • 3.3k views
ADD COMMENT
0
Entering edit mode

What is in the fasta file? Are these reads or transcripts or something else entirely?

ADD REPLY
0
Entering edit mode

Thanks, It's a fasta file from Illumina sequencing but without quality data (processed fastq file, pcr duplications are collapsed).

ADD REPLY
0
Entering edit mode

Just add fake qualities then.

ADD REPLY
0
Entering edit mode

Thanks, I see. I could look for a tool to add fake qualities. But I noticed that in FastQC, the tool could give you "per base content" of a short read (say in my example, the read length is 36 bp), if I have a long read (say 75 bp single read), the FastQC only give me "per base content" of each base at the beginning, but not each base at full length (e.g. instead of base 37, FastQC would show 37-38 in one column). So I wonder if other tools could do this.

ADD REPLY
0
Entering edit mode

If you look at the options you'll find that you can change how it bins things.

ADD REPLY
0
Entering edit mode

xiaoleiusc : You have to add following option to FastQC to disable binning.

 --nogroup       Disable grouping of bases for reads >50bp. All reports will
                    show data for every base in the read.

Be aware that report file sizes will increase when you do this.

ADD REPLY
0
Entering edit mode

Thanks, it seems this option is only for command line, not in the GUI of FastQC

ADD REPLY
1
Entering edit mode

Run FastQC on the command line then. You should be able to do this on any OS.

ADD REPLY
0
Entering edit mode

I located the FastQC app on my MacOS, right click show package content, and do ./fastqc --nogroup, it works great. Thanks a lot Genomax!

ADD REPLY
1
Entering edit mode

Excellent. Thanks for confirmation.

ADD REPLY

Login before adding your answer.

Traffic: 1938 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6