Question

fastq file data

0

Entering edit mode

7.1 years ago

Sam ▴ 150

Hello

how I can show nucleotide distribution and sequence length of multi fastq file (1.fq, 2.fq,...,6.fq) in just 2 separate graph?

Thanks

RNA-seq fastq • 3.2k views

ADD COMMENT • link updated 7.1 years ago by Istvan Albert 102k • written 7.1 years ago by Sam ▴ 150

0

Entering edit mode

what have you tried/found so far ?

ADD REPLY • link 7.1 years ago by Pierre Lindenbaum 164k

0

Entering edit mode

fastx-tool kit but it just for one lib and I want merge all lib data in one geraph

ADD REPLY • link 7.1 years ago by Sam ▴ 150

0

Entering edit mode

what about merging the fastq files before running fastx ?

ADD REPLY • link 7.1 years ago by Pierre Lindenbaum 164k

0

Entering edit mode

I don't want merge all reads , just want show sequence length of multi fq file , separately (for instance according color ) in one graph

ADD REPLY • link 7.1 years ago by Sam ▴ 150

0

Entering edit mode

I don't want merge all reads

http://hannonlab.cshl.edu/fastx_toolkit/commandline.html

"Tools can read from STDIN "

I think you should consider this option.

ADD REPLY • link 7.1 years ago by Pierre Lindenbaum 164k

0

Entering edit mode

should use output of fastx_quality_stats , as input for FASTA/Q Nucleotide Distribution

, so is it a right command ?

fastx_nucleotide_distribution_graph.sh -i file1.TXT file2.TXT file3.TXT [-t TITLE]  [-o OUTPUT]

ADD REPLY • link 7.1 years ago by Sam ▴ 150

1

Entering edit mode

No harm is trying the command out :)

ADD REPLY • link 7.1 years ago by GenoMax 148k

0

Entering edit mode

I think it should be something like

gunzip -c *.fq.gz | fastx_nucleotide_distribution_graph.sh - o OUTPUT

ADD REPLY • link 7.1 years ago by Pierre Lindenbaum 164k

0

Entering edit mode

This is probably about Illumina data, but it never hurts to mention this!

ADD REPLY • link 7.1 years ago by WouterDeCoster 47k

0

Entering edit mode

above commands not work , except hurts and harm topic fast reply! could you help to find a way ?

ADD REPLY • link 7.1 years ago by Sam ▴ 150

0

Entering edit mode

You didn't tell us where you got the data from. So, Illumina data?

above commands not work

You'll have to tell us a bit more about how those don't work.

ADD REPLY • link 7.1 years ago by WouterDeCoster 47k

0

Entering edit mode

above commands not work

ADD REPLY • link 7.1 years ago by Pierre Lindenbaum 164k

0

Entering edit mode

but I explained all story, fastx_nucleotide_distribution_graph.sh just take TXT out put file of fastx_quality_stats as input, with one txt input file fastx_nucleotide_distribution_graph.sh works well but with two input file( -i file1.txt file2.txt) I got this error:

gnuplot> set term png size 1048,768
                  ^
         line 0: unknown or ambiguous terminal type; type just 'set terminal' for a list

WARNING: Plotting with an 'unknown' terminal.

It's illumina fq files, could you introduce other scripts ?

ADD REPLY • link 7.1 years ago by Sam ▴ 150

0

Entering edit mode

Anything wrong with FastQC?

ADD REPLY • link 7.1 years ago by WouterDeCoster 47k

0

Entering edit mode

no all thing is OK with FASTQC , I can post only 5 post per 6hr, is there any way to improve it ?

ADD REPLY • link 7.1 years ago by Sam ▴ 150

0

Entering edit mode

I can post only 5 post per 6hr, is there any way to improve it ?

After you have been on Biostars for a while this restriction will be removed.

ADD REPLY • link 7.1 years ago by WouterDeCoster 47k

0

Entering edit mode

fastx calls gnuplot ; it seems that your version is not complete:

https://stackoverflow.com/questions/22816030

ADD REPLY • link 7.1 years ago by Pierre Lindenbaum 164k

0

Entering edit mode

it's works without any problem with -i file1.text , and I have a nice plot in out put , problem is fastx_nucleotide_distribution_graph.sh is not compatible with more than one input file. so I should find other alternative scripts for merge nucleotide_distribution_graph from distinct fq file.

ADD REPLY • link 7.1 years ago by Sam ▴ 150

0

Entering edit mode

If you need just one plot for the entire dataset then cat'ing the fastq files together to generate one inout (per read, R1/R2) may be the way to go.

ADD REPLY • link 7.1 years ago by GenoMax 148k

0

Entering edit mode

I think to a awk code to have a sequence length frequency in each fq file and then merge them in Excel to have a unique sequence length graph with different color for each lib.

ADD REPLY • link 7.1 years ago by Sam ▴ 150

score 0 · Answer 1 · 2017-11-12

0

Entering edit mode

7.1 years ago

Istvan Albert 102k

FastQC produces both plots.

ADD COMMENT • link 7.1 years ago by Istvan Albert 102k