Hi guys, I am trying to trim my primer sequences off my sequence data. I wrote the command with cutadapt.
cutadapt -g F-primer sequence -o output file name directory of input file/input file name
it could cut the forward primer off. but if I use the same command for reverse primers
cutadapt -a R-primer sequence -o output file name directory of input file/input file name
it did not work. I tried to put the reverse complementary sequence of the 3´ sequence of the primer, it did not work either. If I merge the 2 reads and do the cutadapt,
cutadapt -a <3´sequence> -g <forward primer sequence> -o <output file name> directory of input file/input file name
neither works. So I wonder if anyone can give me some suggestions on how to make the right command, or is there any better software to use to trim the primer sequences?
Thanks Ramona
Can you use spacing to make your question clear - description and commands in separate lines. It is really confusing to see the commands you tried between the description you provided.
oh, sorry. It is actually like this: cutadapt -g <f-primer sequence=""> -o <output file="" name=""> <directory of="" input="" file="" input="" file="" name="">
cutadapt -a <r-primer sequence=""> -o <output file="" name=""> <directory of="" input="" file="" input="" file="" name=""> (this is for the reverse primer cutting, but I tried both reverse complementary sequence of the 3´sequence and the 3´sequence without making the reverse complementary, both did not work)
As for the sequence I merge with PEAR, I tried to trim the primers with the command below, it did not work
cutadapt -g <f-primer sequence=""> -a <r-primer sequence=""> -o <output file="" name=""> <directory of="" input="" file="" input="" file="" name="">
I do not know which version of cutadapt is being used here. This is the usual command I use -
cutadapt -A Rprimer -a Fprimer -o out_PE1 -p out_PE2 in_PE1 in_PE2
For single-end data
cutadapt -A Rprimer -a Fprimer -o outfile_SE infile_SE
Sorry I don't understand the question. Where did PEAR come from in between?
Are you using single-end data or paired-end. Which version of cutadapt are you using, and what is your primer sequence? And what is the actual command you tried (the complete command with file names and paths if possible). It is unclear what is the problem without letting us know what has been tried on your side.
Did you check the cutadapt manual - http://cutadapt.readthedocs.io/en/stable/guide.html The following is an example only -
cutadapt -a ATGCATGC -g GCATGCAT -o /path/to/file/outputfile.fastq /path/to/file/inputfile.fastq
Hi Rohit, I used the version of cutadapt==1.9.1 I think as i installed from this website : http://cutadapt.readthedocs.io/en/stable/installation.html. I used pair-end data, and the primer sequence is 20bp nucleotides used to amplify the target gene. I did follow the rules of the manual as I understand to put the command cutadapt -a <r-primer> -g <f-primer> -o
the input file could be Read1(no need to add -a as I understand) or the merged file of Read1 and Read2. I could trim the forward primer by only putting the Read1 and -g <f-primer>, but not the reverse primer by using command cutadapt -a<r-primer>. Hope my explanation is clear
I think you are missing an output file in the command, since -o requires a file to hold the data. The command below is for single-end data only or the merged reads you mention.
cutadapt -a <r-primer> -g <f-primer> -o /path/outputfile /path/inputfile
If it is paired end data, then you would need to use two outputs and two inputs as below
cutadapt -a <r-primer> -g <f-primer> -o /path/outputfile1 -p /path/outputfile2 /path/inputfile1 /path/inputfile2
Also, can you show the error log incase there is something missing which went unnoticed.
Hi. I also had the same problem
cutadapt -u 4 –a GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGGACTACHVGGGTATCTAATCC –g TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGNNNNCCTACGGGNGGCWGCAG –o 21I-C1_S2_L001_R1_001_t.fastq -p 21I-C1_S2_L001_R2_001_t.fastq 21I_C1 21I-C1_S2_L001_R1_001.fastq 21I-C1_S2_L001_R2_001.fastq
I tried to remove the f and r primers with the removal of first 4 bases for each read but this error came out
/usr/lib/python2.7/dist-packages/pkg_resources.py:1031: UserWarning: /home/shaufi/.python-eggs is writable by group/others and vulnerable to attack when used with get_resource_filename. Consider a more secure location (set with .set_extraction_path or the PYTHON_EGG_CACHE environment variable). warnings.warn(msg, UserWarning) cutadapt version 1.11 Copyright (C) 2010-2016 Marcel Martin marcel.martin@scilifelab.se
cutadapt removes adapter sequences from high-throughput sequencing reads.
Usage: cutadapt -a ADAPTER [options] [-o output.fastq] input.fastq
For paired-end reads: cutadapt -a ADAPT1 -A ADAPT2 [options] -o out1.fastq -p out2.fastq in1.fastq in2.fastq
Replace "ADAPTER" with the actual sequence of your 3' adapter. IUPAC wildcard characters are supported. The reverse complement is not automatically searched. All reads from input.fastq will be written to output.fastq with the adapter sequence removed. Adapter matching is error-tolerant. Multiple adapter sequences can be given (use further -a options), but only the best-matching adapter will be removed.
Input may also be in FASTA format. Compressed input and output is supported and auto-detected from the file name (.gz, .xz, .bz2). Use the file name '-' for standard input/output. Without the -o option, output is sent to standard output.
Citation:
Marcel Martin. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.Journal, 17(1):10-12, May 2011. http://dx.doi.org/10.14806/ej.17.1.200
Use "cutadapt --help" to see all command-line options. See http://cutadapt.readthedocs.org/ for full documentation.
cutadapt: error: Too many parameters. shaufi@ppplives-Precision-T5610[cutadapt-1.11] [12:03]
What do you think could be the problem? Thanks
Please post this as another question.
Thanks a lot, Rohit. I am running another round of sequencing now, and I will try this. See if it works.