Question

conditional expression in sed ...?

8.5 years ago

CAnna ▴ 20

Hi,

I have a file looking like this

@HISEQ:229:C81CCANXX:1:1101:10157:17161/1
AAAAAAAAAAAAAAAAAAAA
+
CCCCCGGGG/1GGGGGGCEEG
@HISEQ:229:C81CCANXX:1:1101:10741:22239/1
GCCTTGCTATTGACTCTACT
+
BBBB@EEGGGGGDGGEGGGG
@HISEQ:229:C81CCANXX:1:1101:10901:88419/1
GCTTAGGGATTTTATTGGTA

I would like to remove this /1 at the end of the lines (read names).

I did

sed -i -e 's/\/1//g' MyFile.txt

But the problem is that it also removes the /1 occurring in the middle of the 4th line (sequence quality).

Is there a way to substitute the /1 only on lines starting with @HISEQ (a sort of conditional expression)?

I also tried:

awk -F "/" '/^@HISEQ/{ $2 = "" ; print $0 }' essai.change.name> file.txt

The problem then is that I get only the @HISEQ lines and I lose the lines in between.

Thank you!

C. Anna

sequence • 2.0k views

ADD COMMENT • link updated 8.5 years ago by Pierre Lindenbaum 164k • written 8.5 years ago by CAnna ▴ 20

Avoid the -i flag on sed if you are not sure what you are doing. It is better to first pipe the result to e.g. head and check that the command did what you expected.
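
A minimal sketch of that check, using the command and file name from the question on a one-record sample. The end-of-line anchor in the second command is only one possible correction, and the .bak suffix is an illustrative choice:

```shell
# Small sample standing in for MyFile.txt (first record from the question).
printf '%s\n' \
  '@HISEQ:229:C81CCANXX:1:1101:10157:17161/1' \
  'AAAAAAAAAAAAAAAAAAAA' \
  '+' \
  'CCCCCGGGG/1GGGGGGCEEG' > MyFile.txt

# Preview only: without -i, sed writes to stdout and leaves the file untouched.
preview=$(sed -e 's/\/1//g' MyFile.txt | head -n 4)
printf '%s\n' "$preview"

# The preview shows the quality line (line 4) losing its /1 too,
# so this expression should not be run in place.

# Once a corrected expression previews cleanly, edit in place with a backup:
# -i.bak keeps the original as MyFile.txt.bak.
sed -i.bak -e 's/\/1$//' MyFile.txt
```

If the result is still wrong, the original can be restored from MyFile.txt.bak.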

ADD REPLY • link 8.5 years ago by WouterDeCoster 47k

score 0 · Answer 1 · 2016-05-30

8.5 years ago

Pierre Lindenbaum 164k

'$' matches the end of the line:

sed 's%/1$%%'

"Is there a way to substitute the /1 only on lines starting with @HISEQ"

sed '/^@HISEQ/s%/1$%%'
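
A small sketch of the address form on the first record from the question; the file names sample.fq, by_prefix.fq and by_position.fq are made up for illustration. The second command is an alternative that relies on GNU sed's first~step addressing and assumes a strict four-lines-per-record FASTQ layout:

```shell
# Sample FASTQ record from the question; the quality line has /1 in the middle.
printf '%s\n' \
  '@HISEQ:229:C81CCANXX:1:1101:10157:17161/1' \
  'AAAAAAAAAAAAAAAAAAAA' \
  '+' \
  'CCCCCGGGG/1GGGGGGCEEG' > sample.fq

# /^@HISEQ/ is an address: the s command runs only on lines matching it.
# /1$ matches /1 only at the end of the line; the replacement is empty.
sed '/^@HISEQ/s%/1$%%' sample.fq > by_prefix.fq

# GNU sed only: 1~4 selects lines 1, 5, 9, ... (the header of each record),
# so it does not depend on the instrument name in the header.
sed '1~4s%/1$%%' sample.fq > by_position.fq

cat by_prefix.fq
```

Both variants leave the quality line untouched because the substitution is never applied to it.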

ADD COMMENT • link 8.5 years ago by Pierre Lindenbaum 164k

Does it work with GNU sed? (No change in result when I tried)

I did the following

sed '/^@HISEQ/s/\/1//' in.fq

ADD REPLY • link 8.5 years ago by venu 7.1k

$ echo -e "@HISEQa\1\nx\1" | sed '/^@HISEQ/s/\\1//'
@HISEQa
x\1

$ sed --version
sed (GNU sed) 4.2.2

ADD REPLY • link 8.5 years ago by Pierre Lindenbaum 164k

This one

sed '/^@HISEQ/s%/1$%%'

worked perfectly! Thank you

I would like to make sure I understand the syntax though.

It says: for the lines starting with @HISEQ (^@HISEQ), delete (s%), the character "1" (1$%%). Is this right?

I'm not sure I understand the function of the %%

ADD REPLY • link 8.5 years ago by CAnna ▴ 20

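sed 's%a%b%' is the same as 's/a/b/'. Using '%' instead of '/' as the delimiter makes it simpler because I don't need to escape the slashes, e.g. 's%http://google.com/%url%'. In 's%/1$%%' the pattern is /1$ and the replacement (between the last two %) is empty, so the trailing /1 is deleted.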
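
To illustrate, here is a short sketch comparing the two delimiters on a header line from the question; the variable names and the sentence around the URL are made up for the example:

```shell
# Same substitution written with two different delimiters.
line='@HISEQ:229:C81CCANXX:1:1101:10157:17161/1'

# With / as the delimiter, the / inside the pattern has to be escaped.
with_slash=$(printf '%s\n' "$line" | sed 's/\/1$//')

# With % as the delimiter, no escaping is needed. The empty text between
# the last two % signs is the replacement, so the match is deleted.
with_percent=$(printf '%s\n' "$line" | sed 's%/1$%%')

printf '%s\n%s\n' "$with_slash" "$with_percent"

# The URL example: no backslashes needed for the slashes in the pattern.
url_demo=$(printf '%s\n' 'see http://google.com/ here' | sed 's%http://google.com/%url%')
printf '%s\n' "$url_demo"
```

Any character that does not occur in the pattern or the replacement can serve as the delimiter; '%' is simply a convenient choice here.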

ADD REPLY • link 8.5 years ago by Pierre Lindenbaum 164k