sed - remove line break if line does not end on "

Question

I have a .tsv file in which some lines do not end with a '"'. I would like to remove every line break that does not directly follow a '"'. How can I do that with sed, or with any other shell tool?

Kind regards, Snafu

jstevenco · Accepted Answer · 2015-07-24 23:25:33Z

To elaborate on @Lev's answer, the BSD (OSX) version of sed is less forgiving about the command syntax within the curly braces -- the semicolon command separator is required for both commands:

sed '/"$/!{N;s/\n//;}' file.txt

Per the documentation, an excerpt:

Following an address or address range, sed accepts curly braces '{...}' so several commands may be applied to that line or to the lines matched by the address range. On the command line, semicolons ';' separate each instruction and must precede the closing brace.
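As a rough illustration of the rule quoted above, the two invocations below should behave the same under both GNU and BSD sed: one ends each command inside the braces with a semicolon, the other separates the commands with newlines. The sample file and its contents are made up for this sketch.

```shell
# Hypothetical sample input: the second record is split across two lines.
sample=$(mktemp)
printf '%s\n' '"test"' '"qwe' 'rty"' '"end"' > "$sample"

# One-line form: every command inside the braces ends with a semicolon.
one_line=$(sed '/"$/!{N;s/\n//;}' "$sample")

# Multi-line form: newlines separate the commands instead of semicolons.
multi_line=$(sed '/"$/!{
N
s/\n//
}' "$sample")

printf '%s\n' "$one_line"
rm -f "$sample"
```

Both variables should hold the same three lines, with "qwe and rty" joined into "qwerty".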

Lev Levitsky · Accepted Answer · 2015-07-24 22:17:06Z


This sed command should do it:

sed '/"$/!{N;s/\n//}' file

It says: on every line that does not match "$ (a quote at the end of the line), do the following:

read the next line and append it to the pattern space;
remove the line break between the two lines.

Example:

$ cat file.txt
"test"
"qwe
rty"
foo
$ sed '/"$/!{N;s/\n//}' file.txt
"test"
"qwerty"
foo
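One thing worth checking before relying on this command: N runs only once per cycle, so a record broken across more than two lines is only partly joined. A minimal sketch with made-up input (the closing brace is preceded by a semicolon so that it also runs on BSD sed):

```shell
# Hypothetical input: one record broken across three physical lines.
joined=$(printf '%s\n' '"a long' 'text with' 'lines"' |
  sed '/"$/!{N;s/\n//;}')

# The first two lines are merged, but the third still sits on its own line.
printf '%s\n' "$joined"
```

Here the output is still two lines, because the joined text does not end in a quote and the script has already finished for that cycle. A looping variant is needed for records of arbitrary length.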


8 Comments

SnafuBernd Over a year ago

I get: sed: 1: "/"$/!{N;s/\n//}": bad flag in substitute command: '}'

SnafuBernd Over a year ago

I am on a mac. If this is relevant.

Lev Levitsky Over a year ago

@SnafuBernd Are you using double quotes for the sed command? If so, please try using single quotes. Otherwise, it's one of those mac sed peculiarities

Lev Levitsky Over a year ago

@SnafuBernd there's got to be a way with Mac sed as well. I'm sure someone else will know one.

Lev Levitsky Over a year ago

@jspencer For the last line it would be just $. An address enclosed in /slashes/ matches by regex. The regex is "$, i.e. a quote followed by the end of line. This is the reverse of the OP's requirement of "a line break which is not directly after an "".


Kent · Accepted Answer · 2015-07-24 22:52:02Z

Give this awk one-liner a try:

awk '{printf "%s%s",$0,(/"$/?"\n":"")}' file

Test:

kent$ cat f
"foo"
"bar"
"a long
text with
many many
lines"
"lalala"
kent$ awk '{printf "%s%s",$0,(/"$/?"\n":"")}' f
"foo"
"bar"
"a longtext withmany manylines"
"lalala"
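For readers who find the ternary hard to parse, the same idea can be spelled out with an explicit if/else. This is only a sketch of equivalent logic, not the answerer's own code, and the input lines are made up.

```shell
# Hypothetical input: the second record spans three physical lines.
result=$(printf '%s\n' '"foo"' '"a long' 'text with' 'lines"' '"lalala"' |
  awk '{
    if ($0 ~ /"$/) {
      # The line ends in a quote: the record is complete, emit it with a newline.
      print
    } else {
      # The record continues: emit the text but hold back the newline.
      printf "%s", $0
    }
  }')

printf '%s\n' "$result"
```

Because awk decides line by line whether to emit the newline, this approach handles records of any length without a loop.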

potong · Accepted Answer · 2015-07-25 07:28:20Z

This might work for you (GNU sed):

sed ':a;/"$/!{N;s/\n//;ta}' file

This checks if the last character of the pattern space is a " and if not appends another line, removes a newline and repeats until the condition is met or the end-of-file is encountered.
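The one-liner above is marked as GNU sed. BSD sed reads everything after a label or branch command up to the end of the line as the label name, so a common workaround is to split the script across several -e options. The following is a sketch of that spelling with made-up input:

```shell
# Hypothetical input: the middle record spans four physical lines.
looped=$(printf '%s\n' '"foo"' '"a long' 'text with' 'many many' 'lines"' '"lalala"' |
  sed -e ':a' -e '/"$/!{N;s/\n//;ta' -e '}')

# Each -e fragment ends where a newline would be, so ":a" and "ta" stay intact.
printf '%s\n' "$looped"
```

The loop keeps appending lines until the pattern space ends in a quote, so the four pieces come out as a single line.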

An alternative is:

sed -r ':a;N;s/([^"])\n/\1/;ta;P;D' file

The mechanism is left for the reader to ponder.
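For those who would rather not ponder, here is one possible reading of that script, written out one command per line with comments. It is a sketch under two assumptions: -E is used in place of -r, and the final line is printed because GNU sed prints the pattern space when N reaches the end of input. The input is made up.

```shell
# Hypothetical input: the second record is split across two lines.
annotated=$(printf '%s\n' '"foo"' '"a long' 'text"' '"lalala"' |
  sed -E '
:a
# Append the next input line to the pattern space.
N
# Drop a newline that follows any character other than a double quote.
s/([^"])\n/\1/
# If a newline was dropped, jump back to :a and pull in another line.
ta
# Otherwise print the pattern space up to its first newline,
P
# delete that part, and restart the script with what is left.
D
')

printf '%s\n' "$annotated"
```

In other words, the script always keeps one line of lookahead: a newline survives only when the character before it is a quote, and P and D then emit the finished record while keeping the following line for the next pass.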
