How to remove multiple newlines at EOF?

Question

I have files that end in one or more newlines and should end in only one newline. How can I do that with Bash/Unix/GNU tools?

Example bad file:

1\n \n 2\n \n \n 3\n \n \n \n

Example corrected file:

1\n \n 2\n \n \n 3\n

In other words: There should be exactly one newline between the EOF and the last non-newline character of the file.

Reference Implementation

Read file contents, chop off a single newline till there no further two newlines at the end, write it back:

#! /bin/python import sys with open(sys.argv[1]) as infile: lines = infile.read() while lines.endswith("\n\n"): lines = lines[:-1] with open(sys.argv[2], 'w') as outfile: for line in lines: outfile.write(line)

Clarification: Of course, piping is allowed, if that is more elegant.

Oleksii Shmalko · Accepted Answer · 2013-07-04 00:38:02Z

32

From useful one-line scripts for sed.

# Delete all trailing blank lines at end of file (only). sed -e :a -e '/^\n*$/{$d;N;};/\n$/ba' file

answered Jul 4, 2013 at 0:38

Oleksii Shmalko

4893 silver badges5 bronze badges

6

Thanks, I used the following to do it in place for multiple files: find . -type f -name '*.js' -exec sed --in-place -e :a -e '/^\n*$/{$d;N;};/\n$/ba' {} \;

jakub.g
– jakub.g

2013-11-22 09:48:56 +00:00
Commented Nov 22, 2013 at 9:48
@jakub.g in place and recursive is exactly what I needed. thank you.

Buttle Butkus
– Buttle Butkus

2015-12-13 10:41:56 +00:00
Commented Dec 13, 2015 at 10:41
1

To add to the excellent comment from @jakub.g you can invoke the command like this on OS X: find . -type f -name '*.js' -exec sed -i '' -e :a -e '/^\n*$/{$d;N;};/\n$/ba' {} \;

davejagoda
– davejagoda

2018-02-19 18:35:06 +00:00
Commented Feb 19, 2018 at 18:35

Add a comment |

Hauke Laging · Accepted Answer · 2013-07-04 00:40:34Z

21

awk '/^$/ {nlstack=nlstack "\n";next;} {printf "%s",nlstack; nlstack=""; print;}' file

answered Jul 4, 2013 at 0:40

Hauke Laging

94.8k21 gold badges132 silver badges185 bronze badges

4

+1: awk's solutions are (almost) always elegant and readable!

Olivier Dulac
– Olivier Dulac

2013-07-04 09:37:55 +00:00
Commented Jul 4, 2013 at 9:37
@OlivierDulac Indeed. When I saw the sed proposal I just thought OMG...

Hauke Laging
– Hauke Laging

2013-07-04 11:32:22 +00:00
Commented Jul 4, 2013 at 11:32
1

this doesn't work on OSX Mavericks using the latest available awk from Homebrew. It errors with awk: illegal statement. brew install mawk and changing the command to mawk works though.

tjmcewan
– tjmcewan

2014-05-09 05:02:09 +00:00
Commented May 9, 2014 at 5:02
@noname I don't even understand the question...

Hauke Laging
– Hauke Laging

2018-10-16 20:22:59 +00:00
Commented Oct 16, 2018 at 20:22
1

+1. Add BEGINFILE {nlstack=""} if operating on multiple files.

wchargin
– wchargin

2022-03-27 20:49:37 +00:00
Commented Mar 27, 2022 at 20:49

| Show 1 more comment

Gilles 'SO- stop being evil' · Accepted Answer · 2013-07-05 23:01:31Z

19

Since you already have answers with the more suitable tools sed and awk; you could take advantage of the fact that $(< file) strips off trailing blank lines.

a=$(<file); printf '%s\n' "$a" > file

That cheap hack wouldn't work to remove trailing blank lines which may contain spaces or other non-printing characters, only to remove trailing empty lines. It also won't work if the file contains null bytes.

In shells other than bash and zsh, use $(cat file) instead of $(<file).

edited Jul 5, 2013 at 23:01

Gilles 'SO- stop being evil'

866k205 gold badges1.8k silver badges2.3k bronze badges

answered Jul 4, 2013 at 0:47

llua

7,08827 silver badges31 bronze badges

+1 to point out what looks like a bug to me : $(<file) isn't really reading the file? why does it discard trailing newlines? (it does, i just tested, thanks for pointing it out!)

Olivier Dulac
– Olivier Dulac

2013-07-04 09:23:22 +00:00
Commented Jul 4, 2013 at 9:23
3

@OlivierDulac $() discards trailing newlines. That's a design decision. I assume that this shall make the integration in other strings easier: echo "On $(date ...) we will meet." would be evil with the newline that nearly every shell command outputs at the end.

Hauke Laging
– Hauke Laging

2013-07-04 11:31:05 +00:00
Commented Jul 4, 2013 at 11:31
@HaukeLaging: good point, it's probably the source of that behaviour

Olivier Dulac
– Olivier Dulac

2013-07-04 12:13:13 +00:00
Commented Jul 4, 2013 at 12:13
I added a special case to avoid appending "\n" to empty files: [[ $a == '' ]] || printf '%s\n' "$a" >"$file".

davidchambers
– davidchambers

2014-04-14 19:11:24 +00:00
Commented Apr 14, 2014 at 19:11
To strip multiple newlines off the start of a file, insert tac into the process (I use gnu coreutils on Mac, so gtac for me) : a=$(gtac file.txt); printf '%s\n' "$a" | gtac > file.txt

r_alex_hall
– r_alex_hall

2018-10-03 12:53:13 +00:00
Commented Oct 3, 2018 at 12:53

Add a comment |

Bengt · Accepted Answer · 2013-07-07 15:25:11Z

You can use this trick with cat & printf:

$ printf '%s\n' "`cat file`"

For example

$ printf '%s\n' "`cat ifile`" > ofile $ cat -e ofile 1$ $ 2$ $ $ 3$

The $ denotes the end of a line.

References

Removing trailing blank lines

Kusalananda · Accepted Answer · 2021-07-14 17:58:02Z

This question is tagged with ed, but nobody has proposed an ed solution.

Here's one:

ed -s file <<'ED_END' a . ?.?+1,$d w ED_END

or, equivalently,

printf '%s\n' a '' . '?.?+1,$d' w | ed -s file

ed will place you at the last line of the editing buffer by default upon startup.

The first command (a) adds an empty line to the end of the buffer (the empty line in the editing script is this line, and the dot (.) is just for coming back into command mode).

The address of the second command (?.?) looks for the nearest previous line that contains something (even white-space characters), and then deletes (d) everything to the end of the buffer from the next line on.

The third command (w) writes the file back to disk.

The added empty line protects the rest of the file from being deleted in the case that there aren't any empty lines at the end of the original file.

Ilmari Karonen · Accepted Answer · 2013-07-04 10:16:43Z

Here's a Perl solution that doesn't require reading more than one line into memory at a time:

my $n = 0; while (<>) { if (/./) { print "\n" x $n, $_; $n = 0; } else { $n++; } }

or, as a one-liner:

perl -ne 'if (/./) { print "\n" x $n, $_; $n = 0 } else { $n++ }'

This reads the file a line at a time and checks each line to see if contains a non-newline character. If it doesn't, it increments a counter; if it does, it prints the number of newlines indicated by the counter, followed by the line itself, and then resets the counter.

Technically, even buffering a single line in memory is unnecessary; it would be possible to solve this problem using a constant amount of memory by reading the file in fixed-length chunks and processing it character by character using a state machine. However, I suspect that would be needlessly complicated for the typical use case.

terdon · Accepted Answer · 2013-07-04 00:51:38Z

If your file is small enough to slurp into memory, you can use this

perl -e 'local($/);$f=<>; $f=~s/\n*$/\n/;print $f;' file

jfg956 · Accepted Answer · 2013-07-09 10:19:08Z

In python (I know it is not what you want, but it is much better as it is optimized, and a prelude to the bash version) without rewriting the file and without reading all the file (which is a good thing if the file is very large):

#!/bin/python import sys infile = open(sys.argv[1], 'r+') infile.seek(-1, 2) while infile.read(1) == '\n': infile.seek(-2, 1) infile.seek(1, 1) infile.truncate() infile.close()

Note that it does not work on files where the EOL character is not '\n'.

jfg956 · Accepted Answer · 2013-07-09 10:27:14Z

A bash version, implementing the python algorithm, but less efficient as it needs many processes:

#!/bin/bash n=1 while test "$(tail -n $n "$1")" == ""; do ((n++)) done ((n--)) truncate -s $(($(stat -c "%s" "$1") - $n)) "$1"

Stack Exchange Network

How to remove multiple newlines at EOF?

Reference Implementation

9 Answers 9

For example

References

You must log in to answer this question.

Linked

Hot Network Questions

How to remove multiple newlines at EOF?

Reference Implementation

9 Answers 9

For example

References

You must log in to answer this question.

Linked

Related

Hot Network Questions