How can I remove the text from a line after a certain character with awk

Question

How can I use awk to remove all text after a certain character ; that appears on every line of my text file? (I then need to run for loops on the text)

Jenny,Sarah,John;North Dakota Henry,Frank;Illinois Aaron,Kathryn,Caitlin,Harris;New York

Stéphane Chazelas · Accepted Answer · 2014-02-28 06:47:30Z

There are two general approaches.

Set awk's field separator to that character. You can then get the parts you want as $1:
```
$ echo "Today was cloudy; yesterday too" | awk -F';' '{print $1}' Today was cloudy 
```

Use gsub() to substitute it with an empty string:

$ echo "Today was cloudy; yesterday too" | awk '{sub(/;.*/,""); print}' Today was cloudy

So, for your example:

$ awk -F';' '{print $1}' file Jenny,Sarah,John Henry,Frank Aaron,Kathryn,Caitlin,Harris

Chris Down · Accepted Answer · 2014-02-28 04:10:16Z

9

Here's an answer with sed -- since you're not really doing any field processing, awk is probably overkill.

sed 's/;.*//'

answered Feb 28, 2014 at 4:10

Chris Down

130k26 gold badges277 silver badges268 bronze badges

1

+1 but based on the OP's comments, I am assuming this is all part of a larger script. @Jenny, that's the kind of detail you should include in your questions by the way.

terdon
– terdon ♦

2014-02-28 04:13:26 +00:00
Commented Feb 28, 2014 at 4:13

Add a comment |

Scrutinizer · Accepted Answer · 2014-02-28 05:38:46Z

5

And also just cut ..

cut -d\; -f1 file

answered Feb 28, 2014 at 5:38

Scrutinizer

1,1425 silver badges7 bronze badges

Add a comment |

HalosGhost · Accepted Answer · 2016-08-30 21:55:03Z

Sometimes you may want to replace all characters after a certain word with another string. For example:

original_string="abc blabla foo bar" and you want to replace words after blabla with 'hello world'

echo $original_string | sed -E 's/(.+ blabla) .+/\1 hello world/'

jubilatious1 · Accepted Answer · 2021-07-16 15:47:16Z

Using Raku (formerly known as Perl_6):

raku -pe 's:g/ \; .*? $$//;'

OUTPUTS:

Jenny,Sarah,John Henry,Frank Aaron,Kathryn,Caitlin,Harris

The above code implements the command line -pe linewise-autoprinting flags, in conjunction with the well-known s/// substitution construct. The code tells Raku to :g globally search for a ;, identify .*? 0-or-more characters that follow (? means non-greedily), up to the end-of-line ($$).

(Actually, since the OP seems to indicate that the ; only occurs once-per-line, the :g can be omitted. Also, since the -pe command line linewise-autoprinting flags are in use, you can use the $ end-of-string assertion, instead of the $$ end-of-line assertion).

The OP seems to indicate that he/she will be running for-loops over the text. This sounds like a simple comma-separated list of names is desired? If so, the following code works:

raku -e 'lines.grep(*.chars).map(*.subst(/\; .*? $$/)).join(",").put;'

OUTPUTS:

Jenny,Sarah,John,Henry,Frank,Aaron,Kathryn,Caitlin,Harris

https://raku.org/

Stack Exchange Network

How can I remove the text from a line after a certain character with awk

5 Answers 5

You must log in to answer this question.

Linked

Hot Network Questions

How can I remove the text from a line after a certain character with awk

5 Answers 5

You must log in to answer this question.

Linked

Related

Hot Network Questions