Get contents before a colon

Question

I have a text file on Linux where the contents are like below:

help.helloworld.com:latest.world.com
dev.helloworld.com:latest.world.com

I want to get the contents before the colon like below:

help.helloworld.com
dev.helloworld.com

How can I do that within the terminal?

The grep utility is used for looking for lines matching regular expressions. You could possibly use it here, but it would be more appropriate to use a tool that extracts data from fields given some delimiter, such as the cut utility.
– Kusalananda ♦, Commented Aug 27, 2019 at 17:23
I've submitted an edit to take out the word "grep" and replace it with "find" in the title and "get" in the question body, to avoid the X/Y issue of assuming grep is the right tool to solve the actual problem.
– Monty Harder, Commented Aug 28, 2019 at 18:21
All I can say is that the contents before the colon is much better than the contents after the colon ;-).
– Peter - Reinstate Monica, Commented Aug 30, 2019 at 14:02

terdon · Accepted Answer · 2019-08-27 17:21:51Z

This is what cut is for:

$ cat file
help.helloworld.com:latest.world.com
dev.helloworld.com:latest.world.com
foo:baz:bar
foo
$ cut -d: -f1 file
help.helloworld.com
dev.helloworld.com
foo
foo

You just set the delimiter to : with -d: and tell it to only print the 1st field (-f1).
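One detail that the sample output hints at: cut passes a line through unchanged when it contains no delimiter at all, which is why a bare foo line still prints. A minimal sketch, assuming you would rather skip such lines, uses the -s option. The sample data and the variable names (input, with_all, only_delimited) are made up for illustration.

```shell
# Hypothetical sample input: two lines with colons, one without.
input='help.helloworld.com:latest.world.com
foo:baz:bar
foo'

# Default behaviour: a line without ':' is printed as-is.
with_all=$(printf '%s\n' "$input" | cut -d: -f1)

# -s suppresses lines that contain no delimiter.
only_delimited=$(printf '%s\n' "$input" | cut -s -d: -f1)

printf 'default:\n%s\n' "$with_all"
printf 'with -s:\n%s\n' "$only_delimited"
```

Whether skipping is the right choice depends on what a colon-less line means in your file, so treat -s as an option to consider rather than a fix.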

Freddy · Accepted Answer · 2019-08-27 17:08:15Z

Or an alternative:

$ grep -o '^[^:]*' file
help.helloworld.com
dev.helloworld.com

This matches the run of characters starting at the beginning of each line (^) that are not colons ([^:]*), and -o prints only that matched part.
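Because the bracket expression excludes the colon, the match cannot run past the first colon even when a line contains several. A small sketch of that, with made-up sample data and a hypothetical variable name (first_fields); note that -o is an extension found in GNU and BSD grep, so this assumes one of those.

```shell
# Hypothetical sample input, including a line with two colons.
input='help.helloworld.com:latest.world.com
foo:baz:bar'

# -o prints only the matched part: everything from line start
# up to, but not including, the first ':'.
first_fields=$(printf '%s\n' "$input" | grep -o '^[^:]*')

printf '%s\n' "$first_fields"
```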

Centimane · Accepted Answer · 2019-08-27 17:20:14Z

Would definitely recommend awk:

awk -F ':' '{print $1}' file

This uses : as the field separator and prints the first field.
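One thing awk makes easy is adding a condition. As a sketch, assuming you only want output for lines that actually contain a colon, you can test the field count NF before printing. The sample data and the variable name awk_filtered are illustrative only.

```shell
# Hypothetical sample input: with colon, without colon, trailing colon.
input='help.helloworld.com:latest.world.com
no.colon.com
colon.at.the.end.com:'

# With ':' as separator, NF > 1 holds only when the line had at least
# one colon (a trailing colon yields an empty second field).
awk_filtered=$(printf '%s\n' "$input" | awk -F: 'NF > 1 { print $1 }')

printf '%s\n' "$awk_filtered"
```

Dropping the NF > 1 condition gives back the behaviour of the one-liner in the answer, where colon-less lines are printed whole.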

kGdmioT · Accepted Answer · 2019-08-29 07:32:27Z


updated answer

Considering the following file file.txt:

help.helloworld.com:latest.world.com
dev.helloworld.com:latest.world.com
no.colon.com
colon.at.the.end.com:

You can use sed to remove everything after the colon:

sed -e 's/:.*//' file.txt

This works for all the corner cases pointed out in the comments (a line that ends in a colon, or a line with no colon at all), although these weren't mentioned in the question itself. Thanks to @Rakesh Sharma, @mirabilos, and @Freddy for their comments. Answering questions is a great way to learn.
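The corner cases are easy to check for yourself. A quick sketch that feeds the same four kinds of line through the substitution without needing file.txt on disk; the variable names input and sed_result are made up for this example.

```shell
# Same shapes of line as in file.txt, held in a variable for convenience.
input='help.helloworld.com:latest.world.com
dev.helloworld.com:latest.world.com
no.colon.com
colon.at.the.end.com:'

# Delete from the first ':' to end of line; lines without ':' do not
# match the pattern and are printed unchanged.
sed_result=$(printf '%s\n' "$input" | sed -e 's/:.*//')

printf '%s\n' "$sed_result"
```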

answered Aug 28, 2019 at 1:41, edited Aug 29, 2019 at 7:32

sed -e 's/:.*//' file.txt is another way with POSIX sed.
– Rakesh Sharma, Commented Aug 28, 2019 at 4:02
sed -ne 'y/:/\n/;P' file.txt also can be used.
– Rakesh Sharma, Commented Aug 28, 2019 at 4:05
Change .+ to .*
– Rakesh Sharma, Commented Aug 28, 2019 at 4:37
@Randy Joselyn Since there's an implicit if in the s///p syntax, you need to modify your regex to handle lines with no colons, something like sed -nEe 's/([^:]*)(:.*|)/\1/p'. Note that this requires GNU sed, but since you are on GNU sed anyway, this shouldn't matter.
– Rakesh Sharma, Commented Aug 28, 2019 at 5:05
This answer could have been my favourite, but the ERE are unnecessary. sed -n '/:/s/^\([^:]*\):.*$/\1/p' (add --posix if you use GNU sed, just to spite the extensionism of theirs)
– mirabilos, Commented Aug 28, 2019 at 18:09


schrodingerscatcuriosity · Accepted Answer · 2019-08-27 17:30:47Z


Requires GNU grep. It would not work with the default grep on e.g. macOS or any of the other BSDs.

Do you mean like this:

grep -oP '.*(?=:)' file

Output:

help.helloworld.com
dev.helloworld.com

answered Aug 27, 2019 at 16:58, edited Aug 27, 2019 at 17:30

If there are two or more colons on the line, this will print everything until the last one, so not what the OP needs. Try echo foo:bar:baz | grep -oP '.*(?=:)'. This will work for the OP's example, but not for the general case as described in the question.
– terdon ♦, Commented Aug 27, 2019 at 17:19
There is only one colon and it's working fine, but thanks for the update.
– Joel Deleep, Commented Aug 27, 2019 at 17:25
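The greedy-match caveat raised in the comments is not specific to grep -P. As a rough sketch, the same effect can be reproduced with plain sed, which avoids needing PCRE support: a greedy .* in front of the colon stretches to the last colon, whereas deleting from the first colon onward keeps only the first field. The variable names (line, greedy, first_only) are made up for illustration.

```shell
line='foo:bar:baz'

# Greedy '.*' before ':' extends as far as it can, i.e. to the LAST colon.
greedy=$(printf '%s\n' "$line" | sed 's/\(.*\):.*/\1/')

# Removing everything from the FIRST colon keeps only the first field.
first_only=$(printf '%s\n' "$line" | sed 's/:.*//')

printf 'greedy:     %s\n' "$greedy"      # foo:bar
printf 'first only: %s\n' "$first_only"  # foo
```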


Jim Rippon · Accepted Answer · 2019-08-30 13:04:40Z

You could achieve this with bash string handling, by removing the longest suffix that matches :* from each line as it is read, like so:

while IFS= read -r line; do printf '%s\n' "${line%%:*}"; done < inputfile

This might be a useful alternative if you are parsing the file in a shell script (though I suspect using cut might be more efficient).
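The difference between the two suffix-removal operators matters once a line has more than one colon. A minimal sketch with a made-up value; the variable names longest, shortest and unchanged are illustrative.

```shell
line='foo:baz:bar'

# %% removes the LONGEST suffix matching ':*', i.e. from the first colon.
longest=${line%%:*}    # foo

# %  removes the SHORTEST suffix matching ':*', i.e. from the last colon.
shortest=${line%:*}    # foo:baz

# A value without any colon is left untouched by either form.
plain='no.colon.com'
unchanged=${plain%%:*} # no.colon.com

printf '%s\n' "$longest" "$shortest" "$unchanged"
```

Both operators are part of POSIX parameter expansion, so this is not limited to bash.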

Please read "Why is using a shell loop to process text considered bad practice?"
– αғsнιη, Commented Aug 31, 2019 at 8:29

Léa Gris · Accepted Answer · 2019-08-31 00:18:20Z


In pure POSIX shell without using external commands, I'd do:

#!/bin/sh
IFS=:
while read -r a _; do
  echo "$a"
done < file.txt
unset IFS
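A possible variation, offered as a sketch rather than a correction: putting the IFS assignment directly in front of read limits it to that one command, so there is nothing to unset or restore afterwards. The sample data and the names input, saved_ifs, first, rest and result are assumptions for this example, and printf is swapped in for echo to avoid surprises with unusual data.

```shell
# Hypothetical sample input.
input='help.helloworld.com:latest.world.com
foo:baz:bar'

saved_ifs=$IFS   # kept only to show that IFS is left untouched

# IFS=: applies to the read command alone; 'first' receives the text
# before the first colon and 'rest' everything after it.
result=$(printf '%s\n' "$input" | while IFS=: read -r first rest; do
  printf '%s\n' "$first"
done)

printf '%s\n' "$result"
```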


Please read "Why is using a shell loop to process text considered bad practice?"
– αғsнιη, Commented Aug 31, 2019 at 8:30
