Remove a specified word appeared only as the first word in a text file in terminal

Question

I am working with tweeter text data in JSON format which I have stored in a text file. I am not interested in retweets and i created a parser that could extract most of the text, but somehow some retweets also came along. So i was wondering for a quick solution for this problem, i.e. to remove the text that starts with RT.

So a text in the file looks like

`"RT ...... RT ....."`

"..." are the other words in the sentence. I would like to only remove the lines starting with the word "RT" and save it in another file. The same word RT might come in the middle of text that doesn't start with RT, such texts should not be removed. I tried with the following command, which I am not entirely sure

grep -v "RT" twitterDataset.txt > clean_RT.txt

I would really appreciate for a solution to this problem and an explanation of the code would be also helpful.

Welcome to the site. If possible, please add possibly anonymized, but "full" input examples for your question. It will make it easier for contributors to help you find the problem. — AdminBee
– AdminBee, Commented Feb 3, 2020 at 8:54
That said, did you try anchoring your regular expression to the beginning of the line, as in grep -v "^RT"? — AdminBee
– AdminBee, Commented Feb 3, 2020 at 8:56
You mentioned JSON, but I see no JSON document in your question. There exists tools for working with JSON data in the terminal or in scripts. These tools makes it possible to parse, extract or modify JSON data in a safe and robust way (note that your grep would also remove any key whose name contained RT). Please include a representable sample of your data. — Kusalananda
– Kusalananda ♦, Commented Feb 3, 2020 at 9:02

Romeo Ninov · Accepted Answer · 2020-02-03 09:23:40Z

If the file in question is plain text you can do something like:

grep -v "^RT" twitterDataset.txt > clean_RT.txt

This will not match lines which start with string "RT"

Stack Exchange Network

Remove a specified word appeared only as the first word in a text file in terminal

1 Answer 1

You must log in to answer this question.

Hot Network Questions

Remove a specified word appeared only as the first word in a text file in terminal

1 Answer 1

You must log in to answer this question.

Related

Hot Network Questions