bash: read file line by line and sed to append

Question

I have a text file that can have X number of fields, each separated by a comma. In my script I reading line by line, checking how many fields have been populated on that line and determining how many commas i need to append to the end of that line to represent all the fields. For instance a file looks like this:

Address,nbItems,item1,item2,item3,item4,item5,item6,item7 2325988023,7,1,2,3,4,5,6,7 2327036284,5,1,2,3,4,5 2326168436,4,1,2,3,4

Should become this:

Address,nbItems,item1,item2,item3,item4,item5,item6,item7 2325988023,7,1,2,3,4,5,6,7 2327036284,5,1,2,3,4,5,, 2326168436,4,1,2,3,4,,,

My script below works, but it seems terribly inefficient. Is it the reading line by line that has a hard time on large files? Is it the sed that causes the slowdown? Better way to do this?

#!/bin/bash lineNum=0 numFields=`head -1 File.txt | egrep -o "," | wc -l` cat File.txt | while read LINE do lineNum=`expr 1 + $lineNum` num=`echo $LINE | egrep -o "," | wc -l` needed=$(( numFields - num )) for (( i=0 ; i < $needed ; i++ )) do sed -i "${lineNum}s/$/,/" File.txt done done

Scrutinizer · Accepted Answer · 2013-03-01 16:14:10Z

11

This type of thing is usually best done with a language like awk, for example:

awk 'NR==1{n=NF}{$n=$n}1' FS=, OFS=, file

edited Mar 1, 2013 at 16:14

answered Mar 1, 2013 at 16:08

Scrutinizer

9,9661 gold badge24 silver badges23 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

ssbsts Over a year ago

Wow, thank you so much. Not only does it achieve my goal perfectly, it is ridiculously simple and fast!

Akshay Hegde Over a year ago

+1 Scrutinizer elegant solution as always, -- Akshay

chepner · Accepted Answer · 2013-03-02 00:00:54Z

0

Here's a full bash solution.

( IFS="," read hdrLine echo "$hdrLine" read -a header <<< "$hdrLine" numFields="${#header[@]}" while read -a line; do pad=${#line[@]} while (( pad < numFields )); do line[pad++]= done echo "${line[*]}" done ) < File.txt > newFile.txt mv newFile.txt File.txt

The awk solution is far better; this is best viewed as a bash demo.

edited Mar 2, 2013 at 0:00

answered Mar 1, 2013 at 16:08

chepner

538k77 gold badges594 silver badges746 bronze badges

2 Comments

ssbsts Over a year ago

thanks for you input, however it doesn't actually achieve my goal. From what i can tell it only appends a single comma to every line, even when not necessary, i.e. all fields are already accounted for.

chepner Over a year ago

That's what I get for not testing first. I couldn't have sworn I read recently that the array would be filled with the intermediary slots if you assigned to a larger index. I wonder what I'm thinking of, because it sure does not appear to be bash! I'll leave this answer for a bit to see if I can salvage it; otherwise I'll delete.

Collectives™ on Stack Overflow

bash: read file line by line and sed to append

2 Answers 2

2 Comments

2 Comments

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

2 Comments

2 Comments

Related