6

I'd like to run a system process, intercept the output, and modify it real-time, line by line, in a Python script.

My best attempt, which waits for the process to complete before printing, is:

#!/usr/bin/env python import subprocess cmd = "waitsome.py" proc = subprocess.Popen(cmd, shell=True, bufsize=256, stdout=subprocess.PIPE) for line in proc.stdout: print ">>> " + line.rstrip() 

The script waitsome.py simply prints a line every half a second:

#!/usr/bin/env python import time from sys import stdout print "Starting" for i in range(0,20): time.sleep(0.5) print "Hello, iteration", i stdout.flush() 

Is there an easy solution to get subprocess to allow iterating over the output in real time? Do I have to use threads?

Once upon a time, I scripted in Perl, and this was a piece of cake:

open(CMD, "waitsome.py |"); while (<CMD>) { print ">>> $_"; } close(CMD); 
2

2 Answers 2

16

Looping over a file unavoidably buffers things in pretty large chunks -- a known issue with all Python 2.* implementations. It works as you intend in Python 3.1, with the final loop being slightly different:

for line in proc.stdout: print(">>> " + str(line.rstrip())) 

If upgrading to Python 3.1 is impractical (and I know it will often be!), go the other way and write the loop in an old-fashioned manner -- the following version of the loop does work as you intend in Python 2.*:

while True: line = proc.stdout.readline() if not line: break print ">>> " + line.rstrip() 
Sign up to request clarification or add additional context in comments.

3 Comments

Yep, but now that Python 3.1 is out (with a much better implementation of the whole I/O stack) there's no reason to stick with 3.0 (it was definitely a transitional version;-).
for line in iter(proc.stdout.readline, ''): print ">>>", line, could be used instead of the while loop on Python 2.
People talking about impractical Python 3 upgrade in 2009: Python 3 is bad... . Everyone else in 2024: Why would you use Python 2? Are you using legacy computers or something.
0

This whole thing can be encapsulated in an iterator as:

def subprocess_readlines(out): while True: line = out.readline() if not line: return yield line 

And called as:

for line in subprocess_readlines(proc.stdout): print ">>>", line.rstrip() 

3 Comments

Buffering in Python has changed a lot since I wrote the original question :) Yes, what took a lot of effort for python 2.5 can now be done in a few lines.
I dont understand where subprocess_readlines needs to be defined for this approach. Can you elaborate?
Ooops. I'd not added the call to subprocess_readlines(). I just updated that.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.