Read subprocess stdout line by line

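My Python script uses subprocess to call a very noisy Linux utility. I want to store all of the output in a log file and show some of it to the user. I thought the following code would work, but the output does not show up in my application until the utility has produced a significant amount of output.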

#fake_utility.py, just generates lots of output over time
import time
i = 0
while True:
   print hex(i)*512
   i += 1
   time.sleep(0.5)

#filters output
import subprocess
proc = subprocess.Popen(['python','fake_utility.py'],stdout=subprocess.PIPE)
for line in proc.stdout:
   #the real code does filtering here
   print "test:", line.rstrip()

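The behaviour I really want is for the filter script to print each line as it is received from the subprocess. A bit like what tee does, but in Python code.

What am I missing? Is this even possible?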

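Update:

If sys.stdout.flush() is added to fake_utility.py, the code has the desired behaviour in Python 3.1. I am using Python 2.6. You would think that using proc.stdout.xreadlines() would work the same way as in py3k, but it does not.

Update 2:

Here is the minimal working code.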

#fake_utility.py, just generates lots of output over time
import sys, time
for i in range(10):
   print i
   sys.stdout.flush()
   time.sleep(0.5)

#display output line by line
import subprocess
proc = subprocess.Popen(['python','fake_utility.py'],stdout=subprocess.PIPE)
#works in python 3.0+
#for line in proc.stdout:
for line in iter(proc.stdout.readline,''):
   print line.rstrip()

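Current answer

Python 3.5 added the run() function to the subprocess module; it returns a CompletedProcess object (call() is older and returns only the exit code). With capture_output=True and text=True, both available since Python 3.7, you can then use proc.stdout.splitlines(). Note that run() returns only once the command has finished, so the lines are processed after the fact rather than as they are produced:

proc = subprocess.run(command, shell=True, capture_output=True, text=True, check=True)
for line in proc.stdout.splitlines():
    print("stdout:", line)

See also: How to Execute Shell Commands in Python Using the Subprocess Run Method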
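
Since run() hands back the output only after the command has exited, here is a minimal sketch of a streaming counterpart for comparison, assuming Python 3.7+. The names stream_lines and CHILD_CODE are made up for this illustration, and the child is a small inline Python program standing in for the noisy utility.

```python
import subprocess
import sys

# Hypothetical stand-in for the noisy utility: prints three lines, flushing each one.
CHILD_CODE = "import time\nfor i in range(3):\n    print(i, flush=True)\n    time.sleep(0.1)\n"


def stream_lines(cmd):
    """Yield each line of cmd's stdout as the child produces it."""
    # text=True decodes the bytes to str; bufsize=1 asks for line buffering
    # on the parent's side of the pipe.
    with subprocess.Popen(cmd, stdout=subprocess.PIPE, text=True, bufsize=1) as proc:
        for line in proc.stdout:
            yield line.rstrip("\n")


if __name__ == "__main__":
    for line in stream_lines([sys.executable, "-c", CHILD_CODE]):
        print("stdout:", line)
```

The parent can only see a line once the child has actually written it to the pipe, so the child still has to flush (as the stand-in does) or run unbuffered.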

2020-03-22 09:04:20

Other answers

A bit late to the party, but I was surprised not to see what I think is the simplest solution here:

import io
import subprocess

proc = subprocess.Popen(["prog", "arg"], stdout=subprocess.PIPE)
for line in io.TextIOWrapper(proc.stdout, encoding="utf-8"):  # or another encoding
    # do something with line, e.g. print it
    print(line.rstrip())

(This requires Python 3.)
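
Building on the same TextIOWrapper loop, the tee-like behaviour asked for in the question could be sketched roughly as follows. This is only an illustration under assumptions: tee_filtered, the log file name, the keep predicate and the inline child program are all invented here and are not part of the original answer.

```python
import io
import os
import subprocess
import sys
import tempfile


def tee_filtered(cmd, log_path, keep):
    """Log every output line of cmd to log_path; print and return the lines keep() accepts."""
    shown = []
    proc = subprocess.Popen(cmd, stdout=subprocess.PIPE)
    with open(log_path, "w", encoding="utf-8") as log:
        for line in io.TextIOWrapper(proc.stdout, encoding="utf-8"):
            log.write(line)              # the log file receives everything
            if keep(line):               # the user sees only the selected lines
                print("test:", line.rstrip())
                shown.append(line.rstrip())
    proc.wait()
    return shown


# Hypothetical child: prints hex(0) .. hex(5), flushing after each line.
child = [sys.executable, "-c", "for i in range(6): print(hex(i), flush=True)"]
log_path = os.path.join(tempfile.mkdtemp(), "utility.log")
# Show only the even values; everything still goes to the log.
shown = tee_filtered(child, log_path, keep=lambda line: int(line, 16) % 2 == 0)
```

Passing the filter in as a function keeps the logging and the display decision separate, so the real filtering code can be swapped in without touching the read loop.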

2016-01-22 03:56:27

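I tried this with Python 3 and it worked (source).

When you use Popen to spawn the new process, you tell the operating system to pipe the child's stdout so that the parent process can read it. Here, stderr=subprocess.STDOUT additionally sends the child's stderr into that same pipe.

In output_reader we read each line of the child's stdout by wrapping it in an iterator that yields the child's output line by line, whenever a new line is ready.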

import subprocess
import threading
import time


def output_reader(proc):
    # readline() returns b'' at end of file, which ends the iteration
    for line in iter(proc.stdout.readline, b''):
        print('got line: {0}'.format(line.decode('utf-8')), end='')


def main():
    proc = subprocess.Popen(['python', 'fake_utility.py'],
                            stdout=subprocess.PIPE,
                            stderr=subprocess.STDOUT)

    t = threading.Thread(target=output_reader, args=(proc,))
    t.start()

    try:
        time.sleep(0.2)
        i = 0
        # the main thread does its own work while the reader thread prints;
        # this loop runs until it is interrupted (e.g. with Ctrl-C)
        while True:
            print(hex(i)*512)
            i += 1
            time.sleep(0.5)
    finally:
        proc.terminate()
        try:
            proc.wait(timeout=0.2)
            print('== subprocess exited with rc =', proc.returncode)
        except subprocess.TimeoutExpired:
            print('subprocess did not terminate in time')
    t.join()


if __name__ == '__main__':
    main()
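
A common variation on the reader-thread idea, sketched here under assumptions rather than taken from the answer, is to have the thread push lines onto a queue.Queue so that the main thread decides when to handle them. The names enqueue_output, collect_lines and CHILD_CODE are invented for this example.

```python
import queue
import subprocess
import sys
import threading

# Hypothetical child: prints three lines, flushing each one.
CHILD_CODE = "for i in range(3): print(i, flush=True)"


def enqueue_output(pipe, q):
    """Reader thread: put each line on the queue, then None as an end marker."""
    for line in iter(pipe.readline, b""):
        q.put(line)
    q.put(None)


def collect_lines(cmd):
    """Run cmd and return its output lines, polling the queue from the main thread."""
    proc = subprocess.Popen(cmd, stdout=subprocess.PIPE, stderr=subprocess.STDOUT)
    q = queue.Queue()
    t = threading.Thread(target=enqueue_output, args=(proc.stdout, q), daemon=True)
    t.start()
    lines = []
    while True:
        try:
            line = q.get(timeout=0.1)
        except queue.Empty:
            continue  # nothing yet; the main thread could do other work here
        if line is None:  # end marker: the child closed its stdout
            break
        lines.append(line.decode("utf-8").rstrip())
    t.join()
    proc.wait()
    return lines


if __name__ == "__main__":
    print(collect_lines([sys.executable, "-c", CHILD_CODE]))
```

The timeout on q.get() is what keeps the main thread from blocking indefinitely on a quiet child.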

2018-01-21 12:00:42

I think the problem is with the statement for line in proc.stdout, which reads the entire input before iterating over it. The solution is to use readline() instead:

#filters output
import subprocess
proc = subprocess.Popen(['python','fake_utility.py'],stdout=subprocess.PIPE)
while True:
  line = proc.stdout.readline()
  if not line:
    break
  #the real code does filtering here
  print "test:", line.rstrip()

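Of course, you still have to deal with the subprocess's own buffering.

Note: according to the documentation, the solution with an iterator should be equivalent to using readline(), apart from the read-ahead buffer, but (or exactly because of this) the proposed change did produce different results for me (Python 2.5 on Windows XP).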
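
When the child happens to be a Python script, one way to deal with its buffering without editing the script is to start it with the interpreter's -u switch (setting the PYTHONUNBUFFERED environment variable has the same effect). The following is a minimal Python 3 sketch of that idea; read_unbuffered and CHILD_CODE are names invented for the example, and other utilities would need their own mechanism.

```python
import subprocess
import sys

# Hypothetical stand-in for fake_utility.py; note that it never calls flush().
CHILD_CODE = "import time\nfor i in range(3):\n    print(i)\n    time.sleep(0.1)\n"


def read_unbuffered(code):
    """Run Python source in a child started with -u and collect its lines."""
    # -u makes the child's stdout unbuffered, so each printed line reaches
    # the pipe right away instead of waiting in a block buffer.
    proc = subprocess.Popen([sys.executable, "-u", "-c", code],
                            stdout=subprocess.PIPE)
    lines = []
    for raw in iter(proc.stdout.readline, b""):
        lines.append(raw.decode("utf-8").rstrip())
        print("test:", lines[-1])
    proc.wait()
    return lines


if __name__ == "__main__":
    read_unbuffered(CHILD_CODE)
```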

2010-05-11 18:48:31

The following modification of Rômulo's answer works for both Python 2 and 3 (2.7.12 and 3.6.1):

import os
import subprocess

process = subprocess.Popen(command, stdout=subprocess.PIPE)
while True:
  line = process.stdout.readline()
  if line:  # an empty read ('' on Python 2, b'' on Python 3) means EOF
    os.write(1, line)
  else:
    break

2017-04-02 17:14:13
