如何在Python中连接文本文件?

我有一个20个文件名的列表，比如['file1.txt'， 'file2.txt'，…]。我想写一个Python脚本将这些文件连接到一个新文件中。我可以通过f = open(…)打开每个文件，通过调用f.r edline()逐行读取，并将每行写入新文件。这对我来说似乎不是很“优雅”，尤其是我必须一行一行地读/写的部分。

在Python中是否有更“优雅”的方式来做到这一点?

当前回答

我不知道什么叫优雅，但这招管用:

    import glob
    import os
    for f in glob.glob("file*.txt"):
         os.system("cat "+f+" >> OutFile.txt")

2014-06-03 01:39:23

其他回答

outfile.write(infile.read()) # time: 2.1085190773010254s
shutil.copyfileobj(fd, wfd, 1024*1024*10) # time: 0.60599684715271s

一个简单的基准测试表明，shutil的性能更好。

2018-04-26 08:10:36

UNIX命令有什么问题?(假设你不是在Windows上工作):

Ls | xargs cat | tee output.txt完成这项工作(如果你想要，你可以从python用subprocess调用它)

2012-11-28 20:00:34

@inspectorG4dget答案的替代答案(到2016年3月29日为止的最佳答案)。我测试了3个436MB的文件。

@inspectorG4dget答案:162秒

解决方案:125秒

from subprocess import Popen
filenames = ['file1.txt', 'file2.txt', 'file3.txt']
fbatch = open('batch.bat','w')
str ="type "
for f in filenames:
    str+= f + " "
fbatch.write(str + " > file4results.txt")
fbatch.close()
p = Popen("batch.bat", cwd=r"Drive:\Path\to\folder")
stdout, stderr = p.communicate()

其思想是创建一个批处理文件并执行它，利用“旧的好技术”。它是半python，但运行速度更快。适用于windows。

2016-03-29 03:53:56

如果目录中有很多文件，那么glob2可能是生成文件名列表的更好选择，而不是手工编写它们。

import glob2

filenames = glob2.glob('*.txt')  # list of all .txt files in the directory

with open('outfile.txt', 'w') as f:
    for file in filenames:
        with open(file) as infile:
            f.write(infile.read()+'\n')

2017-05-06 09:45:00

这样就行了

对于大文件:

filenames = ['file1.txt', 'file2.txt', ...]
with open('path/to/output/file', 'w') as outfile:
    for fname in filenames:
        with open(fname) as infile:
            for line in infile:
                outfile.write(line)

对于小文件:

filenames = ['file1.txt', 'file2.txt', ...]
with open('path/to/output/file', 'w') as outfile:
    for fname in filenames:
        with open(fname) as infile:
            outfile.write(infile.read())

我还想到了另一个有趣的问题:

filenames = ['file1.txt', 'file2.txt', ...]
with open('path/to/output/file', 'w') as outfile:
    for line in itertools.chain.from_iterable(itertools.imap(open, filnames)):
        outfile.write(line)

遗憾的是，最后一个方法留下了一些打开的文件描述符，无论如何GC都应该处理这些描述符。我只是觉得很有趣

2012-11-28 19:57:47

如何在Python中连接文本文件?

推荐文章

最新文章

标签