如何从文件中读取特定的行(按行号)?

我使用for循环来读取文件，但我只想读取特定的行，比如第26行和第30行。是否有任何内置功能来实现这一点?

当前回答

您可以执行seek()调用，将读头定位到文件中的指定字节。这对您没有帮助，除非您确切地知道在要读取的行之前文件中写入了多少字节(字符)。也许你的文件是严格格式化的(每一行是X个字节?)或者，如果你真的想提高速度，你可以自己计算字符的数量(记得包括不可见的字符，比如换行符)。

否则，您必须在您想要的行之前阅读每一行，就像这里已经提出的许多解决方案之一一样。

2010-01-17 17:26:08

其他回答

为了完整起见，这里还有一个选项。

让我们从python文档中的定义开始:

通常包含序列的一部分的对象。slice使用下标符号[]创建，当给出几个数字时，数字之间使用冒号，例如variable_name[1:3:5]。括号(下标)表示法在内部使用切片对象(或在旧版本中使用__getslice__()和__setslice__())。

虽然slice表示法一般不直接适用于迭代器，但itertools包包含一个替换函数:

from itertools import islice

# print the 100th line
with open('the_file') as lines:
    for line in islice(lines, 99, 100):
        print line

# print each third line until 100
with open('the_file') as lines:
    for line in islice(lines, 0, 100, 3):
        print line

该函数的另一个优点是，它直到结束才读取迭代器。所以你可以做更复杂的事情:

with open('the_file') as lines:
    # print the first 100 lines
    for line in islice(lines, 100):
        print line

    # then skip the next 5
    for line in islice(lines, 5):
        pass

    # print the rest
    for line in lines:
        print line

为了回答最初的问题:

# how to read lines #26 and #30
In [365]: list(islice(xrange(1,100), 25, 30, 4))
Out[365]: [26, 30]

2014-11-24 15:49:56

@OP，你可以使用枚举

for n,line in enumerate(open("file")):
    if n+1 in [26,30]: # or n in [25,29] 
       print line.rstrip()

2010-01-18 00:32:05

如果你不介意导入，那么fileinput确实是你需要的(这是你可以读取当前行的行号)

2010-01-17 17:21:58

不要使用阅读线!

我的解决方案是:


with open(filename) as f:
    specify = [26, 30]
    results = list(
        map(lambda line: line[1],
            filter(lambda line: line[0] in specify,
                   enumerate(f))
            )
    )

对6.5G文件进行如下测试:

import time

filename = 'a.txt'
start = time.time()
with open(filename, 'w') as f:
    for i in range(10_000_000):
        f.write(f'{str(i)*100}\n')       
end1 = time.time()

with open(filename) as f:
    specify = [26, 30]
    results = list(
        map(lambda line: line[1],
            filter(lambda line: line[0] in specify,
                   enumerate(f))
            )
    )
end2 = time.time()
print(f'write time: {end1-start}')
print(f'read time: {end2-end1}')
# write time: 14.38945460319519
# read time: 8.380386352539062

2022-04-07 13:01:15

你可以用已经有人提到过的语法很简单地做到这一点，但这是迄今为止最简单的方法:

inputFile = open("lineNumbers.txt", "r")
lines = inputFile.readlines()
print (lines[0])
print (lines[2])

2016-02-21 20:48:12

如何从文件中读取特定的行(按行号)?

推荐文章

最新文章

标签