如何读取文件的前N行?

我们有一个很大的原始数据文件，我们想把它修剪成指定的大小。

如何在python中获取文本文件的前N行?所使用的操作系统对实现有任何影响吗?

当前回答

我想通过读取整个文件来处理小于n行的文件

def head(filename: str, n: int):
    try:
        with open(filename) as f:
            head_lines = [next(f).rstrip() for x in range(n)]
    except StopIteration:
        with open(filename) as f:
            head_lines = f.read().splitlines()
    return head_lines

这要归功于约翰·拉·鲁伊和伊莲·伊利耶夫。使用异常句柄函数以获得最佳性能

修改1:感谢FrankM的反馈，处理文件存在和读取权限我们可以进一步增加

import errno
import os

def head(filename: str, n: int):
    if not os.path.isfile(filename):
        raise FileNotFoundError(errno.ENOENT, os.strerror(errno.ENOENT), filename)  
    if not os.access(filename, os.R_OK):
        raise PermissionError(errno.EACCES, os.strerror(errno.EACCES), filename)     
   
    try:
        with open(filename) as f:
            head_lines = [next(f).rstrip() for x in range(n)]
    except StopIteration:
        with open(filename) as f:
            head_lines = f.read().splitlines()
    return head_lines

您可以使用第二个版本，也可以使用第一个版本，稍后再处理文件异常。从性能的角度来看，检查是快速的，而且大部分是免费的

2021-07-08 22:07:39

其他回答

如果你想快速读取第一行并且不关心性能，你可以使用.readlines()返回列表对象，然后对列表进行切片。

例如，前5行:

with open("pathofmyfileandfileandname") as myfile:
    firstNlines=myfile.readlines()[0:5] #put here the interval you want

注意:整个文件是读取的，所以不是最好的从性能的角度来看，但它是易于使用，快速编写和易于记忆，所以如果你只是想执行一些一次性计算非常方便

print firstNlines

与其他答案相比，一个优点是可以轻松地选择行范围，例如跳过前10行[10:30]或最后10行[:-10]或只选择偶数行[::2]。

2013-12-07 12:59:02

这对我很有效

f = open("history_export.csv", "r")
line= 5
for x in range(line):
    a = f.readline()
    print(a)

2019-08-23 19:18:43

我所做的就是用熊猫形来称呼N行。我认为性能不是最好的，但是举个例子，如果N=1000:

import pandas as pd
yourfile = pd.read_csv('path/to/your/file.csv',nrows=1000)

2017-04-11 14:54:59

有一个简单的方法来获取前10行:

with open('fileName.txt', mode = 'r') as file:
    list = [line.rstrip('\n') for line in file][:10]
    print(list)

2023-01-06 08:46:35

N = 10
with open("file.txt", "a") as file:  # the a opens it in append mode
    for i in range(N):
        line = next(file).strip()
        print(line)

2009-11-20 02:04:36

如何读取文件的前N行?

推荐文章

最新文章

标签