如何评测Python脚本？

Project Euler和其他编码竞赛通常有最长的运行时间，或者人们吹嘘他们的特定解决方案运行速度有多快。对于Python，有时方法有些笨拙——即向__main__添加计时代码。

描述Python程序运行时间的好方法是什么？

当前回答

pprofile文件

line_profiler（此处已介绍）也启发了pprofile，其描述如下：

行粒度、线程感知确定性和统计纯python剖面仪

它提供了line_profiler的行粒度，是纯Python，可以用作独立命令或模块，甚至可以生成callgrind格式的文件，这些文件可以很容易地使用[k|q]cachegrind进行分析。

vprof公司

还有vprof，一个Python包，描述如下：

[…]为各种Python程序特性（如运行时间和内存使用）提供丰富的交互式可视化。

2015-03-02 11:36:47

其他回答

根据乔·肖（Joe Shaw）关于多线程代码无法按预期工作的回答，我认为cProfile中的runcall方法只是围绕着已配置的函数调用执行self.enable（）和self.disable（）调用，因此您可以简单地自己执行，并在对现有代码的干扰最小的情况下使用任何代码。

2011-11-09 12:59:04

还值得一提的是GUI cProfile转储查看器RunSnakeRun。它允许您排序和选择，从而放大程序的相关部分。图片中矩形的大小与所用时间成正比。如果您将鼠标悬停在一个矩形上，它将突出显示表中的调用以及地图上的任何位置。双击矩形时，它会放大该部分。它将显示谁调用了该部分以及该部分调用了什么。

描述性信息非常有用。它向您显示了该位的代码，当您处理内置库调用时，该代码会很有用。它告诉要查找代码的文件和行。

还想指出，OP说的是“剖析”，但似乎他是指“时机”。请记住，程序在评测时运行速度会变慢。

2015-02-22 16:18:05

如果你想做一个累积分析器，意思是连续运行函数几次并观察结果的总和。

您可以使用此cumulative_profiler装饰器：

它是python>=3.6特定的，但您可以删除非本地的，因为它可以在旧版本上工作。

import cProfile, pstats

class _ProfileFunc:
    def __init__(self, func, sort_stats_by):
        self.func =  func
        self.profile_runs = []
        self.sort_stats_by = sort_stats_by

    def __call__(self, *args, **kwargs):
        pr = cProfile.Profile()
        pr.enable()  # this is the profiling section
        retval = self.func(*args, **kwargs)
        pr.disable()

        self.profile_runs.append(pr)
        ps = pstats.Stats(*self.profile_runs).sort_stats(self.sort_stats_by)
        return retval, ps

def cumulative_profiler(amount_of_times, sort_stats_by='time'):
    def real_decorator(function):
        def wrapper(*args, **kwargs):
            nonlocal function, amount_of_times, sort_stats_by  # for python 2.x remove this row

            profiled_func = _ProfileFunc(function, sort_stats_by)
            for i in range(amount_of_times):
                retval, ps = profiled_func(*args, **kwargs)
            ps.print_stats()
            return retval  # returns the results of the function
        return wrapper

    if callable(amount_of_times):  # incase you don't want to specify the amount of times
        func = amount_of_times  # amount_of_times is the function in here
        amount_of_times = 5  # the default amount
        return real_decorator(func)
    return real_decorator

实例

剖析函数baz

import time

@cumulative_profiler
def baz():
    time.sleep(1)
    time.sleep(2)
    return 1

baz()

baz跑了5次并打印了以下内容：

         20 function calls in 15.003 seconds

   Ordered by: internal time

   ncalls  tottime  percall  cumtime  percall filename:lineno(function)
       10   15.003    1.500   15.003    1.500 {built-in method time.sleep}
        5    0.000    0.000   15.003    3.001 <ipython-input-9-c89afe010372>:3(baz)
        5    0.000    0.000    0.000    0.000 {method 'disable' of '_lsprof.Profiler' objects}

指定次数

@cumulative_profiler(3)
def baz():
    ...

2019-09-11 19:52:11

这取决于您希望从分析中看到什么。简单的时间度量可以由（bash）给出。

time python python_prog.py

甚至“/usr/bin/time”也可以使用“--verbose”标志输出详细的度量。

为了检查每个函数给出的时间度量，并更好地了解在函数上花费的时间，可以使用python中的内置cProfile。

进入更详细的指标，如绩效，时间不是唯一的指标。您可以担心内存、线程等问题。分析选项：line_profiler是另一个通常用于逐行查找定时度量的分析器。2.memory_profiler是一个评测内存使用情况的工具。3.heapy（来自项目Guppy）描述如何使用堆中的对象。

这些是我常用的一些。但如果你想了解更多，试试看这本书这是一本非常好的书，讲述了如何从性能出发。您可以转到使用Cython和JIT（实时）编译的python的高级主题。

2017-04-19 19:42:32

当我不是服务器的根用户时，我使用lsprofcalltree.py并像这样运行我的程序：

python lsprofcalltree.py -o callgrind.1 test.py

然后我可以用任何callgrind兼容的软件打开报告，比如qcachegrind

2017-02-02 10:18:41

如何评测Python脚本？

推荐文章

最新文章

标签