最有效的方法映射函数在numpy数组

在numpy数组上映射函数的最有效方法是什么?我目前正在做:

import numpy as np 

x = np.array([1, 2, 3, 4, 5])

# Obtain array of square of each element in x
squarer = lambda t: t ** 2
squares = np.array([squarer(xi) for xi in x])

然而，这可能非常低效，因为我在将新数组转换回numpy数组之前，使用列表推导式将其构造为Python列表。我们能做得更好吗?

当前回答

使用numpy.fromfunction(function, shape， **kwargs)

看到“https://docs.scipy.org/doc/numpy/reference/generated/numpy.fromfunction.html”

2019-12-06 03:13:49

其他回答

就像在这篇文章中提到的，像这样使用生成器表达式:

numpy.fromiter((<some_func>(x) for x in <something>),<dtype>,<size of something>)

2016-02-05 02:22:52

似乎没有人提到在numpy包中生成ufunc的内置工厂方法:np.frompyfunc，我已经对np进行了测试。矢量化，并且比它的表现好大约20~30%。当然，它不能像规定的C代码或numba(我没有测试过)那样执行，但它是比np.vectorize更好的选择

f = lambda x, y: x * y
f_arr = np.frompyfunc(f, 2, 1)
vf = np.vectorize(f)
arr = np.linspace(0, 1, 10000)

%timeit f_arr(arr, arr) # 307ms
%timeit vf(arr, arr) # 450ms

我也测试了更大的样本，改进是成比例的。请在这里查看文档

2019-05-15 21:41:52

使用numpy.vectorize怎么样?

import numpy as np
x = np.array([1, 2, 3, 4, 5])
squarer = lambda t: t ** 2
vfunc = np.vectorize(squarer)
vfunc(x)
# Output : array([ 1,  4,  9, 16, 25])

2016-02-05 02:29:37

我已经测试了所有建议的方法加上np。数组(list(map(f, x)))和perfplot(我的一个小项目)。

消息#1:如果可以使用numpy的本机函数，就使用它。

如果你试图向量化的函数已经被向量化了(就像原始文章中的x**2例子)，使用它比其他任何方法都快得多(注意对数尺度):

如果你真的需要向量化，用哪个变量并不重要。

代码重现图:

import numpy as np
import perfplot
import math


def f(x):
    # return math.sqrt(x)
    return np.sqrt(x)


vf = np.vectorize(f)


def array_for(x):
    return np.array([f(xi) for xi in x])


def array_map(x):
    return np.array(list(map(f, x)))


def fromiter(x):
    return np.fromiter((f(xi) for xi in x), x.dtype)


def vectorize(x):
    return np.vectorize(f)(x)


def vectorize_without_init(x):
    return vf(x)


b = perfplot.bench(
    setup=np.random.rand,
    n_range=[2 ** k for k in range(20)],
    kernels=[
        f,
        array_for,
        array_map,
        fromiter,
        vectorize,
        vectorize_without_init,
    ],
    xlabel="len(x)",
)
b.save("out1.svg")
b.show()

2017-09-28 13:28:35

编辑:原来的答案是误导性的，np。SQRT直接应用于数组，开销很小。

在多维情况下，您希望应用一个内建函数，操作1d数组numpy。Apply_along_axis是一个不错的选择，对于numpy和scipy中更复杂的函数组合也是如此。

先前的误导性陈述:

添加方法:

def along_axis(x):
    return np.apply_along_axis(f, 0, x)

perfplot代码给出接近np.sqrt的性能结果。

2019-10-29 20:17:49

最有效的方法映射函数在numpy数组

推荐文章

最新文章

标签