Fast Python NumPy where functionality?

It turns out that in this case a pure Python loop can be much faster than NumPy boolean indexing (or calls to np.where).

Consider the following alternatives:

import collections

import numpy as np

shape = (2600, 5200)
# shape = (26, 52)  # small shape for quick testing
emiss_data = np.random.random(shape)
# randint excludes the upper bound, so 801 yields IDs in 1..800
obj_data = np.random.randint(1, 801, size=shape)
UNIQ_IDS = np.unique(obj_data)

def using_where():
    # Local aliases; `max` shadows the builtin inside this function only.
    max = np.max
    where = np.where
    MAX_EMISS = [max(emiss_data[where(obj_data == i)]) for i in UNIQ_IDS]
    return MAX_EMISS

def using_index():
    # Same as using_where, but indexes with the boolean mask directly.
    max = np.max
    MAX_EMISS = [max(emiss_data[obj_data == i]) for i in UNIQ_IDS]
    return MAX_EMISS

def using_max():
    # Same as using_index, but calls the ndarray.max method.
    MAX_EMISS = [(emiss_data[obj_data == i]).max() for i in UNIQ_IDS]
    return MAX_EMISS

def using_loop():
    # Single pass in pure Python: bucket every value under its object ID.
    result = collections.defaultdict(list)
    for val, idx in zip(emiss_data.ravel(), obj_data.ravel()):
        result[idx].append(val)
    return [max(result[idx]) for idx in UNIQ_IDS]

def using_sort():
    # Map each ID to its position in UNIQ_IDS, then sort positions by group.
    uind = np.digitize(obj_data.ravel(), UNIQ_IDS) - 1
    vals = uind.argsort()
    count = np.bincount(uind)
    start = 0
    end = 0
    out = np.empty(count.shape[0])
    for ind, x in np.ndenumerate(count):
        end += x
        # vals[start:end] holds the flat indices of one group
        out[ind] = np.max(np.take(emiss_data, vals[start:end]))
        start += x
    return out

def using_split():
    # Like using_sort, but lets np.split cut the sorted indices into groups.
    uind = np.digitize(obj_data.ravel(), UNIQ_IDS) - 1
    vals = uind.argsort()
    count = np.bincount(uind)
    return [np.take(emiss_data, item).max()
            for item in np.split(vals, count.cumsum())[:-1]]

# using_sort returns an ndarray, so compare with array_equal rather than ==.
expected = using_where()
for func in (using_index, using_max, using_loop, using_sort, using_split):
    assert np.array_equal(expected, func())
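
To make the task concrete, here is a minimal sketch on toy data. The arrays `emiss` and `ids` and the helper names `group_max_loop` and `group_max_mask` are made up for illustration and are not part of the benchmark. It contrasts the single-pass bucketing idea with the one-mask-per-ID idea:

```python
import collections

import numpy as np

# Hypothetical toy inputs (not from the benchmark): 2x3 arrays, three object IDs.
emiss = np.array([[0.1, 0.7, 0.3],
                  [0.9, 0.2, 0.5]])
ids = np.array([[1, 2, 1],
                [3, 2, 3]])

def group_max_loop(values, labels):
    """Single pass: bucket each value under its label, then take each max."""
    buckets = collections.defaultdict(list)
    for val, label in zip(values.ravel().tolist(), labels.ravel().tolist()):
        buckets[label].append(val)
    return {label: max(vals) for label, vals in sorted(buckets.items())}

def group_max_mask(values, labels):
    """One boolean mask per label: rescans the whole array for every ID."""
    return {int(label): float(values[labels == label].max())
            for label in np.unique(labels)}

loop_result = group_max_loop(emiss, ids)
mask_result = group_max_mask(emiss, ids)
print(loop_result)  # {1: 0.3, 2: 0.7, 3: 0.9}
```

Both helpers return the same mapping; they differ only in how many times the input arrays are traversed.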

Here are the benchmarks with shape = (2600,5200):

In [57]: %timeit using_loop()
1 loops, best of 3: 9.15 s per loop

In [90]: %timeit using_sort()
1 loops, best of 3: 9.33 s per loop

In [91]: %timeit using_split()
1 loops, best of 3: 9.33 s per loop

In [61]: %timeit using_index()
1 loops, best of 3: 63.2 s per loop

In [62]: %timeit using_max()
1 loops, best of 3: 64.4 s per loop

In [58]: %timeit using_where()
1 loops, best of 3: 112 s per loop

So using_loop (pure Python) turns out to be more than 11x faster than using_where.
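
For reference, the relative slowdowns can be worked out directly from the reported timings. This is only arithmetic on the numbers quoted above; the dictionary name `timings` is an illustrative choice:

```python
# Best-of-3 timings in seconds, copied from the benchmark output above.
timings = {
    "using_loop": 9.15,
    "using_sort": 9.33,
    "using_split": 9.33,
    "using_index": 63.2,
    "using_max": 64.4,
    "using_where": 112.0,
}

baseline = timings["using_loop"]
# Ratio of each timing to the pure Python loop, rounded to one decimal place.
slowdown = {name: round(t / baseline, 1) for name, t in timings.items()}

for name, ratio in slowdown.items():
    print(f"{name:12s} {ratio:5.1f}x")
```

On these numbers using_where comes out at roughly 12.2x the loop's time, and using_index and using_max at roughly 7x.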

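I'm not entirely sure why pure Python is faster than NumPy here. My guess is that the pure Python version zips (yes, pun intended) through both arrays once. It takes advantage of the fact that, despite all the fancy indexing, we really only want to visit each value once. The indexing variants, by contrast, build a full boolean mask over obj_data for every unique ID, so they scan the array once per ID. The loop thus sidesteps having to work out, ID by ID, which group each value in emiss_data belongs to. But this is just vague speculation. I didn't know it would be faster until I benchmarked it.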
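
If the single-pass explanation is right, the same idea can also be expressed in NumPy with one sort followed by one segmented reduction. The following is a sketch only. It was not part of the benchmark above and is untimed here, and the function name `group_max_reduceat` and the small test arrays are assumptions for illustration. It assumes non-empty inputs.

```python
import numpy as np

def group_max_reduceat(values, labels):
    """Group-wise max using one sort and one segmented reduction."""
    flat_vals = values.ravel()
    flat_labels = labels.ravel()
    # Sort once so that equal labels become contiguous runs.
    order = np.argsort(flat_labels, kind="stable")
    sorted_labels = flat_labels[order]
    sorted_vals = flat_vals[order]
    # A run starts at index 0 and wherever the label changes.
    is_start = np.concatenate(([True], sorted_labels[1:] != sorted_labels[:-1]))
    starts = np.flatnonzero(is_start)
    # reduceat takes the max over each slice [starts[k], starts[k+1]).
    return sorted_labels[starts], np.maximum.reduceat(sorted_vals, starts)

# Small hypothetical inputs, shaped like the commented-out test shape.
rng = np.random.default_rng(0)
shape = (26, 52)
emiss_small = rng.random(shape)
obj_small = rng.integers(1, 9, size=shape)

uniq, maxima = group_max_reduceat(emiss_small, obj_small)
# Cross-check against the straightforward mask-per-ID approach.
expected = np.array([emiss_small[obj_small == i].max() for i in uniq])
```

The unique labels come back in ascending order, matching np.unique, so the result lines up with UNIQ_IDS-style output. Whether it beats using_loop would need to be measured.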

python 2022/1/1 18:35:26, 217 views
