通过严格比较对函数进行向量化，以在 2D 数组中查找局部最小值和最大值

2024-01-14

我正在尝试提高返回输入 2D NumPy 数组的局部最小值和最大值的函数的性能。该函数按预期工作，但对于我的用例来说太慢了。我想知道是否可以创建此函数的矢量化版本以提高其性能。

Here is the formal definition for defining whether an element is a local minima (maxima):

where

A=[a_m,n]是二维矩阵，m and n分别是行和列，w_h and w_w分别是滑动窗口的高度和宽度。

我尝试过使用skimage.morphology.local_minimum and skimage.morphology.local_maxima，但如果某个元素的值小于或等于（大于或等于）其所有邻居，则他们将其视为最小值（最大值）。
就我而言，如果一个元素严格小于（大于）其所有邻居，我需要该函数将其视为最小（最大）。

当前的实现使用滑动窗口方法numpy.lib.stride_tricks.sliding_window_view，但函数不一定非要使用这种方式。

这是我当前的实现：

import numpy as np

def get_local_extrema(array, window_size=(3, 3)):
    # Check if the window size is valid
    if not all(size % 2 == 1 and size >= 3 for size in window_size):
        raise ValueError("Window size must be odd and >= 3 in both dimensions.")

    # Create a map to store the local minima and maxima
    minima_map = np.zeros_like(array)
    maxima_map = np.zeros_like(array)

    # Save the shape and dtype of the original array for later
    original_size = array.shape
    original_dtype = array.dtype
    # Get the halved window size
    half_window_size = tuple(size // 2 for size in window_size)

    # Pad the array with NaN values to handle the edge cases
    padded_array = np.pad(array.astype(float),
                         tuple((size, size) for size in half_window_size),
                         mode='constant', constant_values=np.nan)

    # Generate all the sliding windows
    windows = np.lib.stride_tricks.sliding_window_view(padded_array, window_size).reshape(
        original_size[0] * original_size[1], *window_size)

    # Create a mask to ignore the central element of the window
    mask = np.ones(window_size, dtype=bool)
    mask[half_window_size] = False

    # Iterate through all the windows
    for i in range(windows.shape[0]):
        window = windows[i]
        # Get the value of the central element
        center_val = window[half_window_size]
        # Apply the mask to ignore the central element
        masked_window = window[mask]

        # Get the row and column indices of the central element
        row = i // original_size[1]
        col = i % original_size[1]

        # Check if the central element is a local minimum or maximum
        if center_val > np.nanmax(masked_window):
            maxima_map[row, col] = center_val
        elif center_val < np.nanmin(masked_window):
            minima_map[row, col] = center_val

    return minima_map.astype(original_dtype), maxima_map.astype(original_dtype)

a = np.array([[8, 8, 4, 1, 5, 2, 6, 3],
              [6, 3, 2, 3, 7, 3, 9, 3],
              [7, 8, 3, 2, 1, 4, 3, 7],
              [4, 1, 2, 4, 3, 5, 7, 8],
              [6, 4, 2, 1, 2, 5, 3, 4],
              [1, 3, 7, 9, 9, 8, 7, 8],
              [9, 2, 6, 7, 6, 8, 7, 7],
              [8, 2, 1, 9, 7, 9, 1, 1]])

(minima, maxima) = get_local_extrema(a)

print(minima)
# [[0 0 0 1 0 2 0 0]
#  [0 0 0 0 0 0 0 0]
#  [0 0 0 0 1 0 0 0]
#  [0 1 0 0 0 0 0 0]
#  [0 0 0 1 0 0 3 0]
#  [1 0 0 0 0 0 0 0]
#  [0 0 0 0 6 0 0 0]
#  [0 0 1 0 0 0 0 0]]

print(maxima)
# [[0 0 0 0 0 0 0 0]
#  [0 0 0 0 7 0 9 0]
#  [0 8 0 0 0 0 0 0]
#  [0 0 0 4 0 0 0 8]
#  [6 0 0 0 0 0 0 0]
#  [0 0 0 0 0 0 0 8]
#  [9 0 0 0 0 0 0 0]
#  [0 0 0 9 0 9 0 0]]

expected_minima = np.array([[0, 0, 0, 1, 0, 2, 0, 0],
                            [0, 0, 0, 0, 0, 0, 0, 0],
                            [0, 0, 0, 0, 1, 0, 0, 0],
                            [0, 1, 0, 0, 0, 0, 0, 0],
                            [0, 0, 0, 1, 0, 0, 3, 0],
                            [1, 0, 0, 0, 0, 0, 0, 0],
                            [0, 0, 0, 0, 6, 0, 0, 0],
                            [0, 0, 1, 0, 0, 0, 0, 0]])

expected_maxima = np.array([[0, 0, 0, 0, 0, 0, 0, 0],
                            [0, 0, 0, 0, 7, 0, 9, 0],
                            [0, 8, 0, 0, 0, 0, 0, 0],
                            [0, 0, 0, 4, 0, 0, 0, 8],
                            [6, 0, 0, 0, 0, 0, 0, 0],
                            [0, 0, 0, 0, 0, 0, 0, 8],
                            [9, 0, 0, 0, 0, 0, 0, 0],
                            [0, 0, 0, 9, 0, 9, 0, 0]])

np.testing.assert_array_equal(minima, expected_minima)
np.testing.assert_array_equal(maxima, expected_maxima)

print('All tests passed')

任何有关如何向量化此函数的建议或想法将不胜感激。

提前致谢！

EDIT #1
在玩了一下 NumPy 后，如果我理解正确的话，我设法让以下代码几乎以完全矢量化的方式工作：

def get_local_extrema_2(img):
  minima_map = np.zeros_like(img)
  maxima_map = np.zeros_like(img)

  minima_map[1:-1, 1:-1] = np.where(
    (a[1:-1, 1:-1] < a[:-2, 1:-1]) &
    (a[1:-1, 1:-1] < a[2:, 1:-1]) &
    (a[1:-1, 1:-1] < a[1:-1, :-2]) &
    (a[1:-1, 1:-1] < a[1:-1, 2:]) &
    (a[1:-1, 1:-1] < a[2:, 2:]) &
    (a[1:-1, 1:-1] < a[:-2, :-2]) &
    (a[1:-1, 1:-1] < a[2:, :-2]) &
    (a[1:-1, 1:-1] < a[:-2, 2:]),
    a[1:-1, 1:-1],
    0)
  
  maxima_map[1:-1, 1:-1] = np.where(
    (a[1:-1, 1:-1] > a[:-2, 1:-1]) &
    (a[1:-1, 1:-1] > a[2:, 1:-1]) &
    (a[1:-1, 1:-1] > a[1:-1, :-2]) &
    (a[1:-1, 1:-1] > a[1:-1, 2:]) &
    (a[1:-1, 1:-1] > a[2:, 2:]) &
    (a[1:-1, 1:-1] > a[:-2, :-2]) &
    (a[1:-1, 1:-1] > a[2:, :-2]) &
    (a[1:-1, 1:-1] > a[:-2, 2:]),
    a[1:-1, 1:-1],
    0)

  return minima_map, maxima_map

get_local_extrema_2 的输出是：
最小地图：

[[0 0 0 0 0 0 0 0]
 [0 0 0 0 0 0 0 0]
 [0 0 0 0 1 0 0 0]
 [0 1 0 0 0 0 0 0]
 [0 0 0 1 0 0 3 0]
 [0 0 0 0 0 0 0 0]
 [0 0 0 0 6 0 0 0]
 [0 0 0 0 0 0 0 0]]

千里马地图：

[[0 0 0 0 0 0 0 0]
 [0 0 0 0 7 0 9 0]
 [0 8 0 0 0 0 0 0]
 [0 0 0 4 0 0 0 0]
 [0 0 0 0 0 0 0 0]
 [0 0 0 0 0 0 0 0]
 [0 0 0 0 0 0 0 0]
 [0 0 0 0 0 0 0 0]]

上述问题是未检测到边界上的最小值或最大值的像素。

EDIT #2
即使输出数组中有 1 而不是局部最小值（最大值）的值，即 0 和 1（或 False 和 True）的二维数组，也没关系。

EDIT #3
这是该函数的一个版本基于克里斯·卢恩戈 https://stackoverflow.com/users/7328782/cris-luengo's answer https://stackoverflow.com/a/75049306/5495385。请注意使用“镜像”模式（相当于 NumPy 的“反射”），这样如果最小值或最大值位于边缘，它就不会被复制到边界之外，并且会脱颖而出。这样，就不需要用矩阵的最小或最大元素填充图像。我认为这是完成此任务的最有效的方法：

import numpy as np
import scipy

def get_local_extrema_v3(image):
    footprint = np.ones((3, 3), dtype=bool)
    footprint[1, 1] = False
    minima = image * (scipy.ndimage.grey_erosion(image, footprint=footprint, mode='mirror') > image)
    maxima = image * (scipy.ndimage.grey_dilation(image, footprint=footprint, mode='mirror') < image)
    return minima, maxima

您对局部最大值的定义是有缺陷的。例如，在一维数组中[1,2,3,4,4,3,2,1]，存在局部最大值，但您的定义忽略了它。skimage.morphology.local_maxima将正确识别该局部最大值。

如果您确实需要实现您的定义，我将使用带有窗口大小的方形结构元素的膨胀（侵蚀），但不包括中心像素。原始图像中比滤波图像中更大（更小）的任何像素都将满足局部最大值（最小值）的定义。

我使用 scikit-image 实现了这一点，但发现它在图像边缘做了奇怪的事情，因此它不会检测边缘附近的局部最大值或最小值：

se = np.ones((3, 3))
se[1, 1] = 0
minima = a * (skimage.morphology.erosion(a, footprint=se) > a)
maxima = a * (skimage.morphology.dilation(a, footprint=se) < a)

使用 DIPlib （披露：我是作者）这也可以在图像边缘正常工作：

import diplib as dip

se = np.ones((3, 3), dtype=np.bool_)
se[1, 1] = False
minima = a * (dip.Erosion(a, se) > a)
maxima = a * (dip.Dilation(a, se) < a)

^{Looking at the source code for skimage.morphology.dilation, it calls scipy.ndimage.grey_dilation with the default boundary extension, which is 'reflect'. This means that every local maximum at the image edge will have a neighbor with the same value, and hence not detected as local maximum in this definition. Instead, it should use the 'constant' extension, with cval set to the minimum possible value for the data type. For example, for an uint8 input array, it should do ndi.grey_dilation(image, footprint=footprint, output=out, mode='constant', cval=0). GitHub issue https://github.com/scikit-image/scikit-image/issues/6665}

本文内容由网友自发贡献，版权归原作者所有，本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容，请联系:hwhale#tublm.com(使用前将#替换为@)

通过严格比较对函数进行向量化，以在 2D 数组中查找局部最小值和最大值的相关文章

为什么 Mypy 在 __init__ 中分配已在类主体中进行类型提示的属性时不给出键入错误？

这是我的示例 python 文件 class Person name str age int def init self name age self name name self age age p Person 5 5 但当我跑步时myp
App Engine 上的 Django 与 webapp2 [关闭]

就目前情况而言这个问题不太适合我们的问答形式我们希望答案得到事实参考资料或专业知识的支持但这个问题可能会引发辩论争论民意调查或扩展讨论如果您觉得这个问题可以改进并可能重新开放访问帮助中心 help reopen questi
Python：json_normalize pandas 系列给出 TypeError

我在 pandas 系列中有数万行像这样的 json 片段df json IDs lotId 1 Id 123456 date 2009 04 17 bidsCount 2 IDs lotId 2 Id 123456 date 2009 0
使用 Boto3 超时的 AWS Lambda 函数

我已经解决了我自己的问题但无论如何我都会发布它希望能节省其他人几个小时我在 AWS 上有一个无服务器项目使用 Python 将记录插入到 kinesis 队列中但是当我使用 boto3 client kinesis 或 put
Python，Google Places API - 给定一组纬度/经度查找附近的地点

我有一个由商店 ID 及其纬度经度组成的数据框我想迭代该数据框并使用 google api 为每个商店 ID 查找附近的关键地点例如输入 Store ID LAT LON 1 1 222 2 222 2 2 334 4 555 3
使用 Pandas 从 csv 文件读取标题信息

我有一个包含 14 行标题的数据文件在标头中有经纬度坐标和时间的元数据我目前正在使用 pandas read csv filename delimiter header 14 读取文件但这只是获取数据我似乎无法获取元数据有人知道
获取 Keras model.summary() 作为表

我在 Keras 中创建了相当大的模型我正在用 LaTeX 写一篇关于它的文章为了很好地描述 LaTeX 中的 keras 模型我想用它创建一个 LaTeX 表我可以手动实现它但我想知道是否有任何更好的方法来实现这一点我四处
无法通过 Android 应用程序访问我的笔记本电脑的本地主机

因此我在发布此内容之前做了一项研究我发现的解决方案不起作用更准确地说连接到我的笔记本电脑的 IPv4192 168 XXX XXX 没用连接到10 0 2 2 加上端口不起作用我需要测试使用 Django Rest 框架构建的
Pandas 字典键到列[重复]

这个问题在这里已经有答案了我有一个像这样的数据框 index column1 e1 u c680 5 u c681 1 u c682 2 u c57 e2 u c680 6 u c681 2 u c682 1 u c57 e3 u c68
Python在没有pandas的情况下解码excel表

我正在尝试在 python 中读取 excel 文件而不使用pandas or xlrd 我一直在尝试将结果转换为bytes to utf 8没有任何成功 xls 文件中的数据 colA colB colC spc 1D0 20190705
Flymake的临时文件可以在系统临时目录下创建吗？

我目前正在使用以下代码在 emacs 中连接 Flymake 和 Pyflakes defun flymake create temp in system tempdir filename prefix make temp file or
Snakemake：将多个输入用于具有多个子组的一个输出的规则

我有一个工作管道用于下载比对和对公共测序数据执行变体调用问题是它目前只能在每个样本的基础上工作 i e作为每个单独测序实验的样本如果我想对一组实验例如样本的生物和或技术复制执行变体调用则它不起作用我试图解决它但我无法让它
如何将 URL 添加到 Telegram Bot 的 InlineKeyboardButton

我想制作一个按钮可以从 Telegram 聊天中在浏览器中打开 URL 外部超链接目前我只开发了可点击的操作按钮 update message reply text Subscribe to us on Facebook and Te
在Python中使用pil读取tif图像时出现值错误？

我必须读取尺寸的tif图像2200 2200并输入 uint16 我将 PIL 库与 anaconda python 一起使用如下所示 from PIL import Image img Image open test tif img i
如何创建增量加载网页

我正在编写一个处理大量数据的页面它会永远持续到我的结果页面加载几乎无限因为返回的数据太大了因此我需要实现一个增量加载页面例如 url 中的页面 http docs python org http docs python org
如何在引发异常时将变量传递给异常并在异常时检索它？

现在我只有一个空白的异常类我想知道如何在引发变量时给它一个变量然后在 try except 中处理它时检索该变量 class ExampleException Exception pass 为其构造函数提供一个参数将其存储为属性然后
从给定的项目列表创建子列表

我首先要说的是以下问题不是为了家庭作业目的即使因为我几个月前就完成了软件工程师的工作无论如何今天我正在工作一位朋友向我询问了这个奇怪的排序问题我有一个包含 1000 行的列表每行代表一个数字我想创建 10 个子列表每个子列表都
tf.print() vs Python print vs tensor.eval()

看来在Tensorflow中至少有三种方法可以打印出张量的值我一直在读here https www freecodecamp org news debugging tensorflow a starter e6668ce72617 an
py2exe ImportError：没有名为的模块

我已经实现了一个名为 myUtils 的包它由文件夹 myUtils 文件组成 init py 和许多名称为 myUtils 的 py 文件该包包含在 myOtherProject py 中当我从 Eclipse 运行它们时可以找到
超过两个点的Python相对导入

是否可以使用路径中包含两个以上点的模块引用就像这个例子一样 Project structure sound init py codecs init py echo init py nix init py way1 py way2 py w

随机推荐

如何使用 Quart Python 停止将访问日志记录到 stdout

我有用 Quart python 编写的微服务我想停止登录到标准输出到目前为止我已经尝试过 app logger disabled True 和 Flask 类似的方法导入日志记录日志 logging getLogger werk
使用 SHA-256AndMGF1Padding 分解 RSA/ECB/OAEP

Java有一种模式叫做RSA ECB OAEPWithSHA 256AndMGF1Padding 那有什么意思 RFC3447 https www rfc editor org rfc rfc3447 section 7 1 2 公钥加密标
Spring Security 身份验证在我的配置中未按预期工作

我已经配置了 spring 身份验证如下所示但它没有按预期工作
fiddler可以抓取什么样的流量？

当我打开fiddler时可以捕获来自浏览器的http流量我用 net HttpWebRequest写了一个程序也可以捕获流量同样使用 python urllib2 fiddler 捕获 http 流量当我打开 fiddler 而不
即使我有错误，GCC 也不会在我的内联 asm 函数调用周围推送寄存器

我有一个修改 ecx 或任何其他寄存器的函数 C int proc int n int ret asm volatile movl 1 ecx n t mov n to ecx addl 10 ecx n t add 10 to ecx
Android：当应用程序关闭 30 秒时，通知会延迟显示并停止更新（在 OnePlus 8T 上）

谷歌有自己的时钟应用程序其中包括秒表我目前正在尝试在我的应用程序中创建一个计数计时器或者您可以将其称为秒表它将能够在后台运行当它在后台运行时我希望它也显示通知显示计时时间和停止按钮所有这些都发生在谷歌时钟应用程序中
实体框架代码优先：Configuration.cs 种子或自定义初始值设定项

我第一次使用实体框架的 Code First 风格我想设置一些默认数据我遇到的第一个方法是创建一个自定义初始化程序 https stackoverflow com questions 5655841 entity framework c
jQuery 附加功能在 Internet Explorer 8 中不起作用

这是我的代码 body append div ul li a href Add a li li a href Edit a li ul div
致命错误：未捕获异常“RedisException”，消息为“Redis 服务器消失”

我的一个应用程序突然开始出现错误 Fatal error Uncaught exception RedisException with message Redis server went away in var www slim core
ReactJS无法读取未定义的属性“绑定”[重复]

这个问题在这里已经有答案了我正在尝试通过制作一些简单的应用程序来学习reactjs 我以为我已经弄清楚了基础知识直到我偶然发现了我使用 bind 的情况我正在尝试制作一个小列表单击该列表时将删除单击的列表项它背后的逻辑尚未实现
PHP CLI：如何从 TTY 读取输入的单个字符（无需等待回车键）？

我想从 PHP 的命令行一次读取一个字符但是似乎有某种输入缓冲从某个地方阻止了这一点考虑这段代码 usr bin php 输入 foo 作为输入并按 Enter 键我得到的输出是 input foo Read from STDIN
当第一个观察值是 na 时，使用 na.locf 向前移动最后一个值，忽略第一行

我想利用na locf https www rdocumentation org packages zoo versions 1 8 0 topics na locf结转数据帧的非缺失值其中第一次观察可能为零 Problem dta lt
如何更改 WPF 进度条上的颜色

我有一个 WPF vista 风格的进度条我想在其上更改画笔我已将前景画笔设置为另一种颜色但有一种嗖嗖的动画效果其颜色仍然是默认的绿色我怎样才能改变这个为此您需要编辑项目中进度栏控件的 ControlTemplate 样式
Spring：日志记录不适用于 log4j 或 logback

我正在开发一个 Spring MVC 应用程序我正在尝试让日志记录再次工作不幸的是有时它停止工作我不知道是什么原因造成的我尝试了网上的一些建议但没有什么用处有什么建议么 Pom xml
在 Windows 窗体应用程序中创建新对象时，如何防止先前绘制的对象消失？

我的问题是在我的 Windows 窗体应用程序中每次在特定图片框中单击鼠标时我想绘制一个椭圆并且我希望之前绘制的椭圆保留在图片框中在当前状态下一旦单击鼠标先前绘制的椭圆将被在光标新位置绘制的新椭圆替换 Ball Paint 绘
MKMapView 路线/方向

我发现 Google Maps API 通过以下方式支持路线 var map var directionsPanel var directions function initialize map new GMap2 document get
S3 静态主机重定向和剥离 slug

是否可以创建一个静态网站重定向规则来重定向www domain1 com gt www domain2 com不维护子弹如果允许通配符类似这样
如何在Python中打印raw_input的行？

通常当raw input要求您输入内容并按回车键反馈将打印在新行上如何打印提示行 CR 在这种情况下可以发挥作用吗 Demo prompt Question answer raw input prompt print answer p
为什么转发的请求会再次通过过滤器链？

我为 Grails 应用程序实现了不常见的架构因为我制作了仅进一步转发请求的前端控制器基于某些标准我还将语言环境解析器实现为 http servlet 请求过滤器事实证明转发的请求再次通过过滤器链所以流程看起来像这样请求到达
通过严格比较对函数进行向量化，以在 2D 数组中查找局部最小值和最大值

我正在尝试提高返回输入 2D NumPy 数组的局部最小值和最大值的函数的性能该函数按预期工作但对于我的用例来说太慢了我想知道是否可以创建此函数的矢量化版本以提高其性能 Here is the formal definition fo

通过严格比较对函数进行向量化，以在 2D 数组中查找局部最小值和最大值

通过严格比较对函数进行向量化，以在 2D 数组中查找局部最小值和最大值 的相关文章

随机推荐

热门标签

通过严格比较对函数进行向量化，以在 2D 数组中查找局部最小值和最大值的相关文章