加速Python中的位串/位运算？

2023-12-11

我使用编写了一个素数生成器埃拉托斯特尼筛法和Python 3.1。代码在 0.32 秒内正确且优雅地运行ideone.com生成最多 1,000,000 个质数。

# from bitstring import BitString

def prime_numbers(limit=1000000):
    '''Prime number generator. Yields the series
    2, 3, 5, 7, 11, 13, 17, 19, 23, 29 ...    
    using Sieve of Eratosthenes.
    '''
    yield 2
    sub_limit = int(limit**0.5) 
    flags = [False, False] + [True] * (limit - 2)   
#     flags = BitString(limit)
    # Step through all the odd numbers
    for i in range(3, limit, 2):       
        if flags[i] is False:
#        if flags[i] is True:
            continue
        yield i
        # Exclude further multiples of the current prime number
        if i <= sub_limit:
            for j in range(i*3, limit, i<<1):
                flags[j] = False
#                flags[j] = True

问题是，当我尝试生成最多 1,000,000,000 的数字时，我的内存不足。

    flags = [False, False] + [True] * (limit - 2)   
MemoryError

正如你可以想象的那样，分配 10 亿个布尔值（~~1 byte~~ 4 或 8 个字节（见注释）在 Python 中每个）确实不可行，所以我研究了位串。我想，每个标志使用 1 位会更加节省内存。然而，该程序的性能急剧下降 - 对于 1,000,000 以内的质数，运行时间为 24 秒。这可能是由于位串的内部实现造成的。

您可以注释/取消注释这三行，以查看我更改为使用 BitString 的内容，如上面的代码片段。

我的问题是，有没有办法加速我的程序，无论有没有位串？

编辑：请在发布之前自行测试代码。当然，我不能接受比我现有代码运行速度慢的答案。

再次编辑：

我已经在我的机器上编制了一份基准测试列表。

您的版本有一些小的优化。通过颠倒真与假的角色，你可以改变“if flags[i] is False:" to "if flags[i]:“。以及第二个的起始值range声明可以是i*i代替i*3。你的原始版本在我的系统上需要 0.166 秒。经过这些更改，下面的版本在我的系统上花费了 0.156 秒。

def prime_numbers(limit=1000000):
    '''Prime number generator. Yields the series
    2, 3, 5, 7, 11, 13, 17, 19, 23, 29 ...
    using Sieve of Eratosthenes.
    '''
    yield 2
    sub_limit = int(limit**0.5)
    flags = [True, True] + [False] * (limit - 2)
    # Step through all the odd numbers
    for i in range(3, limit, 2):
        if flags[i]:
            continue
        yield i
        # Exclude further multiples of the current prime number
        if i <= sub_limit:
            for j in range(i*i, limit, i<<1):
                flags[j] = True

但这对你的记忆问题没有帮助。

进入 C 扩展的世界，我使用了开发版本gmpy。（免责声明：我是维护者之一。）开发版本称为 gmpy2，支持称为 xmpz 的可变整数。使用 gmpy2 和以下代码，我的运行时间为 0.140 秒。 1,000,000,000 的限制运行时间为 158 秒。

import gmpy2

def prime_numbers(limit=1000000):
    '''Prime number generator. Yields the series
    2, 3, 5, 7, 11, 13, 17, 19, 23, 29 ...
    using Sieve of Eratosthenes.
    '''
    yield 2
    sub_limit = int(limit**0.5)
    # Actual number is 2*bit_position + 1.
    oddnums = gmpy2.xmpz(1)
    current = 0
    while True:
        current += 1
        current = oddnums.bit_scan0(current)
        prime = 2 * current + 1
        if prime > limit:
            break
        yield prime
        # Exclude further multiples of the current prime number
        if prime <= sub_limit:
            for j in range(2*current*(current+1), limit>>1, prime):
                oddnums.bit_set(j)

为了推动优化并牺牲清晰度，我使用以下代码获得了 0.107 秒和 123 秒的运行时间：

import gmpy2

def prime_numbers(limit=1000000):
    '''Prime number generator. Yields the series
    2, 3, 5, 7, 11, 13, 17, 19, 23, 29 ...
    using Sieve of Eratosthenes.
    '''
    yield 2
    sub_limit = int(limit**0.5)
    # Actual number is 2*bit_position + 1.
    oddnums = gmpy2.xmpz(1)
    f_set = oddnums.bit_set
    f_scan0 = oddnums.bit_scan0
    current = 0
    while True:
        current += 1
        current = f_scan0(current)
        prime = 2 * current + 1
        if prime > limit:
            break
        yield prime
        # Exclude further multiples of the current prime number
        if prime <= sub_limit:
            list(map(f_set,range(2*current*(current+1), limit>>1, prime)))

编辑：基于这个练习，我修改了 gmpy2 以接受xmpz.bit_set(iterator)。使用以下代码，对于小于 1,000,000,000 的所有素数，Python 2.7 的运行时间为 56 秒，Python 3.2 的运行时间为 74 秒。（正如评论中指出的，xrange比range.)

import gmpy2

try:
    range = xrange
except NameError:
    pass

def prime_numbers(limit=1000000):
    '''Prime number generator. Yields the series
    2, 3, 5, 7, 11, 13, 17, 19, 23, 29 ...
    using Sieve of Eratosthenes.
    '''
    yield 2
    sub_limit = int(limit**0.5)
    oddnums = gmpy2.xmpz(1)
    f_scan0 = oddnums.bit_scan0
    current = 0
    while True:
        current += 1
        current = f_scan0(current)
        prime = 2 * current + 1
        if prime > limit:
            break
        yield prime
        if prime <= sub_limit:
            oddnums.bit_set(iter(range(2*current*(current+1), limit>>1, prime)))

编辑＃2：再试一次！我修改了 gmpy2 以接受xmpz.bit_set(slice)。使用以下代码，对于 Python 2.7 和 Python 3.2，所有小于 1,000,000,000 的素数的运行时间约为 40 秒。

from __future__ import print_function
import time
import gmpy2

def prime_numbers(limit=1000000):
    '''Prime number generator. Yields the series
    2, 3, 5, 7, 11, 13, 17, 19, 23, 29 ...
    using Sieve of Eratosthenes.
    '''
    yield 2
    sub_limit = int(limit**0.5)
    flags = gmpy2.xmpz(1)
    # pre-allocate the total length
    flags.bit_set((limit>>1)+1)
    f_scan0 = flags.bit_scan0
    current = 0
    while True:
        current += 1
        current = f_scan0(current)
        prime = 2 * current + 1
        if prime > limit:
            break
        yield prime
        if prime <= sub_limit:
            flags.bit_set(slice(2*current*(current+1), limit>>1, prime))

start = time.time()
result = list(prime_numbers(1000000000))
print(time.time() - start)

编辑 #3：我已经更新了 gmpy2 以正确支持 xmpz 位级别的切片。性能没有变化，但 API 非常好。我做了一些调整，把时间缩短到了 37 秒左右。（请参阅编辑 #4 以更改 gmpy2 2.0.0b1。）

from __future__ import print_function
import time
import gmpy2

def prime_numbers(limit=1000000):
    '''Prime number generator. Yields the series
    2, 3, 5, 7, 11, 13, 17, 19, 23, 29 ...
    using Sieve of Eratosthenes.
    '''
    sub_limit = int(limit**0.5)
    flags = gmpy2.xmpz(1)
    flags[(limit>>1)+1] = True
    f_scan0 = flags.bit_scan0
    current = 0
    prime = 2
    while prime <= sub_limit:
        yield prime
        current += 1
        current = f_scan0(current)
        prime = 2 * current + 1
        flags[2*current*(current+1):limit>>1:prime] = True
    while prime <= limit:
        yield prime
        current += 1
        current = f_scan0(current)
        prime = 2 * current + 1

start = time.time()
result = list(prime_numbers(1000000000))
print(time.time() - start)

编辑#4：我在 gmpy2 2.0.0b1 中做了一些更改，打破了前面的示例。 gmpy2 不再将 True 视为提供无限 1 位源的特殊值。应使用 -1 代替。

from __future__ import print_function
import time
import gmpy2

def prime_numbers(limit=1000000):
    '''Prime number generator. Yields the series
    2, 3, 5, 7, 11, 13, 17, 19, 23, 29 ...
    using Sieve of Eratosthenes.
    '''
    sub_limit = int(limit**0.5)
    flags = gmpy2.xmpz(1)
    flags[(limit>>1)+1] = 1
    f_scan0 = flags.bit_scan0
    current = 0
    prime = 2
    while prime <= sub_limit:
        yield prime
        current += 1
        current = f_scan0(current)
        prime = 2 * current + 1
        flags[2*current*(current+1):limit>>1:prime] = -1
    while prime <= limit:
        yield prime
        current += 1
        current = f_scan0(current)
        prime = 2 * current + 1

start = time.time()
result = list(prime_numbers(1000000000))
print(time.time() - start)

编辑#5：我对 gmpy2 2.0.0b2 进行了一些增强。您现在可以迭代所有已设置或清除的位。运行时间提高了约 30%。

from __future__ import print_function
import time
import gmpy2

def sieve(limit=1000000):
    '''Returns a generator that yields the prime numbers up to limit.'''

    # Increment by 1 to account for the fact that slices do not include
    # the last index value but we do want to include the last value for
    # calculating a list of primes.
    sieve_limit = gmpy2.isqrt(limit) + 1
    limit += 1

    # Mark bit positions 0 and 1 as not prime.
    bitmap = gmpy2.xmpz(3)

    # Process 2 separately. This allows us to use p+p for the step size
    # when sieving the remaining primes.
    bitmap[4 : limit : 2] = -1

    # Sieve the remaining primes.
    for p in bitmap.iter_clear(3, sieve_limit):
        bitmap[p*p : limit : p+p] = -1

    return bitmap.iter_clear(2, limit)

if __name__ == "__main__":
    start = time.time()
    result = list(sieve(1000000000))
    print(time.time() - start)
    print(len(result))

本文内容由网友自发贡献，版权归原作者所有，本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容，请联系:hwhale#tublm.com(使用前将#替换为@)

加速Python中的位串/位运算？的相关文章

如何向 scikit-learn KD 树添加/删除数据点？

我想知道是否可以在创建 scikit learn KDTree 实例后添加或删除数据点例如 from sklearn neighbors import KDTree import numpy as np X np array 1 1 2
ipython/ pylab/ matplotlib安装和初始化错误

我在 OS X El Captain 上安装了 matplotlib anaconda ipython 然而即使在尝试以所有可能的方式设置环境变量之后我仍无法启动 ipython shell pylab 版本这是错误 ImportEr
requestAnimationFrame 垃圾回收

我正在使用 Chrome 开发工具 v27 中的时间轴分析以下代码的内存使用情况
While 在范围内循环用户输入

我有一些代码我想要求用户输入 1 100 之间的数字如果他们在这些数字之间输入一个数字它将打印 Size input 并打破循环但是如果他们在外部输入一个数字1 100 它将打印大小输入并继续向他们重新询问一个数字但我遇到
删除aws beanstalk上的uuid python包

这是针对所提出问题的后续帖子问题here https stackoverflow com questions 44421761 flask beanstalk deployment errors 以防万一对其他人有用自从第一篇文章以来
Flask-migrate：更改模型属性并重命名相应的数据库列

我对 Flask 有一些经验但对数据库 Flask migrate alembic SqlAlchemy 不太了解我正在跟进this https blog miguelgrinberg com post the flask mega t
python 排列有问题

我在排列方面遇到一些问题当谈到Python时我真的是一个大菜鸟所以任何帮助将不胜感激假设我在文本文件中有一个范围为 1 6 的列表例如它看起来像 1 2 3 4 5 6 我想打开所述 txt 文件并计算这 6 个数字中 N 的所
Java 中查看 ArrayList 是否包含对象的最有效方法

我有一个 Java 对象的 ArrayList 这些对象有四个字段我用其中两个字段来将对象视为与另一个对象相等我正在寻找最有效的方法给定这两个字段以查看数组是否包含该对象问题在于这些类是基于 XSD 对象生成的因此我无法修改类本
如何使用Python3、Selenium Chrome WebDriver在第一次请求之前预加载cookie？

是否可以使用添加cookieadd cookie 对于一个域比如说stackoverflow com在使用 Selenium Chrome WebDriver 进行实际请求之前get 到域上的页面stackoverflow com 尝试时
导入错误：无法导入名称“FFProbe”

我无法获取ffprobe包 https github com simonh10 ffprobe在 Python 3 6 中工作我使用 pip 安装它但是当我输入import ffprobe it says Traceback most
检查 IP 地址是否在给定范围内

我想检查一下是否有IP180 179 77 11位于特定范围之间例如180 179 0 0 180 179 255 255 我编写了一个函数它将每个 IP 八位字节与其他八位字节进行比较 def match mask IP min ip
检查php中位字段是否打开的正确方法是什么

检查位字段是否打开的正确方法是什么在 php 中我想检查来自 db mysql 的位字段是否打开这是正确的方法吗 if bit 1 还有其他方法吗我看到有人使用代码ord http jameslow com 2008 08 12 m
如何同时使用不和谐机器人命令和事件？

我需要制作一个机器人来监听服务器中写入的消息同时接受命令 Create the Discord client client discord Client client commands Bot command prefix client
Python 3 中 int() 和 Floor() 有什么区别？

在Python 2中 floor 返回一个浮点值虽然对我来说并不明显但我发现了一些解释来澄清为什么它可能有用floor 返回浮点数对于类似的情况float inf and float nan 然而在Python 3中 floor 返
错误优化器参数在 Keras 函数中不合法

我使用以下代码来计算数据生成质量指标的拟合优度研究的概率标签 from sklearn model selection import StratifiedKFold from sklearn model selection import K
无限实时连续传输音频信号，Python

我有一个简单的问题在 Python 中从音频插孔流式传输音频信号时使用 pyaudio 库如何继续流式传输音频信号直到我选择停止程序示例我们的方式捕捉我们的网络摄像头 https docs opencv org 3 0 bet
如何传递架构以从现有数据帧创建新数据帧？

要将 schema 传递到 json 文件我们这样做 from pyspark sql types import StructField StringType StructType IntegerType data schema Stru
pytest找不到模块[重复]

这个问题在这里已经有答案了我正在关注pytest 良好实践 https docs pytest org en latest explanation goodpractices html test discovery或者至少我认为我是但是
在Python中从日期时间中减去秒

我有一个 int 变量它实际上是秒让我们调用这个秒数X 我需要得到当前日期和时间以日期时间格式减去的结果X秒 Example If X是 65 当前日期是2014 06 03 15 45 00 那么我需要得到结果2014 06 03
使用属性和性能

我正在优化我的代码我注意到使用属性甚至自动属性对执行时间有深远的影响请参阅下面的示例 Test public void GetterVsField PropertyTest propertyTest new PropertyTest

随机推荐

实体框架向导在 MySQL 上崩溃

我的问题类似于this one但崩溃发生在稍后我must在 SQL Server 和 MySQL 之间互操作 EF 数据库优先模型更具体地说我的应用程序根据连接字符串和配置设置向 SQL Server 或 MySQL 发起普通 SQL
在模板化控件中实现级联 DropDownList 绑定

我有2个DropDownList我的表单上的控件其中第二个使用SelectedValue第一个作为其绑定参数之一 Both DropDownList控件位于FormView InsertItemTemplate with Selected
如何配置战争以便依赖战争中导出的库而不是服务器中的库？

我想在 JBoss 4 2 中部署战争但我无法控制它的目录所以我无法替换jsf impl jar and jsf api jar 我的问题是如何配置战争才能取决于jsf impl jar and jsf api jar在战争中导出而不
保存 woocommerce 产品后，什么是 wordpress/woocommerce 钩子？

我想在用户创建并保存产品数据后通过肥皂客户端将 woocommerce 产品数据发送到邮政公司的网络服务所以我需要一个在用户创建产品后触发的钩子我搜索了很多并找到了一些 wordpress 和 woocommerce 钩子但它们都没有
使用 javascript 和 django 将文本文件上传到服务器中的某个文件夹

我需要上传一个包含一些数据的文本文件来自我的local filesystem to say MEDIA URL uploadfolder 该文件将由我的函数之一处理django view 我创建了一个
使用 EC2 作为代理连接到 Aurora Serverless

不是重复的AWS Aurora MySQL 无服务器如何从 MySQL Workbench 进行连接 Aurora Serverless 尚不支持公共连接我使用 Cloud9 在与数据库相同的 VPN 上创建 EC2 实例然后我从 C
scanf 在 C 循环后跳过[重复]

这个问题在这里已经有答案了 define len 100 char sourceString len char command int main void while 1 printf nEnter source fgets sourceS
C# 中 Queue 的排队事件

我是活动和代表的新手如何为类型的对象实现排队事件Queue
Android IP地址与java

我正在编写一款支持多人游戏的 Android 视频游戏有一个正在运行的专用服务器当通过打开套接字单击多人游戏按钮时机器人会连接到该服务器这工作正常服务器基本上只是充当匹配系统当客户端托管游戏时服务器会将该客户端添加到主机列表中
CSS 形状创建曲线波

这就是我到目前为止所得到的查看教程后 I want know how curved effect is generated on divs the only question that i found near to what i was
TwiML 应用程序意外结束调用：找不到元素“响应”的声明

我创建TwiML应用程序并设置语音请求 URL Web 部署 ASP NET MVC 应用程序操作方法 http voiceapp 001 site1 myasp net voice 当有人访问上面发布的 URL 时将调用此操作方法 pu
桌面 VSCode 与 Anaconda VSCode？

我的电脑上已经安装了 vscode 不久前我注意到它现在包含在 anaconda 发行版中与我已经安装的 vscode 相比使用 anaconda 附带的 vscode 有什么好处吗仅供参考我是一名 Python 开发人员如果这有
MAUI AAD 身份验证 ClaimsPrincipal 且 IsAuthenticated 始终为 false

我有一个在本地主机上运行的 MAUI 应用程序当您启动该应用程序时它会触发 microsoft 登录在用户成功登录后我发现 ClaimsPrincipal 和 IsAuthenticated 始终为 false 根据我的 Main
python 2.6-有效地删除和计算字典列表中的重复项

我正在尝试有效地改变 text hallo world num 1 text hallo world num 2 text hallo world num 1 text haltlo world num 1 text hallo world
静态参数初始化顺序

几周前我从 Java 转向 C 今天我有一个奇怪的行为我尝试在这个简单的示例中重现它我正在使用 net FW 4 我有三门课首先抽象的 namespace ReadonlyStaticOrder using System usi
如何在 Mac OS X 中的多个命令行工具安装之间切换（无需安装 XCode）

是否可以在 Mac OS X 中安装和切换多个版本的命令行工具我想知道是否可以使用xcode select但命令行工具安装程序没有选择安装目标的选项我尝试通过手动重命名来创建多个安装Library Developer CommandLi
Linux x86-64 上的物理内存中的用户空间和内核之间是否存在明确的划分？

也就是说给定一个物理地址我可以判断这个地址是否来自用户空间吗据我所知在虚拟地址空间中内核将使用上半部分和用户空间将使用下半部分但是关于在物理地址空间让问题变得复杂的是我想检查KVM中的客户物理地址这意味着我无法调用客户操
如何在 keycloak 中为 Node.js 应用程序启用策略执行？

我必须将node js应用程序与keycloak集成该应用程序位于express中但策略不强制执行它授予所有用户访问所有api的权限对于 test api 只有具有首席角色的用户才有访问权限我已在 keycloak 管理控制台
如何将对象类型与布尔值进行比较

import java util HashMap public class file public static void main String args Object a a true if a true System out prin
加速Python中的位串/位运算？

我使用编写了一个素数生成器埃拉托斯特尼筛法和Python 3 1 代码在 0 32 秒内正确且优雅地运行ideone com生成最多 1 000 000 个质数 from bitstring import BitString def pri

加速Python中的位串/位运算？

加速Python中的位串/位运算？ 的相关文章

随机推荐

热门标签

加速Python中的位串/位运算？的相关文章