Using tf.distribute strategies with tf.keras model subclassing

2024-01-29

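I currently have a tf.keras model subclass, but I cannot use a GPU distribution strategy with it. The TensorFlow website states that this should be possible, yet I receive an error telling me otherwise.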

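One workaround I found is to wrap the model in a tf.keras.models.Model, but that raises:

ValueError: We currently do not support distribution strategy with a `Sequential` model that is created without `input_shape`/`input_dim` set in its first layer or a subclassed model.

I cannot work around this, because my input shape is (None, None): the inputs are a set of sequences of differing shapes, and I have not defined them to share a common shape.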
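
For reference, giving the sequences a common shape would mean something like the per-batch padding sketched below. This is only an illustration under assumptions: `pad_batch`, the pad value 0, and the example token ids are hypothetical and not taken from the pipeline in the traceback. Note that padding each batch to its own longest sequence still leaves the length dimension unknown ahead of time, so it does not by itself yield a fixed `input_shape`.

```python
def pad_batch(sequences, pad_value=0):
    """Right-pad every sequence to the length of the longest one in the batch.

    Returns the padded batch and a 0/1 mask marking the real (unpadded) positions.
    """
    max_len = max((len(seq) for seq in sequences), default=0)
    padded = [list(seq) + [pad_value] * (max_len - len(seq)) for seq in sequences]
    mask = [[1] * len(seq) + [0] * (max_len - len(seq)) for seq in sequences]
    return padded, mask


# Hypothetical token-id sequences of differing lengths.
batch = [[4, 9, 2], [7], [5, 1, 8, 3, 6]]
padded, mask = pad_batch(batch)
print(padded)
print(mask)
```

The mask is returned alongside the padded batch so that downstream code can ignore the padded positions; whether the model in question needs one is an open assumption.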

Is there a way to fix this, or to use tf.distribute with a subclassed model?

nlupy_1     |   File "/app/src/main/python/mosaix/serve/api/__init__.py", line 201, in main
nlupy_1     |     init_state()
nlupy_1     |   File "/app/src/main/python/mosaix/serve/api/__init__.py", line 135, in init_state
nlupy_1     |     network_dict[key] = ParsingPipeline(pipeline_params, 'predict', path)
nlupy_1     |   File "/app/src/main/python/mosaix/learn/pipelines/pipelines.py", line 130, in __init__
nlupy_1     |     self.build()
nlupy_1     |   File "/app/src/main/python/mosaix/learn/pipelines/pipelines.py", line 173, in build
nlupy_1     |     self._load_model()
nlupy_1     |   File "/app/src/main/python/mosaix/learn/pipelines/base_pipeline.py", line 38, in wrapped
nlupy_1     |     return func(*args, **kwargs)
nlupy_1     |   File "/app/src/main/python/mosaix/learn/pipelines/base_pipeline.py", line 169, in _load_model
nlupy_1     |     self.model_inference(warm_up_query, warm_up_annotated)
nlupy_1     |   File "/app/src/main/python/mosaix/learn/pipelines/pipelines.py", line 186, in model_inference
nlupy_1     |     return self._model_inference((raw, annotated))
nlupy_1     |   File "/app/src/main/python/mosaix/learn/pipelines/pipelines.py", line 198, in _model_inference
nlupy_1     |     bio_logits, intent_logits = self.model.predict(inputs)
nlupy_1     |   File "/usr/local/lib/python3.6/dist-packages/tensorflow_core/python/keras/engine/training.py", line 909, in predict
nlupy_1     |     use_multiprocessing=use_multiprocessing)
nlupy_1     |   File "/usr/local/lib/python3.6/dist-packages/tensorflow_core/python/keras/engine/training_distributed.py", line 760, in predict
nlupy_1     |     callbacks=callbacks)
nlupy_1     |   File "/usr/local/lib/python3.6/dist-packages/tensorflow_core/python/keras/engine/training_arrays.py", line 189, in model_iteration
nlupy_1     |     f = _make_execution_function(model, mode)
nlupy_1     |   File "/usr/local/lib/python3.6/dist-packages/tensorflow_core/python/keras/engine/training_arrays.py", line 564, in _make_execution_function
nlupy_1     |     return distributed_training_utils._make_execution_function(model, mode)
nlupy_1     |   File "/usr/local/lib/python3.6/dist-packages/tensorflow_core/python/keras/distribute/distributed_training_utils.py", line 842, in _make_execution_function
nlupy_1     |     return _make_execution_function_with_cloning(model, mode)
nlupy_1     |   File "/usr/local/lib/python3.6/dist-packages/tensorflow_core/python/keras/distribute/distributed_training_utils.py", line 935, in _make_execution_function_with_cloning
nlupy_1     |     _make_replicated_models_with_cloning(model, mode)
nlupy_1     |   File "/usr/local/lib/python3.6/dist-packages/tensorflow_core/python/keras/distribute/distributed_training_utils.py", line 915, in _make_replicated_models_with_cloning
nlupy_1     |     _build_distributed_network(model, strategy, mode)
nlupy_1     |   File "/usr/local/lib/python3.6/dist-packages/tensorflow_core/python/keras/distribute/distributed_training_utils.py", line 783, in _build_distributed_network
nlupy_1     |     args=(model, mode, inputs, targets))
nlupy_1     |   File "/usr/local/lib/python3.6/dist-packages/tensorflow_core/python/distribute/distribute_lib.py", line 1787, in call_for_each_replica
nlupy_1     |     return self._call_for_each_replica(fn, args, kwargs)
nlupy_1     |   File "/usr/local/lib/python3.6/dist-packages/tensorflow_core/python/distribute/parameter_server_strategy.py", line 442, in _call_for_each_replica
nlupy_1     |     self._container_strategy(), self._device_map, fn, args, kwargs)
nlupy_1     |   File "/usr/local/lib/python3.6/dist-packages/tensorflow_core/python/distribute/mirrored_strategy.py", line 196, in _call_for_each_replica
nlupy_1     |     coord.join(threads)
nlupy_1     |   File "/usr/local/lib/python3.6/dist-packages/tensorflow_core/python/training/coordinator.py", line 389, in join
nlupy_1     |     six.reraise(*self._exc_info_to_raise)
nlupy_1     |   File "/usr/local/lib/python3.6/dist-packages/six.py", line 693, in reraise
nlupy_1     |     raise value
nlupy_1     |   File "/usr/local/lib/python3.6/dist-packages/tensorflow_core/python/training/coordinator.py", line 297, in stop_on_exception
nlupy_1     |     yield
nlupy_1     |   File "/usr/local/lib/python3.6/dist-packages/tensorflow_core/python/distribute/mirrored_strategy.py", line 879, in run
nlupy_1     |     self.main_result = self.main_fn(*self.main_args, **self.main_kwargs)
nlupy_1     |   File "/usr/local/lib/python3.6/dist-packages/tensorflow_core/python/keras/distribute/distributed_training_utils.py", line 743, in _build_network_on_replica
nlupy_1     |     model, input_tensors=inputs, layer_fn=models.share_weights)
nlupy_1     |   File "/usr/local/lib/python3.6/dist-packages/tensorflow_core/python/keras/models.py", line 165, in _clone_functional_model
nlupy_1     |     raise ValueError('Expected `model` argument '
nlupy_1     | ValueError: Expected `model` argument to be a functional `Model` instance, but got a subclass model instead.
nlupy_1     |
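
The failing call can be approximated by the sketch below. It is a reconstruction under assumptions, not the code behind the traceback: `JointTagger`, its layers, the sizes, and the random `token_ids` are all hypothetical. Only the general shape of the setup comes from the question (a subclassed `tf.keras.Model`, created under a distribution strategy, whose `predict` returns two outputs). `tf.distribute.MirroredStrategy` is used as a stand-in; the traceback passes through both `parameter_server_strategy.py` and `mirrored_strategy.py`, so the strategy actually in use may differ. On the TensorFlow release shown in the traceback, the `predict` call is where the `ValueError` surfaces; whether it succeeds on a given newer release should be checked against that release rather than assumed.

```python
import numpy as np
import tensorflow as tf


class JointTagger(tf.keras.Model):
    """Hypothetical subclassed model with two heads, loosely mirroring the
    (bio_logits, intent_logits) pair unpacked in the traceback."""

    def __init__(self, vocab_size=100, hidden=16, num_tags=5, num_intents=3):
        super().__init__()
        self.embed = tf.keras.layers.Embedding(vocab_size, hidden)
        self.encoder = tf.keras.layers.GRU(hidden, return_sequences=True)
        self.bio_head = tf.keras.layers.Dense(num_tags)
        self.intent_head = tf.keras.layers.Dense(num_intents)

    def call(self, token_ids, training=False):
        states = self.encoder(self.embed(token_ids))
        # Per-token logits: (batch, time, num_tags)
        bio_logits = self.bio_head(states)
        # Per-sequence logits from the mean encoder state: (batch, num_intents)
        intent_logits = self.intent_head(tf.reduce_mean(states, axis=1))
        return bio_logits, intent_logits


# Stand-in strategy (assumption); the model's variables are created inside its scope.
strategy = tf.distribute.MirroredStrategy()
with strategy.scope():
    model = JointTagger()

# One batch of 4 sequences, here already of equal length 7.
token_ids = np.random.randint(1, 100, size=(4, 7)).astype("int32")
bio_logits, intent_logits = model.predict(token_ids)
print(bio_logits.shape, intent_logits.shape)
```

No input shape is declared anywhere in this sketch, which matches the (None, None) situation described in the question: the sequence length is only known once a batch arrives.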


Tags: python, python3x, tensorflow, Keras
