在 while_loop 的上下文中使用 TensorArray 来累加值

2024-01-02

下面是 Tensorflow RNN Cell 的实现，旨在模拟本文中 Alex Graves 的算法 ACT：http://arxiv.org/abs/1603.08983 http://arxiv.org/abs/1603.08983.

在通过 rnn.rnn 调用的序列中的单个时间步（使用静态 sequence_length 参数，因此 rnn 是动态展开的 - 我使用固定批量大小 20），我们递归调用 ACTStep，生成 size(1,200) 的输出，其中RNN 单元的隐藏维度为 200，批量大小为 1。

使用 Tensorflow 中的 while 循环，我们进行迭代，直到累积的停止概率足够高。所有这些工作都相当顺利，但我在 while 循环内累积状态、概率和输出时遇到问题，我们需要这样做才能创建这些的加权组合作为最终的单元输出/状态。

我尝试使用一个简单的列表，如下所示，但是当编译图时，由于输出不在同一帧中，因此会失败（是否可以使用 control_flow_ops 中的“switch”函数将张量转发到它们是必需的，即在我们返回值之前的 add_n 函数？）。我也尝试过使用 TensorArray 结构，但我发现这很难使用，因为它似乎破坏了形状信息，并且手动替换它不起作用。我还没有找到太多关于 TensorArrays 的文档，我想它们可能主要供内部 TF 使用。

任何有关如何正确累积 ACTStep 生成的变量的建议将不胜感激。

class ACTCell(RNNCell):
"""An RNN cell implementing Graves' Adaptive Computation time algorithm"""
def __init__(self, num_units, cell, epsilon, max_computation):

    self.one_minus_eps = tf.constant(1.0 - epsilon)
    self._num_units = num_units
    self.cell = cell
    self.N = tf.constant(max_computation)
@property
def input_size(self):
    return self._num_units
@property
def output_size(self):
    return self._num_units
@property
def state_size(self):
    return self._num_units

def __call__(self, inputs, state, scope=None):

    with vs.variable_scope(scope or type(self).__name__):

        # define within cell constants/ counters used to control while loop
        prob = tf.get_variable("prob", [], tf.float32,tf.constant_initializer(0.0))
        counter = tf.get_variable("counter", [],tf.float32,tf.constant_initializer(0.0))
        tf.assign(prob,0.0)
        tf.assign(counter, 0.0)

        # the predicate for stopping the while loop. Tensorflow demands that we have
        # all of the variables used in the while loop in the predicate.
        pred = lambda prob,counter,state,input,\
                      acc_state,acc_output,acc_probs:\
            tf.logical_and(tf.less(prob,self.one_minus_eps), tf.less(counter,self.N))

        acc_probs = []
        acc_outputs = []
        acc_states = []


        _,iterations,_,_,acc_states,acc_output,acc_probs = \
        control_flow_ops.while_loop(pred,
        self.ACTStep,
        [prob,counter,state,input,acc_states,acc_outputs,acc_probs])

    # TODO:fix last part of this, need to use the remainder.
    # TODO: find a way to accumulate the regulariser

    # here we take a weighted combination of the states and outputs 
    # to use as the actual output and state which is passed to the next timestep.

    next_state = tf.add_n([tf.mul(x,y) for x,y in zip(acc_probs,acc_states)])
    output = tf.add_n([tf.mul(x,y) for x,y in zip(acc_probs,acc_outputs)])


    return output, next_state

def ACTStep(self,prob,counter,state,input, acc_states,acc_outputs,acc_probs):

    output, new_state = rnn.rnn(self.cell, [input], state, scope=type(self.cell).__name__)

    prob_w = tf.get_variable("prob_w", [self.cell.input_size,1])
    prob_b = tf.get_variable("prob_b", [1])
    p = tf.nn.sigmoid(tf.matmul(prob_w,new_state) + prob_b)

    acc_states.append(new_state)
    acc_outputs.append(output)
    acc_probs.append(p)

    return [tf.add(prob,p),tf.add(counter,1.0),new_state, input,acc_states,acc_outputs,acc_probs]

我将在这个回复前言，这不是一个完整的解决方案，而是一些关于如何改进你的细胞的评论。

首先，在 ACTStep 函数中，您调用rnn.rnn对于一个时间步（定义为[input]。如果您正在执行单个时间步，那么简单地使用实际的时间步可能会更有效self.cell通话功能。您将看到张量流中使用相同的机制细胞包装器 https://github.com/tensorflow/tensorflow/blob/master/tensorflow/python/ops/rnn_cell.py#L708

你提到你已经尝试过使用TensorArrays。您是否正确打包和解包张量数组？这里有一个repo https://github.com/ofirnachum/sequence_gan/blob/master/model.py你会在哪里找到model.py张量数组已正确打包和解包。

您还问是否有一个功能control_flow_ops这将需要累积所有张量。我想你正在寻找tf.control_dependencies https://www.tensorflow.org/versions/r0.9/api_docs/python/framework.html#control_dependencies

您可以在 control_dependicies 中列出所有输出张量操作，这将需要张量流来计算该点之前的所有张量。

另外，它看起来像你的counter变量是可训练的。您确定要这样吗？如果您将计数器加一，则可能不会产生正确的结果。另一方面，您可以故意使其保持可训练性，以便在思考成本函数的最后对其进行区分。

另外我相信 Remainder 函数应该在您的脚本中：

remainder = 1.0 - tf.add_n(acc_probs[:-1])
#note that there is a -1 in the list as you do not want to grab the last probability

这是我编辑的代码版本：

class ACTCell(RNNCell):
    """An RNN cell implementing Graves' Adaptive Computation time algorithm
    Notes: https://www.evernote.com/shard/s189/sh/fd165646-b630-48b7-844c-86ad2f07fcda/c9ab960af967ef847097f21d94b0bff7

    """
    def __init__(self, num_units, cell, max_computation = 5.0, epsilon = 0.01):

        self.one_minus_eps = tf.constant(1.0 - epsilon) #episolon is 0.01 as found in the paper
        self._num_units = num_units
        self.cell = cell
        self.N = tf.constant(max_computation)

    @property
    def input_size(self):
        return self._num_units
    @property
    def output_size(self):
        return self._num_units
    @property
    def state_size(self):
        return self._num_units

    def __call__(self, inputs, state, scope=None):

        with vs.variable_scope(scope or type(self).__name__):

            # define within cell constants/ counters used to control while loop
            prob = tf.constant(0.0, shape = [batch_size]) 
            counter = tf.constant(0.0, shape = [batch_size])

            # the predicate for stopping the while loop. Tensorflow demands that we have
            # all of the variables used in the while loop in the predicate.
            pred = lambda prob,counter,state,input,acc_states,acc_output,acc_probs:\
                tf.logical_and(tf.less(prob,self.one_minus_eps), tf.less(counter,self.N))

            acc_probs, acc_outputs, acc_states  = [], [], []

            _,iterations,_,_,acc_states,acc_output,acc_probs = \
            control_flow_ops.while_loop(
            pred,
            self.ACTStep, #looks like he purposely makes the while loop here
            [prob, counter, state, input, acc_states, acc_outputs, acc_probs])

        '''mean-field updates for states and outputs'''
        next_state = tf.add_n([x*y for x,y in zip(acc_probs,acc_states)])
        output = tf.add_n([x*y for x,y in zip(acc_probs,acc_outputs)])

        remainder = 1.0 - tf.add_n(acc_probs[:-1]) #you take the last off to avoid a negative ponder cost #the problem here is we need to take the sum of all the remainders
        tf.add_to_collection("ACT_remainder", remainder) #if this doesnt work then you can do self.list based upon timesteps
        tf.add_to_collection("ACT_iterations", iterations)
        return output, next_state 

    def ACTStep(self,prob, counter, state, input, acc_states, acc_outputs, acc_probs):

        '''run rnn once'''
        output, new_state = rnn.rnn(self.cell, [input], state, scope=type(self.cell).__name__)

        prob_w = tf.get_variable("prob_w", [self.cell.input_size,1]) 
        prob_b = tf.get_variable("prob_b", [1])
        halting_probability = tf.nn.sigmoid(tf.matmul(prob_w,new_state) + prob_b) 


        acc_states.append(new_state)
        acc_outputs.append(output)
        acc_probs.append(halting_probability) 

        return [p + prob, counter + 1.0, new_state, input,acc_states,acc_outputs,acc_probs]


    def PonderCostFunction(self, time_penalty = 0.01):
        '''
        note: ponder is completely different than probability and ponder = roe

        the ponder cost function prohibits the rnn from cycling endlessly on each timestep when not much is needed
        '''
        n_iterations = tf.get_collection_ref("ACT_iterations")
        remainder = tf.get_collection_ref("ACT_remainder")
        return tf.reduce_sum(n_iterations + remainder) #completely different from probability

这是一篇我自己一直在努力实现的复杂论文。我不介意与您合作在 Tensorflow 中完成它。如果您有兴趣，请在 Skype 上添加我的 LeavesBreathe，我们可以从那里开始。

本文内容由网友自发贡献，版权归原作者所有，本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容，请联系:hwhale#tublm.com(使用前将#替换为@)

tensorflow

recurrentneuralnetwork

在 while_loop 的上下文中使用 TensorArray 来累加值的相关文章

在自定义 keras 层的调用函数中传递附加参数

我创建了一个自定义 keras 层目的是在推理过程中手动更改前一层的激活以下是基本层它只是将激活值乘以一个数字 import numpy as np from keras import backend as K from keras
无法从 DenseVariational 获得合理的结果

我正在尝试使用以下大小的数据集正弦曲线进行回归问题500 首先我尝试使用 2 个密集层每个层有 10 个单元 model tf keras Sequential tf keras layers Dense 10 activation
张量流如何处理复杂的梯度？

Let z是一个复变量 C z 是它的共轭在复分析理论中导数C z w r t z不存在但在张量流中我们可以计算dC z dz结果就是1 这是一个例子 x tf placeholder complex64 2 2 y tf redu
如何保存 Tensorflow.js 模型？

我想制作一个创建保存和训练 tensorflow js 模型的用户界面但我无法在创建模型后保存模型我什至从tensorflow js文档复制了这段代码但它不起作用 const model tf sequential layers t
TensorFlow CUDA_ERROR_OUT_OF_MEMORY

我正在尝试在 TensorFlow 中构建一个大型 CNN 并打算在多 GPU 系统上运行它我采用了塔式系统并为两个 GPU 拆分批次同时将变量和其他计算保留在 CPU 上我的系统有 32GB 内存但是当我运行代码时出现错误
keras LSTM 以正确的形状提供输入

我从具有以下形状的 pandas 数据框中获取一些数据 df head gt gt gt Value USD Drop 7 Up 7 Mean Change 7 Change Predict 0 06480 2 0 4 0 0 000429
Keras 中的条件批量归一化

我正在尝试在 Keras 中实现条件批量标准化我假设我必须创建一个自定义层因此我从正常化 https github com keras team keras blob master keras layers normalization
在 Android 上保持 TensorFlow 模型加密

我搜索了解是否有一种技术可以在 Android 应用程序中保持经过训练的张量流模型 pb 文件的安全但没有找到任何有用的东西我正在发布一个包含我在训练集上构建的张量流模型的应用程序当我发布该应用程序时任何人都可以访问该模型并将其用
从图中删除节点或重置整个默认图

使用默认全局图时是否可以在添加节点后将其删除或者将默认图重置为空当我在 IPython 中交互地使用 TF 时我发现自己必须反复重新启动内核如果可能的话我希望能够更轻松地尝试图表更新 11 2 2016 tf reset de
如何将 std::vector 转换为张量而不在 C++ 中的张量流中进行复制？

在c 中多维矩阵存储在std vector
您必须使用 dtype float(Tensorflow) 为占位符张量“Placeholder”提供值

import tensorflow as tf import os import sklearn preprocessing import pandas as pd import numpy as np print os getcwd os
Tensorflow图像读取空

这个问题是基于 Tensorflow图像读取与显示 https stackoverflow com questions 33648322 tensorflow image reading display 根据他们的代码我们得到以下内容 s
AttributeError：模块“keras.engine”没有属性“Layer”

当我试图运行时Parking Slot mask rcnn py文件我收到如下错误mrcnn model py文件我该如何解决 gt 2021 06 17 08 25 18 585897 W tensorflow stream execut
使用 tf.keras.Models.Sequential 构建的架构是否比使用 Tensorflow 的功能 API 构建的架构运行得更慢、更准确？

我只是比较了 2 个我认为等效的 VGG ish 架构一个是使用构建的tf keras Models Sequential 另一个用了Tensorflow 的函数式 API 每个人都试图解决cats vs dogs 数据集经过 10
如何在 Tensorflow 中使用预训练的 Word2Vec 模型

我有一个Word2Vec训练过的模型Gensim 我如何使用它Tensorflow for Word Embeddings 我不想在 Tensorflow 中从头开始训练嵌入有人可以告诉我如何用一些示例代码来做到这一点吗假设您有一个字典
将 Pytorch 模型 .pth 转换为 onnx 模型

我有一个预训练的模型其格式为 pth 扩展名我想将其转换为 Tensorflow protobuf 但我没有找到任何方法来做到这一点我见过 onnx 可以将模型从 pytorch 转换为 onnx 然后从 onnx 转换为 Tenso
Tensorflow seq2seq 获取序列隐藏状态

我不久前才开始研究tensorflow 我正在研究 seq2seq 模型并以某种方式让教程起作用但我一直坚持获取每个句子的状态据我了解 seq2seq 模型采用输入序列并通过 RNN 为序列生成隐藏状态随后模型使用序列的隐藏状态来
Tensorboard——High-level节点的计算时间与其子节点计算时间的总和不同

继tutorial https www tensorflow org programmers guide graph viz在 TensorFlow 上我试图使用张量板来理解运行时统计数据我发现代表名称范围的高级节点的计算时间不等于其子
在tensorflow.js中对张量进行分区、屏蔽或过滤

我有 2 个相同长度的张量 data and groupIds 我想分开data通过相应的值分成几组groupId 例如 const data tf tensor 1 2 3 4 5 const groupIds tf tensor 0 1
从 swift 数组创建张量

这工作正常 import TensorFlow var t Tensor

随机推荐

如何以 OOP 风格使用 TensorFlow？

具体来说当使用 TensorFlow 以 OOP 风格构建模型时我应该在哪里构建图我应该在哪里启动会话来运行图表此案例的最佳实践是什么 In TensorFlow 力学 101 https www tensorflow org tu
ES6 fetch 函数返回未定义[重复]

这个问题在这里已经有答案了我有以下代码 function fetchDemo var result fetch countriesUrl then function response return response json then f
画布未在reactjs中渲染

我想在我正在开发的网站上添加画布但我似乎可以理解为什么画布没有显示可能是什么问题以下是我尝试过的当我将鼠标悬停在标题上时它显示画布正在更新但屏幕上没有显示任何内容画布 jsx export class Canvas exten
在 R 中按模式重命名列

我想按特定模式重命名数据框中的所有列我的输入 Log NE122 Log NE244 Log NE144 0 33 0 98 1 0 我的预期输出 NE122 NE244 NE144 0 33 0 98 1 0 Cheers 您可以使用正
在 Visual Studio 中开发 Azure Function 时存储帐户无效

我正在使用 C 在 Visual Studio 中开发 Azure Function 我在位于代理后面的开发机器上本地运行它但是不断收到此错误 Exception binding parameter Invalid storage acc
打字稿路径无法解析

Here https github com oleersoy typescript pathsGithub MCVE 显示了一个问题 npm run compile显示错误我正在尝试这样做 import Todo from test 但这
检测用户是否在颤动上按下 home / tab 的代码？

是否有任何代码可以检测用户是否按下了 home tab 我想让我的音乐在按下时暂停通过添加观察者来跟踪生命周期事件WidgetsBinding然后在应用程序暂停时暂停音乐你可以看看this https github com flutte
核心数据executeFetchRequest抛出NSGenericException（枚举时集合发生了变化）

我正在使用 Core Data 开发 iPhone 应用程序所有用户数据应与我们的服务器同步为此我创建了 NSOperation 的子类它从我们的 Web 服务加载新数据并创建相应的托管对象为了维护它们之间的关系每个对象都使用远
哪个是最好的 git 托管软件？ - Gitolite vs. Gitlab vs. Gitorius [关闭]

Closed 这个问题是基于意见的 help closed questions 目前不接受答案我正在寻找适合多个用户的 git 托管环境因此我搜索了之间的比较Gitolite Gitlab and Gitorius 但我没有得到任何有用
YAML：YAML 中的字符串需要引号吗？

我正在尝试编写一个用于 Rails 项目国际化的 YAML 字典不过我有点困惑因为在某些文件中我看到字符串用双引号引起来而在某些文件中则没有需要考虑的几点示例1 https github com plataformatec dev
Powershell：使用字符串匹配条件将单个文件拆分为多个文件

我有一个包含 1GB 数据的文件该数据实际上是数十个或数千个单独的迷你文件我需要提取每个单独的文件并将它们放入自己单独的不同文件中所以本质上我需要从单个文件变成 30K 单独的文件这是我的文件的示例文件名 1 版本 1 32
CRUDRespository 中的更新或 SaveorUpdate，是否有任何可用选项

我正在尝试使用 My Entity bean 执行 CRUD 操作 CRUDRepository提供标准方法find delete and save但没有可用的通用方法例如saveOrUpdate Entity entity 进而调用Hi
如何将json对象显示为html？

我的 Json 对象是这样的 attributes Code SGL Total 19421340 27 DayPrice Date 2016 07 22 Rate 4900439 85 Date 2016 07 23 Rate 48451
绕过 Google 电子表格中的循环引用

我有一个谷歌文档电子表格有两列 A 和 B B 的值只是 A 中不同格式的值并且我在 B 列中有一个公式可以进行转换有时我没有 A 格式的值但有 B 格式的值我想通过在 A 列中添加进行反向转换的公式来自动获取 A 列中 A 格式
如何在 vue.js 构建上重命名 index.html？

我想重命名index html产生于npm run build 我在 webpack 配置中找不到任何内容我还创建了一个vue config js此处描述 https github com vuejs vue cli tree dev d
React Redux 工具包：类型错误：无法读取未定义的属性“值”

在我的项目中我为 2 个不同的状态场景实现了 React Redux 工具包并且它们工作得很好现在我需要为 Redux 实现第三个状态场景因此我遵循与前 2 个状态场景相同的模式灵感来自 https react redux js
为什么我的 Django 表单没有引发任何错误？

我有一个简单的表单每当用户在表单上做错事时我想在 Django 上引发验证错误问题是我设置了表单验证但是当提交表单时使用错误的值时它会通过我想知道为什么会发生这种情况以及如何避免这种情况这是 html 形式
如何检查浏览器是否支持flash？

我有一个 Flash 横幅如果客户端浏览器没有启用 Flash 我想用静态图像替换它我想知道我是否可以用 php 做到这一点或者是否有人知道一个好方法 Thanks 允许您的 Flash 影片降级
使用 Flask-limiter 限制端点速率

我知道并且爱flask limiter来自较旧的项目现在我想用它在我的flask restplus为基础的项目我的最终解决方案将使我能够在每个方法级别上进行速率限制因此 post 方法的费率与 get 方法的费率不同但如果我可以定义
在 while_loop 的上下文中使用 TensorArray 来累加值

下面是 Tensorflow RNN Cell 的实现旨在模拟本文中 Alex Graves 的算法 ACT http arxiv org abs 1603 08983 http arxiv org abs 1603 08983 在通过

在 while_loop 的上下文中使用 TensorArray 来累加值

在 while_loop 的上下文中使用 TensorArray 来累加值 的相关文章

随机推荐

热门标签

在 while_loop 的上下文中使用 TensorArray 来累加值的相关文章