There is a way to convert constants back into trainable variables in TensorFlow, via the Graph Editor. However, you need to specify the nodes you want to convert, since I'm not sure there is a way to detect this automatically in a reliable fashion.

The steps are as follows:

Step 1: Load the frozen graph

We load our .pb file into a graph object.
    import tensorflow as tf

    # Load protobuf as graph, given filepath
    def load_pb(path_to_pb):
        with tf.gfile.GFile(path_to_pb, 'rb') as f:
            graph_def = tf.GraphDef()
            graph_def.ParseFromString(f.read())
        with tf.Graph().as_default() as graph:
            tf.import_graph_def(graph_def, name='')
            return graph

    tf_graph = load_pb('frozen_graph.pb')
Step 2: Find the constants that need to be converted

Here are two ways to list the names of the nodes in the graph:

- Use this script to print them: https://gist.github.com/sunsided/88d24bf44068fe0fe5b88f09a1bee92a
- Print them directly:

      print([n.name for n in tf_graph.as_graph_def().node])
The nodes you want to convert are likely named something like "Const". To be sure, load the graph in Netron (https://github.com/lutzroeder/netron) to see which tensors are storing the trainable weights. Often, it is safe to assume that all const nodes were once variables.

Once you have identified these nodes, store their names in a list:
to_convert = [...] # names of tensors to convert
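If you go with the assumption that every Const node was once a variable, you can build `to_convert` programmatically by filtering the node list on its op type. A minimal sketch of that filter, using a hypothetical list of `(name, op)` pairs in place of the real `tf_graph.as_graph_def().node` (whose entries expose `.name` and `.op` attributes):

```python
# Hypothetical stand-in for tf_graph.as_graph_def().node; in a real
# graph, each node has .name and .op attributes instead of a tuple.
nodes = [
    ('conv1/weights', 'Const'),
    ('conv1/Conv2D', 'Conv2D'),
    ('conv1/biases', 'Const'),
    ('conv1/BiasAdd', 'BiasAdd'),
]

# Keep only the Const nodes, assuming each constant was once a variable.
to_convert = [name for name, op in nodes if op == 'Const']
print(to_convert)  # ['conv1/weights', 'conv1/biases']
```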
Step 3: Convert the constants to variables

Run this code to convert your specified constants. It essentially creates a corresponding variable for each constant, then uses the Graph Editor to detach each constant from the graph and hook the variable up in its place.
    import numpy as np
    import tensorflow as tf
    import tensorflow.contrib.graph_editor as ge

    const_var_name_pairs = []
    with tf_graph.as_default() as g:
        for name in to_convert:
            tensor = g.get_tensor_by_name('{}:0'.format(name))
            with tf.Session() as sess:
                tensor_as_numpy_array = sess.run(tensor)
            var_shape = tensor.get_shape()
            # Give each variable a name that doesn't already exist in the graph
            var_name = '{}_turned_var'.format(name)
            # Create a TensorFlow variable initialized by the values of the original const.
            var = tf.get_variable(name=var_name, dtype='float32', shape=var_shape,
                                  initializer=tf.constant_initializer(tensor_as_numpy_array))
            # We want to keep track of our variables' names for later.
            const_var_name_pairs.append((name, var_name))

        # At this point, we added a bunch of tf.Variables to the graph, but they're
        # not connected to anything.
        # The magic: we use the TF Graph Editor to swap the Constant nodes' outputs with
        # the outputs of our newly created Variables.
        for const_name, var_name in const_var_name_pairs:
            const_op = g.get_operation_by_name(const_name)
            var_reader_op = g.get_operation_by_name(var_name + '/read')
            ge.swap_outputs(ge.sgv(const_op), ge.sgv(var_reader_op))
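For intuition, the swap reroutes every consumer that was reading the constant's output tensor so that it reads the new variable's `/read` tensor instead. Here is a toy, non-TensorFlow sketch of that rewiring, with the graph modeled as a hypothetical dict mapping each op name to its list of input tensor names (the real Graph Editor operates on actual graph ops, not this structure):

```python
# Toy graph: op name -> input tensor names. 'MatMul' currently consumes
# the frozen constant's output tensor 'weights:0'.
graph_inputs = {
    'MatMul': ['input:0', 'weights:0'],
    'weights_turned_var/read': ['weights_turned_var:0'],
}

def redirect_consumers(graph, old_op, new_op):
    # Point every input that referenced old_op's output at new_op's output.
    for op, inputs in graph.items():
        graph[op] = [new_op + ':0' if t == old_op + ':0' else t
                     for t in inputs]

redirect_consumers(graph_inputs, 'weights', 'weights_turned_var/read')
print(graph_inputs['MatMul'])  # ['input:0', 'weights_turned_var/read:0']
```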
Step 4: Save the result as a .ckpt

    # Build the Saver inside tf_graph's context so it sees the new variables.
    with tf_graph.as_default():
        with tf.Session() as sess:
            sess.run(tf.global_variables_initializer())
            save_path = tf.train.Saver().save(sess, 'model.ckpt')
            print("Model saved in path: %s" % save_path)
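To double-check that the conversion preserved the weights, you can compare the numpy arrays you pulled out of the constants in step 3 against the values restored from the checkpoint. A minimal sketch of that comparison, assuming you have collected both sides into hypothetical dicts keyed by the original const names:

```python
import numpy as np

# Hypothetical dicts: `original` holds the arrays read from the constants,
# `restored` holds the values read back from the checkpoint's variables.
original = {'weights': np.array([[1.0, 2.0], [3.0, 4.0]])}
restored = {'weights': np.array([[1.0, 2.0], [3.0, 4.0]])}

mismatches = [k for k in original if not np.allclose(original[k], restored[k])]
print('mismatched tensors: %s' % mismatches)  # mismatched tensors: []
```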
And voilà! You should be done at this point :) I was able to get this working myself, and verified that the model weights are preserved; the only difference is that the graph is now trainable. Let me know if you run into any issues.