如何以 HDF5 格式提供 caffe 多标签数据？

2024-05-03

我想将 caffe 与矢量标签一起使用，而不是整数。我检查了一些答案，似乎 HDF5 是更好的方法。但后来我陷入了这样的错误：

precision_layer.cpp:34] 检查失败：outer_num_ * inner_num_ == bottom[1]->count()（50 vs. 200）标签数量必须与预测数量匹配；例如，如果标签轴 == 1 并且预测形状为 (N, C, H, W)，则标签计数（标签数量）必须为N*H*W，整数值为 {0, 1, ..., C-1}。

HDF5 创建为：

f = h5py.File('train.h5', 'w')
f.create_dataset('data', (1200, 128), dtype='f8')
f.create_dataset('label', (1200, 4), dtype='f4')

我的网络是由以下内容生成的：

def net(hdf5, batch_size):
    n = caffe.NetSpec()
    n.data, n.label = L.HDF5Data(batch_size=batch_size, source=hdf5, ntop=2)
    n.ip1 = L.InnerProduct(n.data, num_output=50, weight_filler=dict(type='xavier'))
    n.relu1 = L.ReLU(n.ip1, in_place=True)
    n.ip2 = L.InnerProduct(n.relu1, num_output=50, weight_filler=dict(type='xavier'))
    n.relu2 = L.ReLU(n.ip2, in_place=True)
    n.ip3 = L.InnerProduct(n.relu1, num_output=4, weight_filler=dict(type='xavier'))
    n.accuracy = L.Accuracy(n.ip3, n.label)
    n.loss = L.SoftmaxWithLoss(n.ip3, n.label)
    return n.to_proto()

with open(PROJECT_HOME + 'auto_train.prototxt', 'w') as f:
f.write(str(net('/home/romulus/code/project/train.h5list', 50)))

with open(PROJECT_HOME + 'auto_test.prototxt', 'w') as f:
f.write(str(net('/home/romulus/code/project/test.h5list', 20)))

看来我应该增加标签数量并将内容放入整数而不是数组中，但如果我这样做，caffe 会抱怨数据数量和标签不相等，然后存在。

那么，输入多标签数据的正确格式是什么？

另外，我很想知道为什么没有人简单地编写 HDF5 映射到 caffe blob 的数据格式？

回答这个问题的标题：

HDF5 文件的根目录中应有两个数据集，分别命名为“data”和“label”。形状是（data amount, dimension）。我只使用一维数据，所以我不确定顺序是什么channel, width, and height。也许这并不重要。dtype应该是浮动或双精度。

创建训练集的示例代码h5py is:



import h5py, os
import numpy as np

f = h5py.File('train.h5', 'w')
# 1200 data, each is a 128-dim vector
f.create_dataset('data', (1200, 128), dtype='f8')
# Data's labels, each is a 4-dim vector
f.create_dataset('label', (1200, 4), dtype='f4')

# Fill in something with fixed pattern
# Regularize values to between 0 and 1, or SigmoidCrossEntropyLoss will not work
for i in range(1200):
    a = np.empty(128)
    if i % 4 == 0:
        for j in range(128):
            a[j] = j / 128.0;
        l = [1,0,0,0]
    elif i % 4 == 1:
        for j in range(128):
            a[j] = (128 - j) / 128.0;
        l = [1,0,1,0]
    elif i % 4 == 2:
        for j in range(128):
            a[j] = (j % 6) / 128.0;
        l = [0,1,1,0]
    elif i % 4 == 3:
        for j in range(128):
            a[j] = (j % 4) * 4 / 128.0;
        l = [1,0,1,1]
    f['data'][i] = a
    f['label'][i] = l

f.close()

此外，不需要精度层，只需将其删除即可。下一个问题是损失层。自从SoftmaxWithLoss只有一个输出（具有最大值的维度的索引），它不能用于多标签问题。感谢 Adian 和 Shai，我发现SigmoidCrossEntropyLoss在这种情况下很好。

下面是完整的代码，从数据创建、训练网络到获取测试结果：

main.py（根据caffelanet示例修改）



import os, sys

PROJECT_HOME = '.../project/'
CAFFE_HOME = '.../caffe/'
os.chdir(PROJECT_HOME)

sys.path.insert(0, CAFFE_HOME + 'caffe/python')
import caffe, h5py

from pylab import *
from caffe import layers as L

def net(hdf5, batch_size):
    n = caffe.NetSpec()
    n.data, n.label = L.HDF5Data(batch_size=batch_size, source=hdf5, ntop=2)
    n.ip1 = L.InnerProduct(n.data, num_output=50, weight_filler=dict(type='xavier'))
    n.relu1 = L.ReLU(n.ip1, in_place=True)
    n.ip2 = L.InnerProduct(n.relu1, num_output=50, weight_filler=dict(type='xavier'))
    n.relu2 = L.ReLU(n.ip2, in_place=True)
    n.ip3 = L.InnerProduct(n.relu2, num_output=4, weight_filler=dict(type='xavier'))
    n.loss = L.SigmoidCrossEntropyLoss(n.ip3, n.label)
    return n.to_proto()

with open(PROJECT_HOME + 'auto_train.prototxt', 'w') as f:
    f.write(str(net(PROJECT_HOME + 'train.h5list', 50)))
with open(PROJECT_HOME + 'auto_test.prototxt', 'w') as f:
    f.write(str(net(PROJECT_HOME + 'test.h5list', 20)))

caffe.set_device(0)
caffe.set_mode_gpu()
solver = caffe.SGDSolver(PROJECT_HOME + 'auto_solver.prototxt')

solver.net.forward()
solver.test_nets[0].forward()
solver.step(1)

niter = 200
test_interval = 10
train_loss = zeros(niter)
test_acc = zeros(int(np.ceil(niter * 1.0 / test_interval)))
print len(test_acc)
output = zeros((niter, 8, 4))

# The main solver loop
for it in range(niter):
    solver.step(1)  # SGD by Caffe
    train_loss[it] = solver.net.blobs['loss'].data
    solver.test_nets[0].forward(start='data')
    output[it] = solver.test_nets[0].blobs['ip3'].data[:8]

    if it % test_interval == 0:
        print 'Iteration', it, 'testing...'
        correct = 0
        data = solver.test_nets[0].blobs['ip3'].data
        label = solver.test_nets[0].blobs['label'].data
        for test_it in range(100):
            solver.test_nets[0].forward()
            # Positive values map to label 1, while negative values map to label 0
            for i in range(len(data)):
                for j in range(len(data[i])):
                    if data[i][j] > 0 and label[i][j] == 1:
                        correct += 1
                    elif data[i][j] %lt;= 0 and label[i][j] == 0:
                        correct += 1
        test_acc[int(it / test_interval)] = correct * 1.0 / (len(data) * len(data[0]) * 100)

# Train and test done, outputing convege graph
_, ax1 = subplots()
ax2 = ax1.twinx()
ax1.plot(arange(niter), train_loss)
ax2.plot(test_interval * arange(len(test_acc)), test_acc, 'r')
ax1.set_xlabel('iteration')
ax1.set_ylabel('train loss')
ax2.set_ylabel('test accuracy')
_.savefig('converge.png')

# Check the result of last batch
print solver.test_nets[0].blobs['ip3'].data
print solver.test_nets[0].blobs['label'].data

h5list 文件仅包含每行中 h5 文件的路径：

火车.h5list

/home/foo/bar/project/train.h5

测试.h5列表

/home/foo/bar/project/test.h5

和求解器：

auto_solver.prototxt


train_net: "auto_train.prototxt"
test_net: "auto_test.prototxt"
test_iter: 10
test_interval: 20
base_lr: 0.01
momentum: 0.9
weight_decay: 0.0005
lr_policy: "inv"
gamma: 0.0001
power: 0.75
display: 100
max_iter: 10000
snapshot: 5000
snapshot_prefix: "sed"
solver_mode: GPU

Converge graph:

上一批结果：



[[ 35.91593933 -37.46276474 -6.2579031 -6.30313492]
[ 42.69248581 -43.00864792 13.19664764 -3.35134125]
[ -1.36403108 1.38531208 2.77786589 -0.34310576]
[ 2.91686511 -2.88944006 4.34043217 0.32656598]
...
[ 35.91593933 -37.46276474 -6.2579031 -6.30313492]
[ 42.69248581 -43.00864792 13.19664764 -3.35134125]
[ -1.36403108 1.38531208 2.77786589 -0.34310576]
[ 2.91686511 -2.88944006 4.34043217 0.32656598]]

[[ 1. 0. 0. 0.]
[ 1. 0. 1. 0.]
[ 0. 1. 1. 0.]
[ 1. 0. 1. 1.]
...
[ 1. 0. 0. 0.]
[ 1. 0. 1. 0.]
[ 0. 1. 1. 0.]
[ 1. 0. 1. 1.]]

我认为这段代码还有很多地方需要改进。任何建议表示赞赏。

本文内容由网友自发贡献，版权归原作者所有，本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容，请联系:hwhale#tublm.com(使用前将#替换为@)

如何以 HDF5 格式提供 caffe 多标签数据？的相关文章

在 Django 中定义视图和 url。为什么调用函数时不使用括号？

我已经在经历 Python速成课程目前正在进行 Django Web应用程序项目学习日志阶段有些东西与我已经学到的相矛盾 views py file from django shortcuts import render def i
使用 python 制作本地服务器应用程序的最佳方法

我想要简单轻松地集成 python 和 vba 人们如果他们在阅读本文后亲自见到我阅读本文可能会杀了我但我正在使用 django 开发服务器来实现此目的有没有什么简单又好的方法仅举个例子我想使用 python 模块 openpy
python 可以检测它运行在哪个操作系统下吗？

python 可以检测操作系统然后为文件系统构建 if else 语句吗我需要将 Fn 字符串中的 C CobaltRCX 替换为 FileSys 字符串 import os path csv from time import strf
如何屏蔽 PyTorch 权重参数中的权重？

我正在尝试在 PyTorch 中屏蔽强制为零特定权重值我试图掩盖的权重是这样定义的def init class LSTM MASK nn Module def init self options inp dim super LSTM
在 Python 中使用 XPath 和 LXML

我有一个 python 脚本用于解析 XML 并将某些感兴趣的元素导出到 csv 文件中我现在尝试更改脚本以允许根据条件过滤 XML 文件等效的 XPath 查询将是 DC Events Confirmation contains T
使用 Django 的 post_save() 信号

我有两张桌子 class Advertisement models Model created at models DateTimeField auto now add True author email models EmailField
Dask DataFrame 的逐行处理

我需要处理一个大文件并更改一些值我想做这样的事情 for index row in dataFrame iterrows foo doSomeStuffWith row lol doOtherStuffWith row dataFrame
类属性在功能上依赖于其他类属性

我正在尝试使用静态类属性来定义另一个静态类属性我认为可以通过以下代码来实现 f lambda s s 1 class A foo foo bar f A foo 然而这导致NameError name A is not defined
NLTK、搭配问题：需要解包的值太多（预期为 2）

我尝试使用 NLTK 检索搭配但出现错误我使用内置的古腾堡语料库 I wrote alice nltk corpus gutenberg fileids 7 al nltk corpus gutenberg words alice al
反加入熊猫

我有两个表我想附加它们以便仅保留表 A 中的所有数据并且仅在其键唯一时添加表 B 中的数据键值在表 A 和 B 中是唯一的但在某些情况下键将出现在表 A 和 B 中我认为执行此操作的方法将涉及某种过滤联接反联接以获取表 B
如何为多组精灵创建随机位置？

我尝试使用 blit 和 draw 方法进行 for 循环并为 PlayerSprite 和 Treegroup 使用不同的变量 for PlayerSprite in Treegroup surface blit PlayerSprit
使用Python将图像转换为十六进制格式

我的下面有一个jpg文件tmp folder upload path tmp resized test jpg 我一直在使用下面的代码 Method 1 with open upload path rb as image file enco
在 Mac 上安装 Pygame 到 Enthought 构建中

关于在 Mac 上安装 Pygame 有许多未解答的问题但我将在这里提出我的具体问题并希望得到答案我在 Mac 上安装 Pygame 时遇到了难以置信的困难我使用 Enthought 版本 EPD 7 3 2 32 位它是我的默认框
在 Windows 上使用 IPython 笔记本时出现 500 服务器错误

我刚刚在 Windows 7 Professional 64 位上全新安装了 IPython 笔记本我采取的步骤是从以下位置安装 Python 3 4 1http python org http python org gt pip in
Python int 太大，无法放入 SQLite

我收到错误 OverflowError Python int 太大无法转换为 SQLite INTEGER 来自以下代码块该文件约25GB 因此必须分部分读取 length 6128765 Works on partitions of
是否可以写一个负的python类型注释

这可能听起来不合理但现在我需要否定类型注释我的意思是这样的 an int Not Iterable a string Iterable 这是因为我为一个函数编写了一个重载而 mypy 不理解我我的功能看起来像这样 overload
Plotly：如何避免巨大的 html 文件大小

我有一个 3D 装箱模型它使用绘图来绘制输出图我注意到绘制了 600 个项目生成 html 文件需要很长时间文件大小为 89M 这太疯狂了我怀疑可能存在一些巨大的重复或者是由单个项目的 add trace 方法引起的阴谋为
Python模块单元测试的最佳文件结构组织？

遗憾的是我发现有太多方法可以在 Python 中保存单元测试而且它们通常没有很好的文档记录我正在寻找一种终极结构它可以满足以下大部分要求 be discoverable by test frameworks including
Google App Engine 中的自定义身份验证

有谁知道或知道我可以在哪里学习如何使用 Python 和 Google App Engine 创建自定义身份验证流程我不想使用 Google 帐户进行身份验证并且希望能够创建自己的用户如果不是专门针对 Google App Engin
如何识别图形线条

我有以下格式的路径的 x y 数据示例仅用于说明 seq p1 p2 0 20 2 3 1 20 2 4 2 20 4 4 3 22 5 5 4 22 5 6 5 23 6 2 6 23 6 3 7 23 6 4 每条路径都有多个点它们

随机推荐

Flot 中轴的逗号分隔数字

有没有办法让 Flot 使轴编号以逗号分隔例如用 1 000 000 代替 1000000 您可以通过使用轴的tickFormatter 属性来做到这一点 xaxis tickFormatter function val axis in
Mojave + Xcode 10 构建在 glog config.h、gflags/gflags.h 上失败

我正在 Mac OS Mojave 和 Xcode 10 上测试 React Native 0 56 0 rc 2 Running react native init TestProject version 0 56 0 rc 2 cd T
Excel VBA 中的正则表达式

我在 Excel VBA 中使用 Microsoft 正则表达式引擎我对正则表达式很陌生但我现在有一个正在运行的模式我需要扩展它但我遇到了麻烦到目前为止这是我的代码 Sub ImportFromDTD Dim sDTDFile
TSQL 多列唯一约束也允许多个 Null

我目前正在做一些从 MS Access 到 SQL Server 的迁移 Access 允许唯一索引中存在多个 Null 而 SQL Server 不允许我一直在通过删除 SQL Server 中的索引并添加筛选索引来处理迁移 CREAT
启动jetty服务器时出现NoClassDefFoundError

我正在尝试在码头服务器中托管我的网络应用程序 spring 我将 war 文件复制到 jetty 服务器中的 webapp 文件夹中我并不是想嵌入jetty服务器而是试图在jetty内托管应用程序如tomcat 我没有安装jetty
如何在 Microsoft Visual Studio 2017 中检查 C++ 版本

我正在尝试使用以下代码检查我拥有的 C 版本 if cplusplus 201703L std cout lt lt C 17 n else if cplusplus 201402L std cout lt lt C 14 n else i
如何减少 MediaCodec H264 编码器延迟

我正在尝试使用 Android6 0 的 MediaCodec 将 h264 实时低延迟编码为流编码器大约有 6 帧延迟我想知道如何减少代码来自屏幕记录 cpp https android googlesource com platf
使用 Flask-restful RequestParser 进行嵌套验证

使用烧瓶宁静 http flask restful readthedocs org 微框架我在构建一个RequestParser这将验证嵌套资源假设预期的 JSON 资源格式为 a list obj1 1 obj2 2 obj3 3 o
ios ScheduledTimerWithTimeInterval 的时间量

我想使用 ScheduledTimerWithTimeInterval 来执行一定时间的定期任务比如说一小时但我如何在我的代码上实现这是我的代码 timer NSTimer scheduledTimerWithTimeInterval
TFS自定义签入策略调试

我创建了一个自定义签入政策如下面的链接所示 http msdn microsoft com en us library bb668980 aspx http msdn microsoft com en us library bb66898
范围对象 - 为什么有时我不能使用工作表

在这个线程中 Excel VBA 查找特定工作表上范围内的最大值 https stackoverflow com questions 31906571 excel vba find maximum value in range on spe
如何在 Rollup 中配置从多个输入文件仅生成单个输出文件？

配置Rollupjs生成库时如果输入是由多个javascript文件组成的数组我们如何才能将这些输入生成为一个输出 js 文件呢 export const lgService input src app services livegiv
如何在IE8及以下浏览器中应用边框半径？

我想知道如何将 border radius 应用于 IE8 及以下 IE8 浏览器我知道 border radius 是 HTML5 的一项功能而 IE8 不支持它我发现通过使用 htc 我们可以实现这一点但是通过使用 htc 我遇
Node.js npm mssql 函数返回未定义

我使用 mssql 和 node js 连接到 sql server 数据库我试图通过将连接代码包装在具有一个查询参数的函数中来减少代码当我从 router get 函数中的 with 调用该函数时它返回未定义任何帮助将非常感激 f
WPF - 非常基本的 ListBox.ItemTemplate 问题

好吧这是一个看似简单得令人尴尬的问题但却让我发疯我正在学习 DataTemplate 并尝试将一个非常非常简单的 ItemTemplate 应用于 ListBox 然而当我运行我的应用程序时模板被完全忽略我只得到标准外观的列表框
获取SSAS立方体上次处理时间

在 Excel 中我与数据多维数据集建立 Analysis Services 连接我希望能够通过向用户显示最后一次多维数据集处理时间发生的时间来向用户展示数据的最新情况在 SQL Server Management Studio SS
使用 System.loadLibrary() 时出现不满意的链接错误？

由于某种原因我在我的 java 应用程序中遇到了令人讨厌的不满意链接错误这是所涉犯罪者 System loadLibrary psjw 尽管库 psjw dll 显然与此类位于同一源包中请帮忙确保 psjw dll 位于您的 PAT
事件溯源：在重放事件并监听新传入事件时避免项目重复事件

在需要构建新视图的场景中我们可以重播来自活动商店结果我们将投影出新的视图因此我们的想法是部署一个新的投影该投影可以投影所有旧事件通过重播并监听新传入的事件并投影它们我认为在读取旧事件和收听新传入事件时可能会发生比赛条件因
如何在android中的google（设备）本机应用程序上添加自定义按钮？

我想在谷歌设备的本机应用程序上添加一个按钮例如谷歌地图使用此按钮我想打开我的应用程序我已经做了一些相关工作 Using 无障碍服务 https developer android com reference android ac
如何以 HDF5 格式提供 caffe 多标签数据？

我想将 caffe 与矢量标签一起使用而不是整数我检查了一些答案似乎 HDF5 是更好的方法但后来我陷入了这样的错误 precision layer cpp 34 检查失败 outer num inner num bottom 1

如何以 HDF5 格式提供 caffe 多标签数据？

如何以 HDF5 格式提供 caffe 多标签数据？ 的相关文章

随机推荐

热门标签

如何以 HDF5 格式提供 caffe 多标签数据？的相关文章