鉴于您的数据不太大,无法以 python 数组的形式提供文件名列表,我建议只在 Python 中进行预处理。创建文件名和标签的两个列表(相同的顺序),并将它们插入 randomshufflequeue 或队列中,然后从中出列。如果您想要 string_input_ Producer 的“无限循环”行为,您可以在每个纪元开始时重新运行“入队”。
一个非常玩具的例子:
import tensorflow as tf
f = ["f1", "f2", "f3", "f4", "f5", "f6", "f7", "f8"]
l = ["l1", "l2", "l3", "l4", "l5", "l6", "l7", "l8"]
fv = tf.constant(f)
lv = tf.constant(l)
rsq = tf.RandomShuffleQueue(10, 0, [tf.string, tf.string], shapes=[[],[]])
do_enqueues = rsq.enqueue_many([fv, lv])
gotf, gotl = rsq.dequeue()
with tf.Session() as sess:
sess.run(tf.initialize_all_variables())
tf.train.start_queue_runners(sess=sess)
sess.run(do_enqueues)
for i in xrange(2):
one_f, one_l = sess.run([gotf, gotl])
print "F: ", one_f, "L: ", one_l
关键是,当您执行以下操作时,您可以有效地将文件名/标签对排队enqueue
,这些对由dequeue
.