TensorFlow：自分の画像でのトレーニング

Question

TensorFlowは初めてです。 独自の画像をトレーニングするデータセットが可能な画像認識のヘルプを探しています。

新しいデータセットをトレーニングする例はありますか？

Olivier Moindrot · Accepted Answer

TensorFlowで独自のデータを入力する方法に興味がある場合は、このチュートリアルをご覧ください。
また、StanfordでCS230のベストプラクティスを含むガイドを作成しましたこちら。

新しい回答（`tf.data`付き）およびラベル付き

tf.dataにr1.4を導入すると、プレースホルダーやキューなしで画像のバッチを作成できます。手順は次のとおりです。

画像のファイル名と対応するラベルのリストを含むリストを作成します
これらのファイル名とラベルを読み取るtf.data.Datasetを作成します
データを前処理する
次のバッチを生成するtf.data.Datasetからイテレーターを作成します

コードは次のとおりです。

# step 1 filenames = tf.constant(['im_01.jpg', 'im_02.jpg', 'im_03.jpg', 'im_04.jpg']) labels = tf.constant([0, 1, 0, 1]) # step 2: create a dataset returning slices of `filenames` dataset = tf.data.Dataset.from_tensor_slices((filenames, labels)) # step 3: parse every image in the dataset using `map` def _parse_function(filename, label): image_string = tf.read_file(filename) image_decoded = tf.image.decode_jpeg(image_string, channels=3) image = tf.cast(image_decoded, tf.float32) return image, label dataset = dataset.map(_parse_function) dataset = dataset.batch(2) # step 4: create iterator and final input tensor iterator = dataset.make_one_shot_iterator() images, labels = iterator.get_next()

これで、プレースホルダーを介してデータを供給せずに、sess.run([images, labels])を直接実行できます。

古い回答（TensorFlowキューを使用）

まとめると、複数のステップがあります：

ファイル名のリストを作成します（例：画像へのパス）
TensorFlowファイル名キューを作成します
各画像の読み取りとデコード、固定サイズへのサイズ変更（バッチ処理に必要）
これらの画像のバッチを出力します

最も簡単なコードは次のとおりです。

# step 1 filenames = ['im_01.jpg', 'im_02.jpg', 'im_03.jpg', 'im_04.jpg'] # step 2 filename_queue = tf.train.string_input_producer(filenames) # step 3: read, decode and resize images reader = tf.WholeFileReader() filename, content = reader.read(filename_queue) image = tf.image.decode_jpeg(content, channels=3) image = tf.cast(image, tf.float32) resized_image = tf.image.resize_images(image, [224, 224]) # step 4: Batching image_batch = tf.train.batch([resized_image], batch_size=8)

TensorFlow：自分の画像でのトレーニング

新しい回答（tf.data付き）およびラベル付き

古い回答（TensorFlowキューを使用）

新しい回答（`tf.data`付き）およびラベル付き