TensorBoardのTensorflow混同マトリックス

Question

テンソルボードで混同行列のビジュアルを持ちたいです。これを行うために、Tensorflow Slimの評価例を変更しています： https://github.com/tensorflow/models/blob/master/slim/eval_image_classifier.py

このサンプルコードでは、精度は既に指定されていますが、ストリーミングではないため、「混同マトリックス」メトリックを直接追加することはできません。

ストリーミングメトリックスと非ストリーミングメトリックスの違いは何ですか？

したがって、次のように追加しようとしました。

c_matrix = slim.metrics.confusion_matrix(predictions, labels) #These operations needed for image summary c_matrix = tf.cast(c_matrix, uint8) c_matrix = tf.expand_dims(c_matrix, 2) c_matrix = tf.expand_dims(c_matrix, 0) op = tf.image_summary("confusion matrix", c_matrix, collections=[]) tf.add_to_collection(tf.GraphKeys.SUMMARIES, op)

これにより、テンソルボードに画像が作成されますが、おそらくフォーマットの問題があります。意味のある画像が生成されるように、マトリックスは0〜1の間で正規化する必要があります。

どうすれば意味のある混同マトリックスを作成できますか？マルチバッチ評価プロセスに対処するにはどうすればよいですか？

MLNINJA · Answer

ここに私がまとめたものがありますが、それはかなりうまくいきます。目盛りの配置など、いくつかの調整が必要です。

以下は、ほとんどすべての処理を実行する関数です。

from textwrap import wrap import re import itertools import tfplot import matplotlib import numpy as np from sklearn.metrics import confusion_matrix def plot_confusion_matrix(correct_labels, predict_labels, labels, title='Confusion matrix', tensor_name = 'MyFigure/image', normalize=False): ''' Parameters: correct_labels : These are your true classification categories. predict_labels : These are you predicted classification categories labels : This is a lit of labels which will be used to display the axix labels title='Confusion matrix' : Title for your matrix tensor_name = 'MyFigure/image' : Name for the output summay tensor Returns: summary: TensorFlow summary Other itema to note: - Depending on the number of category and the data , you may have to modify the figzie, font sizes etc. - Currently, some of the ticks dont line up due to rotations. ''' cm = confusion_matrix(correct_labels, predict_labels, labels=labels) if normalize: cm = cm.astype('float')*10 / cm.sum(axis=1)[:, np.newaxis] cm = np.nan_to_num(cm, copy=True) cm = cm.astype('int') np.set_printoptions(precision=2) ###fig, ax = matplotlib.figure.Figure() fig = matplotlib.figure.Figure(figsize=(7, 7), dpi=320, facecolor='w', edgecolor='k') ax = fig.add_subplot(1, 1, 1) im = ax.imshow(cm, cmap='Oranges') classes = [re.sub(r'([a-z](?=[A-Z])|[A-Z](?=[A-Z][a-z]))', r'\1 ', x) for x in labels] classes = ['
'.join(wrap(l, 40)) for l in classes] tick_marks = np.arange(len(classes)) ax.set_xlabel('Predicted', fontsize=7) ax.set_xticks(tick_marks) c = ax.set_xticklabels(classes, fontsize=4, rotation=-90, ha='center') ax.xaxis.set_label_position('bottom') ax.xaxis.tick_bottom() ax.set_ylabel('True Label', fontsize=7) ax.set_yticks(tick_marks) ax.set_yticklabels(classes, fontsize=4, va ='center') ax.yaxis.set_label_position('left') ax.yaxis.tick_left() for i, j in itertools.product(range(cm.shape[0]), range(cm.shape[1])): ax.text(j, i, format(cm[i, j], 'd') if cm[i,j]!=0 else '.', horizontalalignment="center", fontsize=6, verticalalignment='center', color= "black") fig.set_tight_layout(True) summary = tfplot.figure.to_summary(fig, tag=tensor_name) return summary

そして、この関数を呼び出すために必要な残りのコードは次のとおりです。

''' confusion matrix summaries ''' img_d_summary_dir = os.path.join(checkpoint_dir, "summaries", "img") img_d_summary_writer = tf.summary.FileWriter(img_d_summary_dir, sess.graph) img_d_summary = plot_confusion_matrix(correct_labels, predict_labels, labels, tensor_name='dev/cm') img_d_summary_writer.add_summary(img_d_summary, current_step)

混乱させる!!!

Jerod · Answer

テストコードの「ストリーミング」混同マトリックスを作成および表示する方法を次に示します（返される_test_op_は、テスト対象の各バッチに対して評価されます）。

_def _get_streaming_metrics(prediction,label,num_classes): with tf.name_scope("test"): # the streaming accuracy (lookup and update tensors) accuracy,accuracy_update = tf.metrics.accuracy(label, prediction, name='accuracy') # Compute a per-batch confusion batch_confusion = tf.confusion_matrix(label, prediction, num_classes=num_classes, name='batch_confusion') # Create an accumulator variable to hold the counts confusion = tf.Variable( tf.zeros([num_classes,num_classes], dtype=tf.int32 ), name='confusion' ) # Create the update op for doing a "+=" accumulation on the batch confusion_update = confusion.assign( confusion + batch_confusion ) # Cast counts to float so tf.summary.image renormalizes to [0,255] confusion_image = tf.reshape( tf.cast( confusion, tf.float32), [1, num_classes, num_classes, 1]) # Combine streaming accuracy and confusion matrix updates in one op test_op = tf.group(accuracy_update, confusion_update) tf.summary.image('confusion',confusion_image) tf.summary.scalar('accuracy',accuracy) return test_op,accuracy,confusion _

_test_op_を実行してすべてのデータバッチを処理した後、必要に応じてconfusion.eval()またはsess.eval(confusion)によって（セッション内で）最終的な混同マトリックスを簡単に検索できます。

ma3oun · Answer

以下は、tf.contrib.metrics.MetricSpecで機能するものです（Estimatorを使用する場合）。 Jerodの回答とmetric_op.pyソースファイルからインスピレーションを得ています。パーセンテージでストリーム化された混同行列を取得します：

from tensorflow.python.framework import ops,dtypes from tensorflow.python.ops import array_ops,variables def _createLocalVariable(name, shape, collections=None, validate_shape=True, dtype=dtypes.float32): """Creates a new local variable. """ # Make sure local variables are added to # tf.GraphKeys.LOCAL_VARIABLES collections = list(collections or []) collections += [ops.GraphKeys.LOCAL_VARIABLES] return variables.Variable( initial_value=array_ops.zeros(shape, dtype=dtype), name=name, trainable=False, collections=collections, validate_shape=validate_shape) def streamingConfusionMatrix(label, prediction, weights=None,num_classes=None): """ Compute a streaming confusion matrix :param label: True labels :param prediction: Predicted labels :param weights: (Optional) weights (unused) :param num_classes: Number of labels for the confusion matrix :return: (percentConfusionMatrix,updateOp) """ # Compute a per-batch confusion batch_confusion = tf.confusion_matrix(label, prediction, num_classes=num_classes, name='batch_confusion') count = _createLocalVariable(None,(),dtype=tf.int32) confusion = _createLocalVariable('streamConfusion',[num_classes, num_classes],dtype=tf.int32) # Create the update op for doing a "+=" accumulation on the batch countUpdate = count.assign(count + tf.reduce_sum(batch_confusion)) confusionUpdate = confusion.assign(confusion + batch_confusion) updateOp = tf.group(confusionUpdate,countUpdate) percentConfusion = 100 * tf.truediv(confusion,count) return percentConfusion,updateOp

その後、次の方法で評価指標として使用できます。

from tensorflow.contrib import learn,metrics #[...] evalMetrics = {'accuracy': learn.MetricSpec(metric_fn=metrics.streaming_accuracy), 'confusionMatrix':learn.MetricSpec(metric_fn= lambda label,prediction,weights=None: streamingConfusionMatrix( label,prediction,weights,num_classes=nLabels))}

Numpy.set_printoptions（precision = 2、suppress = True）を使用して印刷することをお勧めします。

dandelion · Answer

再：あなたの画像は意味がありません- tf.summary.image のドキュメントによると、uint8の値は変更されず（正規化されない）、範囲[0、255]で解釈されます。画像を[0,1]ではなく[0,255]に再正規化しようとしましたか？