ZCAホワイトニングを実装する方法？ Python

Question

私はZCAホワイトニングを実装しようとしていて、それを行う記事をいくつか見つけましたが、それらは少し混乱しています。誰かが私に光を当てることができますか？

ヒントやヘルプは大歓迎です！

ここに私が読んだ記事があります：

http://courses.media.mit.edu/2010fall/mas622j/whiten.pdf http://bbabenko.tumblr.com/post/86756017649/learning-low-level- vision-feautres-in-10-lines-of

私はいくつかのことを試しましたが、それらのほとんどは理解できず、ある段階でロックされました。今私はこれをベースとして再び始めます：

dtype = np.float32 data = np.loadtxt("../inputData/train.csv", dtype=dtype, delimiter=',', skiprows=1) img = ((data[1,1:]).reshape((28,28)).astype('uint8')*255)

Pascal Timshel · Answer

これは、ZCAホワイトニングマトリックスを生成するためのpython関数です。

_def zca_whitening_matrix(X): """ Function to compute ZCA whitening matrix (aka Mahalanobis whitening). INPUT: X: [M x N] matrix. Rows: Variables Columns: Observations OUTPUT: ZCAMatrix: [M x M] matrix """ # Covariance matrix [column-wise variables]: Sigma = (X-mu)' * (X-mu) / N sigma = np.cov(X, rowvar=True) # [M x M] # Singular Value Decomposition. X = U * np.diag(S) * V U,S,V = np.linalg.svd(sigma) # U: [M x M] eigenvectors of sigma. # S: [M x 1] eigenvalues of sigma. # V: [M x M] transpose of U # Whitening constant: prevents division by zero epsilon = 1e-5 # ZCA Whitening matrix: U * Lambda * U' ZCAMatrix = np.dot(U, np.dot(np.diag(1.0/np.sqrt(S + epsilon)), U.T)) # [M x M] return ZCAMatrix _

そして使用例：

_X = np.array([[0, 2, 2], [1, 1, 0], [2, 0, 1], [1, 3, 5], [10, 10, 10] ]) # Input: X [5 x 3] matrix ZCAMatrix = zca_whitening_matrix(X) # get ZCAMatrix ZCAMatrix # [5 x 5] matrix xZCAMatrix = np.dot(ZCAMatrix, X) # project X onto the ZCAMatrix xZCAMatrix # [5 x 3] matrix _

それが役に立てば幸い！

理由の詳細 EdgarAndrésMargffoy Tuay の答えは正しくありません： RM で指摘されているとおり= 's comment 、 EdgarAndrésMargffoy Tuay のZCAホワイトニング関数には、小さいが重大な誤りがあります。np.diag(S)を削除する必要があります。 NumpyはSをm x 1ベクトルとして返し、m x m行列ではありません（Matlabなどの他のsvd実装に共通しています）。したがって、ZCAMatrix変数は、本来のm x m行列ではなく、m x 1ベクトルになります（入力がm x nの場合）。（また、Andfoyの回答の共分散行列は、Xが事前に中央揃えされている場合、つまり0を意味する場合にのみ有効です）。

ZCAに関するその他の参考資料：Pythonで、スタンフォードUFLDL ZCAホワイトニングの演習ここに対する完全な回答を見ることができます。

Edgar Andr&#233;s Margffoy Tuay · Answer

データはmxnマトリックスに保存されていますか？ここで、mはデータの次元であり、nはケースの総数です。そうでない場合は、データのサイズを変更する必要があります。たとえば、画像のサイズが28x28で、画像が1つしかない場合は、1x784のベクターが必要です。この関数を使用できます：

import numpy as np def flatten_matrix(matrix): vector = matrix.flatten(1) vector = vector.reshape(1, len(vector)) return vector

次に、以下を使用してZCAホワイトニングをトレーニングセットに適用します。

def zca_whitening(inputs): sigma = np.dot(inputs, inputs.T)/inputs.shape[1] #Correlation matrix U,S,V = np.linalg.svd(sigma) #Singular Value Decomposition epsilon = 0.1 #Whitening constant, it prevents division by zero ZCAMatrix = np.dot(np.dot(U, np.diag(1.0/np.sqrt(np.diag(S) + epsilon))), U.T) #ZCA Whitening matrix return np.dot(ZCAMatrix, inputs) #Data whitening

ZCAMatrixマトリックスを保存することが重要です。ニューラルネットのトレーニング後に予測する場合は、テストケースを乗算する必要があります。

最後に、スタンフォードUFLDLチュートリアル http://ufldl.stanford.edu/wiki/index.php/UFLDL_Tutorial または http://ufldl.stanford.edu/tutorial / 。それらにはかなり良い説明があり、MATLABでのプログラミング演習もいくつかありますが、MATLABにあるほとんどすべての関数は同じ名前でNumpyにあります。これが洞察につながることを願っています。

Samuele Cornell · Answer

私は議論に少し遅れるかもしれませんが、貧弱なPCプロセッサが大量のデータを処理するには遅すぎたためにTensorFlowでZCAを実装するのに苦労して、このスレッドを最近見つけました。

誰か興味があれば、TensorFlowでのZCAの実装について Gist を作成しました。

import tensorflow as tf from keras.datasets import mnist import numpy as np tf.enable_eager_execution() assert tf.executing_eagerly() class ZCA(object): """ Simple ZCA aka Mahalanobis transformation class made in TensorFlow. The code was largely ported from Keras ImageDataGenerator """ def __init__(self, epsilon=1e-5, dtype='float64'): """epsilon is the normalization constant, dtype refers to the data type used in the computation. WARNING: the default precision is set to float64 as i have found that when computing the mean tensorflow' and numpy results can differ by a substantial amount. Usage: fit method computes the principal components and should be called first, compute method returns the actual transformed tensor NOTE : The input to both methods must be a 4D tensor. """ assert dtype is 'float32' or 'float64', "precision must be float32 or float64" self.epsilon = epsilon self.dtype = dtype self.princ_comp = None self.mean = None def _featurewise_center(self, images_tensor): if self.mean is None: self.mean, _ = tf.nn.moments(images_tensor, axes=(0, 1, 2)) broadcast_shape = [1, 1, 1] broadcast_shape[2] = images_tensor.shape[3] self.mean = tf.reshape(self.mean, broadcast_shape) norm_images = tf.subtract(images_tensor, self.mean) return norm_images def fit(self, images_tensor): assert images_tensor.shape[3], "The input should be a 4D tensor" if images_tensor.dtype is not self.dtype: # numerical error for float32 images_tensor = tf.cast(images_tensor, self.dtype) images_tensor = self._featurewise_center(images_tensor) flat = tf.reshape(images_tensor, (-1, np.prod(images_tensor.shape[1:].as_list()))) sigma = tf.div(tf.matmul(tf.transpose(flat), flat), tf.cast(flat.shape[0], self.dtype)) s, u, _ = tf.svd(sigma) s_inv = tf.div(tf.cast(1, self.dtype), (tf.sqrt(tf.add(s[tf.newaxis], self.epsilon)))) self.princ_comp = tf.matmul(tf.multiply(u, s_inv), tf.transpose(u)) def compute(self, images_tensor): assert images_tensor.shape[3], "The input should be a 4D tensor" assert self.princ_comp is not None, "Fit method should be called first" if images_tensor.dtype is not self.dtype: images_tensor = tf.cast(images_tensor, self.dtype) images_tensors = self._featurewise_center(images_tensor) flatx = tf.cast(tf.reshape(images_tensors, (-1, np.prod(images_tensors.shape[1:]))), self.dtype) whitex = tf.matmul(flatx, self.princ_comp) x = tf.reshape(whitex, images_tensors.shape) return x def main(): import matplotlib.pyplot as plt train_set, test_set = mnist.load_data() x_train, y_train = train_set zca1 = ZCA(epsilon=1e-5, dtype='float64') # input should be a 4D tensor x_train = x_train.reshape(*x_train.shape, 1) zca1.fit(x_train) x_train_transf = zca1.compute(x_train) # reshaping to 28*28 and casting to uint8 for plotting x_train_transf = tf.reshape(x_train_transf, x_train_transf.shape[0:3]) fig, axes = plt.subplots(3, 3) for i, ax in enumerate(axes.flat): # Plot image. ax.imshow(x_train_transf[i], cmap='binary' ) xlabel = "True: %d" % y_train[i] ax.set_xlabel(xlabel) ax.set_xticks([]) ax.set_yticks([]) plt.show() if __name__ == '__main__': main()

これは元の質問に対する適切な回答ではないことはわかっていますが、ZCAのGPU実装を探しているが見つからない場合は、それでも役立つかもしれません。

Mr Tsjolder · Answer

どちらの回答も FLDLチュートリアルを参照していますが、説明されている手順を使用しているようには見えません。

したがって、チュートリアルに従ってPCA/ZCAのホワイトニングを実装するだけの回答を提供することは悪い考えではないかもしれないと思いました。

import numpy as np # generate some random, 2D data x = np.random.randn(1000, 2) # and center it x_c = x - np.mean(x, 0) # compute the 2x2 covariance matrix # (remember that covariance matrix is symmetric) sigma = np.cov(x, rowvar=False) # and extract eigenvalues and eigenvectors # using the algorithm for symmetric matrices l,u = np.linalg.eigh(sigma) # NOTE that for symmetric matrices, # eigenvalues and singular values are the same. # u, l, _ = np.linalg.svd(sigma) should thus give equivalent results # rotate the (centered) data to decorrelate it x_rot = np.dot(x_c, u) # check that the covariance is diagonal (indicating decorrelation) np.allclose(np.cov(x_rot.T), np.diag(np.diag(np.cov(x_rot.T)))) # scale the data by eigenvalues to get unit variance x_white = x_rot / np.sqrt(l) # have the whitened data be closer to the original data x_zca = np.dot(x_white, u.T)

自分でこれを関数にラップできると思います...

別の実装は、より効率的な場合があります（数値計算コースで覚えていることから、より優れた安定性を持つはずです）。

_,s,v = np.linalg.svd(x_c) x_rot = np.dot(x_c, v.T) x_white = x_rot / s * np.sqrt(len(x_c) - 1) x_zca = np.dot(x_white, v)

この背後にある数学については、相互検証済みのこの優れた答えを参照します。

malcubierre · Answer

これは48x48の配列で機能します。

def flatten_matrix(matrix): vector = matrix.flatten(order='F') vector = vector.reshape(1, len(vector)) return vector def zca_whitening(inputs): sigma = np.dot(inputs, inputs.T)/inputs.shape[1] #Correlation matrix U,S,V = np.linalg.svd(sigma) #Singular Value Decomposition epsilon = 0.1 #Whitening constant, it prevents division by zero ZCAMatrix = np.dot(np.dot(U, np.diag(1.0/np.sqrt(np.diag(S) + epsilon))), U.T) #ZCA Whitening matrix return np.dot(ZCAMatrix, inputs) #Data whitening def global_contrast_normalize(X, scale=1., subtract_mean=True, use_std=True, sqrt_bias=10, min_divisor=1e-8): """ __author__ = "David Warde-Farley" __copyright__ = "Copyright 2012, Universite de Montreal" __credits__ = ["David Warde-Farley"] __license__ = "3-clause BSD" __email__ = "wardefar@iro" __maintainer__ = "David Warde-Farley" .. [1] A. Coates, H. Lee and A. Ng. "An Analysis of Single-Layer Networks in Unsupervised Feature Learning". AISTATS 14, 2011. http://www.stanford.edu/~acoates/papers/coatesleeng_aistats_2011.pdf """ assert X.ndim == 2, "X.ndim must be 2" scale = float(scale) assert scale >= min_divisor mean = X.mean(axis=1) if subtract_mean: X = X - mean[:, np.newaxis] else: X = X.copy() if use_std: ddof = 1 if X.shape[1] == 1: ddof = 0 normalizers = np.sqrt(sqrt_bias + X.var(axis=1, ddof=ddof)) / scale else: normalizers = np.sqrt(sqrt_bias + (X ** 2).sum(axis=1)) / scale normalizers[normalizers < min_divisor] = 1. X /= normalizers[:, np.newaxis] # Does not make a copy. return X def ZeroCenter(data): data = data - np.mean(data,axis=0) return data def Zerocenter_ZCA_whitening_Global_Contrast_Normalize(data): numpy_data = np.array(data).reshape(48,48) data2 = ZeroCenter(numpy_data) data3 = zca_whitening(flatten_matrix(data2)).reshape(48,48) data4 = global_contrast_normalize(data3) data5 = np.rot90(data4,3) return data5

たとえばこの画像から：

戻り値：

これがコードです：

https://Gist.github.com/m-alcu/45f4a083cb5e388d2ed26ace4392ed66 、fer2013.csvファイルを同じディレクトリに配置する必要があります（ https://www.kaggle.com/c/challenges-in-representation-learning-facial-expression-recognition-challenge/data ）