再整形せずにテンソルフローでベクトルと行列を乗算するにはどうすればよいですか？

Question

この：

import numpy as np a = np.array([1, 2, 1]) w = np.array([[.5, .6], [.7, .8], [.7, .8]]) print(np.dot(a, w)) # [ 2.6 3. ] # plain Nice old matrix multiplication n x (n, m) -> m import tensorflow as tf a = tf.constant(a, dtype=tf.float64) w = tf.constant(w) with tf.Session() as sess: print(tf.matmul(a, w).eval())

結果：

C:\_\Python35\python.exe C:/Users/MrD/.PyCharm2017.1/config/scratches/scratch_31.py [ 2.6 3. ] # bunch of errors in windows... Traceback (most recent call last): File "C:\_\Python35\lib\site-packages	ensorflow\python\framework\common_shapes.py", line 671, in _call_cpp_shape_fn_impl input_tensors_as_shapes, status) File "C:\_\Python35\lib\contextlib.py", line 66, in __exit__ next(self.gen) File "C:\_\Python35\lib\site-packages	ensorflow\python\framework\errors_impl.py", line 466, in raise_exception_on_not_ok_status pywrap_tensorflow.TF_GetCode(status)) tensorflow.python.framework.errors_impl.InvalidArgumentError: Shape must be rank 2 but is rank 1 for 'MatMul' (op: 'MatMul') with input shapes: [3], [3,2]. During handling of the above exception, another exception occurred: Traceback (most recent call last): File "C:/Users/MrD/.PyCharm2017.1/config/scratches/scratch_31.py", line 14, in <module> print(tf.matmul(a, w).eval()) File "C:\_\Python35\lib\site-packages	ensorflow\python\ops\math_ops.py", line 1765, in matmul a, b, transpose_a=transpose_a, transpose_b=transpose_b, name=name) File "C:\_\Python35\lib\site-packages	ensorflow\python\ops\gen_math_ops.py", line 1454, in _mat_mul transpose_b=transpose_b, name=name) File "C:\_\Python35\lib\site-packages	ensorflow\python\framework\op_def_library.py", line 763, in apply_op op_def=op_def) File "C:\_\Python35\lib\site-packages	ensorflow\python\framework\ops.py", line 2329, in create_op set_shapes_for_outputs(ret) File "C:\_\Python35\lib\site-packages	ensorflow\python\framework\ops.py", line 1717, in set_shapes_for_outputs shapes = shape_func(op) File "C:\_\Python35\lib\site-packages	ensorflow\python\framework\ops.py", line 1667, in call_with_requiring return call_cpp_shape_fn(op, require_shape_fn=True) File "C:\_\Python35\lib\site-packages	ensorflow\python\framework\common_shapes.py", line 610, in call_cpp_shape_fn debug_python_shape_fn, require_shape_fn) File "C:\_\Python35\lib\site-packages	ensorflow\python\framework\common_shapes.py", line 676, in _call_cpp_shape_fn_impl raise ValueError(err.message) ValueError: Shape must be rank 2 but is rank 1 for 'MatMul' (op: 'MatMul') with input shapes: [3], [3,2]. Process finished with exit code 1

（処理中に同じ例外が発生する理由がわからない）

matmulを使用したTensorflow例外で提案されている解決策は、ベクトルを行列に再形成しますが、これは不必要に複雑なコードになります-ベクトルと行列を乗算する他の方法はまだありませんか？

ちなみにexpand_dims（上記のリンクで提案されているように）デフォルト引数を使用すると、ValueErrorが発生します。これは docs には記載されておらず、デフォルト引数を使用する目的に反します。

Steven · Accepted Answer

Matmulは、ランク2以上のテンソル用にコーディングされています。 numpyが行列ベクトル乗算も可能にするほどそれを持っているので、なぜ正直であるのか分かりません。

import numpy as np a = np.array([1, 2, 1]) w = np.array([[.5, .6], [.7, .8], [.7, .8]]) print(np.dot(a, w)) # [ 2.6 3. ] # plain Nice old matix multiplication n x (n, m) -> m print(np.sum(np.expand_dims(a, -1) * w , axis=0)) # equivalent result [2.6, 3] import tensorflow as tf a = tf.constant(a, dtype=tf.float64) w = tf.constant(w) with tf.Session() as sess: # they all produce the same result as numpy above print(tf.matmul(tf.expand_dims(a,0), w).eval()) print((tf.reduce_sum(tf.multiply(tf.expand_dims(a,-1), w), axis=0)).eval()) print((tf.reduce_sum(tf.multiply(a, tf.transpose(w)), axis=1)).eval()) # Note tf.multiply is equivalent to "*" print((tf.reduce_sum(tf.expand_dims(a,-1) * w, axis=0)).eval()) print((tf.reduce_sum(a * tf.transpose(w), axis=1)).eval())

dsalaj · Answer

_tf.einsum_ を使用すると、簡潔で直感的な形式で必要なことを正確に実行できます。

_with tf.Session() as sess: print(tf.einsum('n,nm->m', a, w).eval()) # [ 2.6 3. ] _

コメントを明示的にn x (n, m) -> mと書くことさえできます。私の意見では、より読みやすく直感的です。

私のお気に入りの使用例は、行列のバッチに重みベクトルを乗算する場合です。

_n_in = 10 n_step = 6 input = tf.placeholder(dtype=tf.float32, shape=(None, n_step, n_in)) weights = tf.Variable(tf.truncated_normal((n_in, 1), stddev=1.0/np.sqrt(n_in))) Y_predict = tf.einsum('ijk,kl->ijl', input, weights) print(Y_predict.get_shape()) # (?, 6, 1) _

そのため、変換や複製を行うことなく、すべてのバッチに重みを簡単に掛けることができます。これは、他の答えのように次元を拡張することではできません。したがって、 _tf.matmul_ 要件を回避して、バッチおよびその他の外部ディメンションに一致するディメンションを持たせる必要があります。

入力は、転置に続いて、ランク2以上のテンソルでなければなりません。内側の2次元は有効な行列乗算引数を指定し、さらに外側の次元は一致します。

Hooked · Answer

tf.tensordot を使用して、axes=1を設定できます。ベクトルと行列の単純な演算では、これは tf.einsum よりも少しきれいです。

tf.tensordot(a, w, 1)