kerasカスタムレイヤーでのブロードキャストによる要素ごとの乗算

Question

アクティブ化する前に要素ごとに乗算する必要がある重みを使用してカスタムレイヤーを作成しています。出力と入力が同じ形状の場合に動作させることができます。この問題は、入力として1次配列があり、出力として2次配列がある場合に発生します。 tensorflow.multiplyはブロードキャストをサポートしていますが、Layer.call（x、self.kernel）でxにself.kernel変数を掛けようとすると、次のように異なる形状であると文句を言います。

ValueError: Dimensions must be equal, but are 4 and 3 for 'my_layer_1/Mul' (op: 'Mul') with input shapes: [?,4], [4,3].

これが私のコードです：

from keras import backend as K from keras.engine.topology import Layer import tensorflow as tf from keras.models import Sequential import numpy as np class MyLayer(Layer): def __init__(self, output_dims, **kwargs): self.output_dims = output_dims super(MyLayer, self).__init__(**kwargs) def build(self, input_shape): # Create a trainable weight variable for this layer. self.kernel = self.add_weight(name='kernel', shape=self.output_dims, initializer='ones', trainable=True) super(MyLayer, self).build(input_shape) # Be sure to call this somewhere! def call(self, x): #multiply wont work here? return K.tf.multiply(x, self.kernel) def compute_output_shape(self, input_shape): return (self.output_dims) mInput = np.array([[1,2,3,4]]) inShape = (4,) net = Sequential() outShape = (4,3) l1 = MyLayer(outShape, input_shape= inShape) net.add(l1) net.compile(loss='mean_absolute_error', optimizer='adam', metrics=['accuracy']) p = net.predict(x=mInput, batch_size=1) print(p)

編集：入力形状（4、）と出力形状（4,3）が与えられた場合、重み行列は出力と同じ形状であり、1で初期化されている必要があります。したがって、上記のコードでは、入力は[1,2,3,4]であり、重み行列は[[1,1,1,1]、[1,1,1,1]、[1,1,1]である必要があります。、1]]であり、出力は[[1,2,3,4]、[1,2,3,4]、[1,2,3,4]]のようになります。

Daniel M&#246;ller · Accepted Answer

掛ける前に、要素を繰り返して形を大きくする必要があります。そのためにK.repeat_elementsを使用できます。（import keras.backend as K）

class MyLayer(Layer): #there are some difficulties for different types of shapes #let's use a 'repeat_count' instead, increasing only one dimension def __init__(self, repeat_count,**kwargs): self.repeat_count = repeat_count super(MyLayer, self).__init__(**kwargs) def build(self, input_shape): #first, let's get the output_shape output_shape = self.compute_output_shape(input_shape) weight_shape = (1,) + output_shape[1:] #replace the batch size by 1 self.kernel = self.add_weight(name='kernel', shape=weight_shape, initializer='ones', trainable=True) super(MyLayer, self).build(input_shape) # Be sure to call this somewhere! #here, we need to repeat the elements before multiplying def call(self, x): if self.repeat_count > 1: #we add the extra dimension: x = K.expand_dims(x, axis=1) #we replicate the elements x = K.repeat_elements(x, rep=self.repeat_count, axis=1) #multiply return x * self.kernel #make sure we comput the ouptut shape according to what we did in "call" def compute_output_shape(self, input_shape): if self.repeat_count > 1: return (input_shape[0],self.repeat_count) + input_shape[1:] else: return input_shape

buckithed · Answer

これは、DanielMöllerの回答に基づいているが、元のコードと同じようにtf.multiplyを使用する別のソリューションです。

class MyLayer(Layer): def __init__(self, output_dim, **kwargs): self.output_dim = output_dim super(MyLayer, self).__init__(**kwargs) def build(self, input_shape): # Create a trainable weight variable for this layer. output_shape = self.compute_output_shape(input_shape) self.kernel = self.add_weight(name='kernel', shape=(1,) + output_shape[1:], initializer='ones', trainable=True) super(MyLayer, self).build(input_shape) # Be sure to call this somewhere! def call(self, x): return K.tf.multiply(x, self.kernel) def compute_output_shape(self, input_shape): return (input_shape[0],self.output_dim)+input_shape[1:]