Minimum working example of training hyperprior; Weights not updating #138

alexarmstrongvi · 2022-05-11T18:05:50Z

alexarmstrongvi
May 11, 2022

I am trying to create a dummy example to train the hyperprior of an entropy model. I used bls2017.py as my reference. The issue seems to be that the dummy model doesn't see the trainable variables in the prior. Any thoughts on what I am missing?

My environment:

python=3.7.7
tensorflow=2.3.0
tensorflow_compression=2.0b1

My dummy example:

import tensorflow as tf
import tensorflow_compression as tfc

class DummyModel(tf.keras.Model):
    def __init__(self):
        super().__init__()
        self.prior = tfc.NoisyDeepFactorized()
        self.build((None, 10))
    
    def call(self, inputs):
        entropy_model = tfc.ContinuousBatchedEntropyModel(self.prior, coding_rank=1, compression=False)
        _, bits = entropy_model(inputs, training=True)
        return tf.reduce_mean(bits)
model = DummyModel()

model.compile(tf.keras.optimizers.Adam(0.1), tf.keras.losses.MeanAbsoluteError())

x_train = tf.random.normal([10**6, 10], mean=5.0, stddev=0.5)
y_train = tf.zeros(shape=(x_train.shape[0],1))

init_vars = [v.numpy().mean() for v in model.prior.trainable_variables]
print('Prior weights:', len(model.prior.trainable_variables))
print('Model weights:', len(model.trainable_weights))
history = model.fit(x_train, y_train, batch_size=1024, epochs=2)
print(history.history)
unchanged = init_vars == [v.numpy().mean() for v in model.prior.trainable_variables]
print('Prior weights unchanged?', unchanged)

Output:

Prior weights: 8
Model weights: 0
Epoch 1/2
977/977 [==============================] - 2s 2ms/step - loss: 53.6712
Epoch 2/2
977/977 [==============================] - 2s 2ms/step - loss: 53.6712
{'loss': [53.67115783691406, 53.67123031616211]}
Prior weights unchanged? True

I am aiming to see the training step take the gradient of the average bits estimate (i.e. the MAE loss relative to a 0 target) w.r.t. the prior weights and then apply some update to those weights.

Answered by jonaballe

May 11, 2022

Hi, you are experiencing a bug in older TF releases. tf.keras.Model classes didn't collect trainable variables from all nested objects that inherit from tf.Module, only from ones that inherit from tf.keras.layers.Layer. Distribution objects would fall in this category. This was fixed in a later TF version. I think it was fixed in 2.5. I'd recommend using the latest version (2.8; 2.9 should probably be released end of this week). If that's not possible, there is a workaround, check out this commit.

View full answer

jonaballe · 2022-05-11T21:08:24Z

jonaballe
May 11, 2022
Maintainer

Hi, you are experiencing a bug in older TF releases. tf.keras.Model classes didn't collect trainable variables from all nested objects that inherit from tf.Module, only from ones that inherit from tf.keras.layers.Layer. Distribution objects would fall in this category. This was fixed in a later TF version. I think it was fixed in 2.5. I'd recommend using the latest version (2.8; 2.9 should probably be released end of this week). If that's not possible, there is a workaround, check out this commit.

1 reply

alexarmstrongvi May 11, 2022
Author

That was the issue. Thanks for the explanation as well as the workaround.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Minimum working example of training hyperprior; Weights not updating #138

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Minimum working example of training hyperprior; Weights not updating #138

Uh oh!

alexarmstrongvi May 11, 2022

Replies: 1 comment · 1 reply

Uh oh!

Uh oh!

jonaballe May 11, 2022 Maintainer

Uh oh!

alexarmstrongvi May 11, 2022 Author

alexarmstrongvi
May 11, 2022

Replies: 1 comment 1 reply

jonaballe
May 11, 2022
Maintainer

alexarmstrongvi May 11, 2022
Author