Any update on using keras.distribution with TensorFlow? Currently it supports JAX only.
When using Keras 3.8.0 or 3.7.0 with TensorFlow 2.18.0 to train on multiple GPUs with MirroredStrategy, you typically get NaNs after a few training steps. Downgrading to Keras 3.6.0 fixes this. Maybe there is a bug in these versions? Here is a minimal example I posted on Stack Overflow a while back:
https://stackoverflow.com/questions/79285532/multi-gpu-training-in-tensorflow-results-in-nans
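For anyone who wants to check whether their environment matches the combination described above, here is a rough sketch. The helper names `is_reported_combo` and `installed_version` are hypothetical, and the version list only reflects what is reported in this post, not a confirmed affected range.

```python
from importlib import metadata

# Versions reported in this post as producing NaNs with MirroredStrategy.
# Assumption: this is only the reported combination, not a confirmed bug range.
REPORTED_KERAS = {"3.7.0", "3.8.0"}
REPORTED_TF = "2.18.0"


def is_reported_combo(keras_version: str, tf_version: str) -> bool:
    """Return True if the versions match the combination reported above."""
    return keras_version in REPORTED_KERAS and tf_version == REPORTED_TF


def installed_version(package: str):
    """Return the installed version string of a package, or None if absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    keras_v = installed_version("keras")
    tf_v = installed_version("tensorflow")
    print(f"keras={keras_v}, tensorflow={tf_v}")
    if keras_v and tf_v and is_reported_combo(keras_v, tf_v):
        print("Matches the reported combination; Keras 3.6.0 was reported to work.")
```

The check is an exact string match on purpose, since only these specific versions were tried; other releases may or may not be affected.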