You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
then you will see "NaN or Inf found in input tensor" in terminal, it actually because of some of the KL divergence being inf.
I would like to ask if this phenomenon has been noticed, whether this is allowed or whether the hyperparameters need further adjustment.
The text was updated successfully, but these errors were encountered:
Hi, this does occur during the training of the high-level policy, but it currently doesn't seem to affect the results. We plan to address this issue later, so for now, you can consider it acceptable.
Steps to reproduce:
Increase
max_epochs
inskillmimic/data/cfg/train/rlg/hrl_humanoid_discrete_layupscore.yaml
and run
then you will see "NaN or Inf found in input tensor" in terminal, it actually because of some of the KL divergence being inf.
I would like to ask if this phenomenon has been noticed, whether this is allowed or whether the hyperparameters need further adjustment.
The text was updated successfully, but these errors were encountered: