Using MaskedCategorical with ProbabilisticActor #2910

MaHaArt · 2025-04-18T08:37:32Z

I’m working on a reinforcement learning scenario with discrete action types and continuous parameters. I’m using a ProbabilisticActor with a CompositeDistribution. Initially, I used Categorical for the discrete action type and masked invalid actions directly in the logits. As a result, the KL divergence started to explode during training.
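To make the setup concrete, here is a minimal standard-library sketch of what "masking in the logits" means. It is an illustration only, not the original training code: the helper names `masked_softmax` and `kl_divergence` and all the numbers are made up for this example. It also shows one way a KL term can become infinite, namely when the two distributions being compared were built with different masks. Whether that is what happened in this case is an assumption, not something confirmed here.

```python
import math


def masked_softmax(logits, mask):
    """Softmax over valid entries only; masked entries get probability 0."""
    valid = [x for x, m in zip(logits, mask) if m]
    if not valid:
        raise ValueError("at least one action must be valid")
    top = max(valid)  # subtract the max for numerical stability
    exps = [math.exp(x - top) if m else 0.0 for x, m in zip(logits, mask)]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q):
    """KL(p || q) with the convention 0 * log(0 / q) = 0."""
    total = 0.0
    for pi, qi in zip(p, q):
        if pi == 0.0:
            continue  # contributes nothing
        if qi == 0.0:
            return math.inf  # p puts mass where q has none
        total += pi * math.log(pi / qi)
    return total


# Hypothetical logits from an "old" and a "new" policy for 4 action types.
old_logits = [0.1, 5.0, -0.3, 0.7]
new_logits = [0.2, -2.0, -0.1, 0.5]
mask = [True, False, True, True]  # action 1 is invalid

old = masked_softmax(old_logits, mask)
new = masked_softmax(new_logits, mask)
kl_same_mask = kl_divergence(old, new)  # finite: both share the same support

# If the second distribution is built with a different mask, the supports
# no longer match and the KL is infinite.
other_mask = [False, False, True, True]
kl_mismatched = kl_divergence(old, masked_softmax(new_logits, other_mask))
```

With the same mask on both sides the KL stays finite no matter how large the masked logit is, because the masked entry never enters the sum.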

I’m now considering switching to torchrl.modules.MaskedCategorical instead of Categorical. However, it seems that the mask is not being passed correctly.

Question: Has anyone successfully used MaskedCategorical with a ProbabilisticActor and could share some hints?


Replies: 0 comments
