is it possible that from_module and to_module doesn't work well for LSTM modules?

Hello!

I am using this logic in my codebase, inspired from [this](https://github.com/meta-pytorch/LeanRL/blob/760837e0844e32bb5e1b06afb612ac694f7240ca/leanrl/ppo_atari_envpool_torchcompile.py#L332)

```
self.agent_inference = type(self.policy)(**inference_kwargs)
self.agent_inference_p = from_module(self.policy).data
self.agent_inference_p.to_module(self.agent_inference)
```

and while this works well for all policy classes in my project, if there is an LSTM inside the policy, then it stops working.

I have verified this is the only reason why the LSTM policies don't work, because when I comment out these 3 lines and use self.policy instead of self.agent_inference, the LSTM-equipped agents actually learn well.

What is going on?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

is it possible that from_module and to_module doesn't work well for LSTM modules? #1441

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

is it possible that from_module and to_module doesn't work well for LSTM modules? #1441

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions