| Topic | | Replies | Views | Activity |
| --- | --- | --- | --- | --- |
| About the reinforcement-learning category | | 7 | 4486 | October 18, 2023 |
| Performance Differences in TD3 Training When Switching from NumPy 1.26.0 to NumPy 2.2.6 | | 0 | 12 | December 9, 2025 |
| Several implementations of TruncatedNormal? | | 0 | 16 | December 1, 2025 |
| PPO learning poorly on LunarLander-v3 | | 2 | 104 | November 18, 2025 |
| Apparent RAM memory leak when converting batch of ndarray states to GPU tensor | | 0 | 37 | October 29, 2025 |
| Training Machine Learning Model In Browser For Reinforcement Learning | | 1 | 122 | October 27, 2025 |
| Model Boilerplate for a Simple DQN | | 3 | 81 | October 22, 2025 |
| Implementation of Hierarchical Actor Critic with Policy-on Policy-off Policy Optimization for primitive actions | | 0 | 43 | October 15, 2025 |
| PyTorch Compatibility with Older CUDA Versions | | 1 | 51 | September 20, 2025 |
| Agent Masking in Multi-agent environment? | | 0 | 37 | September 7, 2025 |
| Batching a multicategorical spec | | 4 | 111 | August 27, 2025 |
| How to pass options to env.reset within a data collector | | 0 | 38 | August 26, 2025 |
| Environments from scratch with Torchrl | | 17 | 1408 | August 25, 2025 |
| How to manage done in a batched custom Env? | | 3 | 75 | August 25, 2025 |
| ClipPPOLoss problem with MaskedCategorical dist | | 2 | 55 | August 21, 2025 |
| CosTrader Env from scratch... and transform problem | | 3 | 53 | August 15, 2025 |
| PPO with Categorical Action... help | | 10 | 143 | August 14, 2025 |
| Question about TorchRL ParallelEnv error on single-gpu device | | 3 | 84 | August 5, 2025 |
| Help understanding data collectors | | 1 | 65 | August 4, 2025 |
| Should we split the trajectories prior to calculating the loss for a DQN? | | 1 | 42 | August 4, 2025 |
| Question About If PPO Training Will Work | | 1 | 90 | July 29, 2025 |
| RTX 5090 interconnection with pytorch | | 6 | 214 | July 28, 2025 |
| Model almost instantly produces "nan" | | 4 | 207 | July 19, 2025 |
| What loss function should the inner loop of MAML use? | | 2 | 107 | June 27, 2025 |
| TruncatedNormal loc argument | | 3 | 75 | June 19, 2025 |
| Using buffers in ParallelEnvs / MultiSyncCollectors | | 2 | 140 | June 16, 2025 |
| Multi-agent RL with different agent action spaces | | 0 | 70 | June 12, 2025 |
| Policy Gradient For Pong Not Learning | | 0 | 51 | May 28, 2025 |
| Torchrl kl_div for old and new policy | | 0 | 56 | April 7, 2025 |
| Custom Vectorized environment for torchrl | | 3 | 169 | April 3, 2025 |