Skip to content

Pull requests: pytorch/rl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Bump vllm from 0.14.1 to 0.20.0 in /sota-implementations/grpo CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Dependencies Pull requests that update a dependency file python Pull requests that update python code sota-implementations/
#3708 opened May 6, 2026 by dependabot Bot Loading…
[BugFix] Check agent presence before updating action mask in PettingZoo BugFix Environments/pettingzoo Environments Adds or modifies an environment wrapper
#3703 opened May 5, 2026 by nshoman Loading…
3 of 10 tasks
[Feature] MuJoCo custom envs with selectable physics backend CI Has to do with CI setup (e.g. wheels & builds, tests...) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Documentation Improvements or additions to documentation Environments Adds or modifies an environment wrapper Feature New feature
#3700 opened May 1, 2026 by vmoens Collaborator Loading…
6 of 7 tasks
[Feature] Add functorch integration tests for TensorDictModule - Fixes #154 CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Feature New feature Modules
#3697 opened Apr 30, 2026 by ParamThakkar123 Contributor Loading…
[CI] Add ruleset JSON requiring lint-done on protected branches CI Has to do with CI setup (e.g. wheels & builds, tests...) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3690 opened Apr 28, 2026 by vmoens Collaborator Loading…
3 tasks
[CI] Selective PR test matrix gated by changed-files + ciflow/* labels CI Has to do with CI setup (e.g. wheels & builds, tests...) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3674 opened Apr 27, 2026 by vmoens Collaborator Loading…
4 of 8 tasks
Bump transformers from 4.52.4 to 5.0.0rc3 in /sota-implementations/expert-iteration CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Dependencies Pull requests that update a dependency file python Pull requests that update python code sota-implementations/
#3601 opened Apr 8, 2026 by dependabot Bot Loading…
[CI] Install torchcodec from source CI Has to do with CI setup (e.g. wheels & builds, tests...) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Record
#3541 opened Mar 4, 2026 by vmoens Collaborator Loading…
[Feature] Added Lazy implementation of priority updates for replaybuffer prototype CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Feature New feature ReplayBuffers
#3507 opened Feb 13, 2026 by ParamThakkar123 Contributor Loading…
3 of 10 tasks
[Feature] Added support for TDMPC2 dataset CI Has to do with CI setup (e.g. wheels & builds, tests...) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Data Data-related PR, will launch data-related jobs Documentation Improvements or additions to documentation Environments Adds or modifies an environment wrapper Feature New feature
#3501 opened Feb 12, 2026 by ParamThakkar123 Contributor Loading…
6 of 10 tasks
[Feature] Added OpenEnv environments CI Has to do with CI setup (e.g. wheels & builds, tests...) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Documentation Improvements or additions to documentation Environments Adds or modifies an environment wrapper Feature New feature llm/ LLM-related PR, triggers LLM CI tests Trainers
#3470 opened Feb 9, 2026 by ParamThakkar123 Contributor Loading…
6 of 10 tasks
[Feature] Extended Support delayed spec initialization for exploration modules CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Feature New feature Integrations/torch_geometric Integrations Modules
#3450 opened Feb 5, 2026 by ParamThakkar123 Contributor Loading…
3 of 10 tasks
[Feature] Added MCTSPolicyBase, MCTSPolicy, AlphaGoPolicy, AlphaStarPolicy, and MuZeroPolicy CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Documentation Improvements or additions to documentation Feature New feature Modules
#3449 opened Feb 5, 2026 by ParamThakkar123 Contributor Loading…
6 of 10 tasks
[Algorithm] DPO CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Documentation Improvements or additions to documentation llm/ LLM-related PR, triggers LLM CI tests Objectives
#3427 opened Jan 31, 2026 by vmoens Collaborator Loading…
[Feature] SDPO CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Feature New feature llm/ LLM-related PR, triggers LLM CI tests Objectives
#3425 opened Jan 30, 2026 by vmoens Collaborator Loading…
5 tasks
[CI] Add path-based triggers for niche workflows CI Has to do with CI setup (e.g. wheels & builds, tests...) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3403 opened Jan 28, 2026 by vmoens Collaborator Loading…
[BugFix] Call Transfom._call from reset BugFix CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. Transforms
#3385 opened Jan 26, 2026 by ParamThakkar123 Contributor Loading…
3 of 10 tasks
[Feature] Incremental TensorStorageCheckpointer CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3344 opened Jan 19, 2026 by vmoens Collaborator Loading…
[Feature] Add _Contiguous module and reshape improvements to encoders/decoders CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3306 opened Jan 8, 2026 by vmoens Collaborator Loading…
[BugFix] Fix SliceSampler for torch.compile compatibility CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3298 opened Jan 8, 2026 by vmoens Collaborator Loading…
Fix Habitat CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#3065 opened Jul 14, 2025 by vmoens Collaborator Loading…
[Algorithm] DPO CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. enhancement New feature or request
#3025 opened Jun 23, 2025 by vmoens Collaborator Loading…
[Feature, Example] A3C Atari Implementation for TorchRL CI Has to do with CI setup (e.g. wheels & builds, tests...) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. new algo New algorithm request or PR sota-implementations/
#3001 opened Jun 15, 2025 by simeetnayan81 Contributor Loading…
3 of 9 tasks
[CI] Fix gym rendering issues CI Has to do with CI setup (e.g. wheels & builds, tests...) CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2995 opened Jun 9, 2025 by vmoens Collaborator Loading…
[Formatting] headers and future imports checks CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
#2973 opened May 23, 2025 by vmoens Collaborator Loading…
10 tasks
ProTip! What’s not been updated in a month: updated:<2026-04-07.