Skip to content

Pull requests: typoverflow/WiseRL

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat: add use_placeholder for easier configuration
#15 by OrangeX4 was merged Dec 6, 2024 Loading…
add OPPO
#11 by polaris0-a was closed Jul 9, 2024 Loading…
feat: Hindsight Preference Learning
#10 by typoverflow was merged May 23, 2024 Loading…
update: add hammer and pen datasets
#9 by typoverflow was merged May 9, 2024 Loading…
update: cliff walking dataset
#8 by typoverflow was merged May 2, 2024 Loading…
reward model evaluation
#7 by typoverflow was merged Apr 24, 2024 Loading…
Hindsight Preference Learning
#6 by typoverflow was closed Jul 5, 2024 Loading…
update: create variant_world_dataset.py
#5 by OrangeX4 was merged Mar 13, 2024 Loading…
update: add UtilsRL.env.wrapper as default wrapper module
#4 by OrangeX4 was merged Mar 13, 2024 Loading…
Feature: Robust Preference Learning
#3 by typoverflow was merged Mar 8, 2024 Loading…
[Feature]: Add AWAC variants
#2 by typoverflow was merged Jan 23, 2024 Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.