Replies: 1 comment
-
Thanks for posting this. I will move this to our issues and tag it as a proposal of enhancements for the team to consider. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello,
I would like to discuss and propose some improvements for SB3 integration with isaac lab (as a follow up of #528).
I've been playing around IsaacLab with SB3/SBX (SBX = SB3 + jax) and I have some improvements to suggests/questions:
To reproduce, using my fork (https://github.com/araffin/IsaacLab, branch
/feat/sb3-update
):Default, runs at 10 000 steps/s:
Fast variant, runs at 35 000 steps/s (SBX) and 30 000 steps/s:
fastest variant but incorrect (truncation not handled, lower performance):
--no--info
flag, runs at 40 000 steps/s (similar to rsl_rl)Side remark: while playing around, the rsl_rl ppo hyperparameters don't seem to be tuned for the A1 task/ I managed to get much higher rewards with SB3/SBX implementation (mean reward ~30 vs ~10 for rsl_rl, probably because they don't handle truncation?)
Beta Was this translation helpful? Give feedback.
All reactions