♻️ full agent refactor
This commit is contained in:
@@ -7,8 +7,6 @@ dt: 0.002
|
||||
substeps: 10
|
||||
history_length: 10
|
||||
|
||||
rma_mode: "none" # "none" | "teacher" | "deploy"
|
||||
|
||||
# Clean by default (deterministic eval). Confirming-experiment example —
|
||||
# re-eval an existing checkpoint in sim with a fixed 1-step action delay:
|
||||
# mjpython scripts/eval.py env=rotary_cartpole runner=mujoco_single \
|
||||
|
||||
Reference in New Issue
Block a user