66% training speedup for humanoid control through transformer-enhanced RL
Research Engineer
2 months (Apr 2024 - May 2024)
PyTorch, TDMPC2, Decision Transformers, MuJoCo, Reinforcement Learning
66% training time reduction for humanoid robot control. Enhanced TDMPC2 algorithm with decision transformers while maintaining performance benchmarks.
Training humanoid robots for complex motor control tasks requires significant computational time. Existing RL algorithms face efficiency bottlenecks.
Implemented transformer-based reinforcement learning in PyTorch, integrating decision transformers with TDMPC2. Validated efficiency gains across DreamerV3 and other algorithms in MuJoCo simulation for humanoid sit task.