Pengyuan Wang

About Me

I am Pengyuan Wang, an undergraduate student in Robotics Engineering (Honors Program) at Zhejiang University, expected to graduate in 2027. Currently, I am a research intern at the THU Spatial Intelligence Lab, working with Prof. Yiming Li.

My long-term research vision is to develop machine learning-based control for dynamic systems, encompassing reinforcement learning, imitation learning, and the perception-action loop, to achieve high-level autonomy. I aim to fully leverage the hardware advantages of diverse robotic platforms to enable safe and autonomous deployment in both industrial and daily-life settings.

My recent work spans humanoid locomotion and soccer control, autonomous terrain exploration, agile UAV control, and embedded systems for RoboMaster robots.

RESEARCH PROJECTS

End-to-End Humanoid Soccer Control via Depth-Based Active Perception

Yixiao Huo*, Pengyuan Wang*, Jiakang Jin*, Yinan Han*, et al.

Developed an end-to-end RL framework for a 25-DOF humanoid to perform complex soccer maneuvers. Integrated depth-based active perception and auxiliary prediction heads for robust sim-to-real hardware deployment.

ongoing
Autonomous Humanoid Exploration in Multi-Level Terrains

Yinan Han*, Yixiao Huo*, Pengyuan Wang*, Jiakang Jin*, et al.

Designed a goal-free heuristic exploration policy for a 25-DOF humanoid using active depth sensing. Synthesized competitive reward dynamics to induce emergent, terrain-adaptive locomotion across complex environments.

ongoing

COMPETITIONS

RoboMaster University Series: Embedded System Architecture

Zhejiang University RoboMaster Team "Hello World"

Architected the embedded software and multi-layer state machines for heterogeneous competitive robots. Developed high-bandwidth bit-field protocols and cascaded PID controllers to ensure deterministic, high-speed motion control.

Technical Report

ENGINEERING PROJECTS

General Motion Tracking on Noetix E1 Humanoid Platform

Internship at Noetix Robotics

Reproduced and optimized General Motion Tracking frameworks (BeyondMimic/HoloMotion) via Isaac Lab. Leveraged adaptive sampling and robust sim-to-real pipelines to achieve stable whole-body imitation of high-dynamic dance sequences.
Agile UAV Flight Control via Deep Reinforcement Learning

Research project at FAST Lab, Zhejiang University

Built a high-mobility UAV control stack bridging PPO-based RL and cascaded PID. Achieved autonomous agile maneuvers (e.g., figure-8) using extensive domain randomization for robust sim-to-real hardware transfer.