Yubin Wang

Greetings! I am Yubin Wang (王玉斌), a researcher at Huawei Noah's Ark Lab 🚢, Shanghai.

I received my M.Phil degree from The Hong Kong University of Science and Technology. I was a visiting student at King Abdullah University of Science and Technology.

/ / / Google Scholar

Research

I am broadly interested in building AI agents with strong reasoning, planning and learning capabilities to physically interact with both simulated and real world.

News

[Feb 2, 2025] BiM-PPO is accepted to IEEE TVT.

[Jan 9, 2025] LearningFlow is released.

[Dec 5, 2024] CALMM-Drive is released.

[Jun 30, 2024] RD-PPO is accepted to IROS 2024.

[Mar 01, 2024] Latent-MPC is released.

[Jan 29, 2024] MPC-CRL is accepted to ICRA 2024.

Selected Publications

-->

	Bilevel Multi-Armed Bandit-Based Hierarchical Reinforcement Learning for Interaction-Aware Self-Driving at Unsignalized Intersections Zengqi Peng, Yubin Wang, Lei Zheng, Jun Ma IEEE Transactions on Vehicular Technology, 2025. preprint
	LearningFlow: Automated Policy Learning Workflow for Urban Driving with Large Language Models Zengqi Peng, Yubin Wang, Xu Han, Lei Zheng, Jun Ma arXiv, 2025. preprint
	CALMM-Drive: Confidence-Aware Autonomous Driving with Large Multimodal Model Ruoyu Yao, Yubin Wang, Haichao Liu, Rui Yang, Zengqi Peng, Lei Zhu, Jun Ma arXiv, 2024. preprint
	Reward-Driven Automated Curriculum Learning for Interaction-Aware Self-Driving at Unsignalized Intersections Zengqi Peng, Xiao Zhou, Lei Zheng, Yubin Wang, Jun Ma IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024 pdf
	Learning the References of Online Model Predictive Control for Urban Self-Driving Yubin Wang, Zengqi Peng, Yusen Xie, Yulin Li, Hakim Ghazzai, Jun Ma arxiv, 2024. project page / preprint / code
	Chance-Aware Lane Change with High-Level Model Predictive Control through Curriculum Reinforcement Learning Yubin Wang, Yulin Li, Zengqi Peng, Hakim Ghazzai, Jun Ma IEEE International Conference on Robotics and Automation (ICRA), 2024 pdf
	Curriculum Proximal Policy Optimization with Stage-Decaying Clipping for Self-Driving at Unsignalized Intersections Zengqi Peng, Xiao Zhou, Yubin Wang, Lei Zheng, Ming Liu, Jun Ma IEEE 26th International Conference on Intelligent Transportation Systems (ITSC), 2023. pdf
	A Deep-Learning-Based Observer for State Estimation of Direct Contact Membrane Distillation System Modeled by Differential Algebraic Equations Yubin Wang, Yasmine Marani, Taous Meriem Laleg Kirati IEEE Conference on Control Technology and Applications (CCTA), 2022. pdf

Design and source code from this cool guy.