About Me

My name is Yu Yang (杨煜). I am currently a Ph.D. candidate in the Department of Control Science and Engineering at Zhejiang University (ZJU), where I have been since 2021, under the supervision of Prof. Yong Liu at the April Lab. From Feb 2025 to Feb 2026, I was a visiting Ph.D. student at National University of Singapore (NUS), working under the guidance of Prof. Gim Hee Lee at the CVRP Lab.

I'm open to research collaborations and currently seeking career opportunities starting in early 2027. Please feel free to reach out!

Research

My research lies at the intersection of Embodied Agents, Generative World Models, and 3D Computer Vision.

Currently, I am dedicated to developing autonomous self-evolving agents that continuously adapt and improve through complex environmental interactions. I also focus on 3D/4D world modeling, exploring controllable scene synthesis to create high-fidelity digital environments. By advancing agentic world modeling, I aim to bridge the gap between agent execution and environment interaction, leveraging world models as internal simulators to empower agents with efficient and adaptive policy optimization.

Furthermore, I leverage a strong background in 3D computer vision, with a focus on holistic scene perception and semantic understanding. I am also deeply interested in extending these 3D vision techniques to end-to-end autonomous driving, aiming to develop foundation models that integrate robust environmental awareness with closed-loop decision-making.

Agent

  • Self-Evolving Agents via Lifelong Continual Learning
  • Policy Optimization leveraging Reinforcement Learning (RL)

World Model

  • Agentic World Modeling for Policy Improvement
  • Controllable 3D/4D Scene Generation for World Simulation

3D Computer Vision

  • 3D Scene Perception and Semantic Understanding
  • Foundation Models for End-to-End Autonomous Driving

News

  • 2026.04:  Our paper WorldLens is accepted by CVPR 2026 (Oral).
  • 2026.02:  We are excited to release 3D and 4D World Modeling: A Survey.
  • 2026.02:  Our paper IR-WM is accepted by ICRA 2026.
  • 2025.10:  Our paper LiDARCrafter is accepted by AAAI 2026 (Oral).
  • 2025.09:  Our paper 𝒳-Scene is accepted by NeurIPS 2025.
  • 2025.02:  I joined the CVRP Lab at NUS as a visiting Ph.D. student.
  • 2024.12:  Our paper Drive-OccWorld is accepted by AAAI 2025 (Oral).
  • 2024.09:  Our paper SGN is accepted by IEEE Trans. on Image Processing.
  • 2023.07:  Our paper CenterLPS is accepted by ACM MM 2023.
  • 2023.06:  Our paper PANet is accepted by IROS 2023.
  • 2023.06:  Our paper SSC-RS is accepted by IROS 2023.

Publications

SPIRAL: Self-Evolving Action-Conditioned Video Generation via Reflective Planning Agents

SPIRAL: Self-Evolving Action-Conditioned Video Generation via Reflective Planning Agents

Yu Yang*, Yue Liao*, Jianbiao Mei*, Baisen Wang*, Xuemeng Yang, Licheng Wen, Jiangning Zhang, Xiangtai Li, Liang Lv, Hanlin Chen, Botian Shi, Yong Liu, Shuicheng Yan, Gim Hee Lee
Preprint, 2026.
Preprint, 2026
WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World

WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World

Ao Liang*, Lingdong Kong*, Tianyi Yan*, Hongsi Liu*, Wesley Yang*, Ziqi Huang, Wei Yin, Jialong Zuo, Yixuan Hu, Dekai Zhu, Dongyue Lu, Youquan Liu, Guangfeng Jiang, Linfeng Li, Xiangtai Li, Long Zhou, Lai Xing Ng, Benoit R. Cottereau, Changxin Gao, Liang Pan, Wei Tsang Ooi, Ziwei Liu
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2026.
CVPR 2026 (Oral) VideoWorldModel Workshop @ CVPR 2026
3D and 4D World Modeling: A Survey

3D and 4D World Modeling: A Survey

Lingdong Kong*, Wesley Yang*, Jianbiao Mei*, Youquan Liu*, Ao Liang*, Dekai Zhu*, Dongyue Lu*, Wei Yin*, Xiaotao Hu, Mingkai Jia, Junyuan Deng, Kaiwen Zhang, Yang Wu, Tianyi Yan, Shenyuan Gao, Song Wang, Linfeng Li, Liang Pan, Yong Liu, Jianke Zhu, Wei Tsang Ooi, Steven C. H. Hoi, Ziwei Liu
Preprint, 2026.
Preprint, 2026
Vision-Centric 4D Occupancy Forecasting and Planning via Implicit Residual World Models

Vision-Centric 4D Occupancy Forecasting and Planning via Implicit Residual World Models

Jianbiao Mei*, Yu Yang*, Xuemeng Yang, Licheng Wen, Jiajun Lv, Botian Shi, Yong Liu
IEEE International Conference on Robotics & Automation (ICRA), 2026.
ICRA 2026
LiDARCrafter: Dynamic 4D world modeling from LiDAR sequences

LiDARCrafter: Dynamic 4D world modeling from LiDAR sequences

Ao Liang, Youquan Liu, Yu Yang, Dongyue Lu, Linfeng Li, Lingdong Kong, Huaici Zhao, Wei Tsang Ooi
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2026.
AAAI 2026 (Oral) Wild3D Workshop @ ICCV 2025
𝒳-Scene: Large-Scale Driving Scene Generation with High Fidelity and Flexible Controllability

𝒳-Scene: Large-Scale Driving Scene Generation with High Fidelity and Flexible Controllability

Yu Yang, Alan Liang, Jianbiao Mei, Yukai Ma, Yong Liu, Gim Hee Lee
Neural Information Processing Systems (NeurIPS), 2025.
NeurIPS 2025
Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving

Driving in the Occupancy World: Vision-Centric 4D Occupancy Forecasting and Planning via World Models for Autonomous Driving

Yu Yang*, Jianbiao Mei*, Yukai Ma, Siliang Du, Wenqing Chen, Yijie Qian, Yuxiang Feng, Yong Liu
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2025.
AAAI 2025 (Oral)
DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes

DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes

Jianbiao Mei, Tao Hu, Xuemeng Yang, Licheng Wen, Yu Yang, Tiantian Wei, Yukai Ma, Min Dou, Botian Shi, Yong Liu
Preprint, 2024.
Preprint, 2024
DQFormer: Toward Unified LiDAR Panoptic Segmentation With Decoupled Queries for Large-Scale Outdoor Scenes

DQFormer: Toward Unified LiDAR Panoptic Segmentation With Decoupled Queries for Large-Scale Outdoor Scenes

Yu Yang*, Jianbiao Mei*, Siliang Du, Yilin Xiao, Huifeng Wu, Xiao Xu, Yong Liu
IEEE Transactions on Geoscience and Remote Sensing (TGRS), 63, 1-15, 2025.
TGRS 2025
Camera-based 3d semantic scene completion with sparse guidance network

Camera-based 3d semantic scene completion with sparse guidance network

Jianbiao Mei, Yu Yang, Mengmeng Wang, Junyu Zhu, Xiangrui Zhao, Jongwon Ra, Laijian Li, Yong Liu
IEEE Transactions on Image Processing (TIP), 33, 5468-5481.
TIP 2024
Exploit Spatiotemporal Contextual Information for 3D Single Object Tracking via Memory Networks

Exploit Spatiotemporal Contextual Information for 3D Single Object Tracking via Memory Networks

Jongwon Ra, MengMeng Wang, Jianbiao Mei, Shanqi Liu, Yu Yang, Yong Liu
International Conference on 3D Vision (3DV), 2024.
3DV 2024
CenterLPS: Segment Instances by Centers for LiDAR Panoptic Segmentation

CenterLPS: Segment Instances by Centers for LiDAR Panoptic Segmentation

Jianbiao Mei*, Yu Yang*, Mengmeng Wang, Zizhang Li, Xiaojun Hou, Jongwon Ra, Laijian Li, Yong Liu
Proceedings of the 31st ACM International Conference on Multimedia (ACM MM), 2023.
ACM MM 2023
PANet: LiDAR Panoptic Segmentation with Sparse Instance Proposal and Aggregation

PANet: LiDAR Panoptic Segmentation with Sparse Instance Proposal and Aggregation

Jianbiao Mei*, Yu Yang*, Mengmeng Wang, Xiaojun Hou, Laijian Li, Yong Liu
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023.
IROS 2023
SSC-RS: Elevate LiDAR Semantic Scene Completion with Representation Separation and BEV Fusion

SSC-RS: Elevate LiDAR Semantic Scene Completion with Representation Separation and BEV Fusion

Jianbiao Mei, Yu Yang, Mengmeng Wang, Tianxin Huang, Xuemeng Yang, Yong Liu
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023.
IROS 2023
Exploiting semantic-level affinities with a mask-guided network for temporal action proposal in videos

Exploiting semantic-level affinities with a mask-guided network for temporal action proposal in videos

Yu Yang, Mengmeng Wang, Jianbiao Mei, Yong Liu
Applied Intelligence, 53(12), 15516-15536.
APIN 2023

Experience

National University of Singapore, CVRP Lab, Visiting Ph.D. Student
3D and 4D World Modeling, Scene Generation
Shanghai AI Laboratory, KnowledgeXLab, Research Intern
Self-Evolving Agents, Agentic World Modeling
Huawei Technologies Co., Ltd., Riemann Lab, Research Intern
Driving World Model, End-to-End Autonomous Driving
Meituan, AutoML Group, Research Intern
Large Vision-Language Models

Awards and Honors

  • 2025.10: Academic Scholarship, Zhejiang University
  • 2024.10: China Scholarship Council (CSC) Scholarship
  • 2023.10: Academic and Faculty Scholarship, Zhejiang University
  • 2019.10: National Scholarship, Ministry of Education of China
  • 2019.10: First Prize in the National Undergraduate Electronic Design Competition (Top 2.4%)
  • 2018-2020: Academician Scholarship, Faculty Scholarship

Services

Journal Reviewer

Conference Reviewer