
Zhaoyang Wang

Ph.D. Student in CS
University of North Carolina
at Chapel Hill

[email protected] where X=first name

About Me

My name is Zhaoyang Wang (王朝阳 in Chinese). I am a second-year Ph.D. student in the Department of Computer Science at the University of North Carolina at Chapel Hill, advised by Prof. Huaxiu Yao. My research interests lie mainly in the reasoning and alignment of Large Language Models (LLMs). I am currently working on agents; feel free to reach out if you are interested in collaborating. In my spare time, I'm always excited to learn more about MLSys and computer systems.

Previously, I worked on robustness and text generation in Natural Language Processing (NLP). I received my master's degree in Computer Science from Sun Yat-sen University in 2024, and my bachelor's degree in Computer Science from North China Electric Power University in 2021. I also had a wonderful time interning at Microsoft (2023, 2025) and WeChat AI (2023).

Publications [ Google Scholar ] [ Full Publications ]

    Agents

  1. WebHarbor: Docking Real Websites for Evolving GUI Agent Environments [ Blog ]
    Zhaoyang Wang, Qianhui Wu, Shi Qiu, WebHarbor Team, and Contributors
    Blog 2026.

  2. Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning [ ICML 2026 ]
    Zhaoyang Wang, Canwen Xu, Boyi Liu, Yite Wang, Siwei Han, Zhewei Yao, Huaxiu Yao, Yuxiong He
    Proceedings of the 43rd International Conference on Machine Learning.

  3. WebXSkill: Skill Learning for Autonomous Web Agents [ Preprint ]
    Zhaoyang Wang, Qianhui Wu, Xuchao Zhang, Chaoyun Zhang, Wenlin Yao, Fazle Elahi Faisal, Baolin Peng, Si Qin, Suman Nath, et al.
    arXiv preprint, 2026.

  4. SynthAgent: Adapting Web Agents with Synthetic Supervision [ ACL 2026 ]
    Zhaoyang Wang, Yiming Liang, Xuchao Zhang, Qianhui Wu, Siwei Han, Anson Bastos, Ruijia Wang, Chetan Bansal, Baolin Peng, et al.
    Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics.

  5. GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL [ Preprint ]
    Rui Yang, Qianhui Wu, Zhaoyang Wang, Hanyang Chen, Ke Yang, Hao Cheng, Huaxiu Yao, Baolin Peng, Huan Zhang, Jianfeng Gao, Tong Zhang
    arXiv preprint, 2026.

    Reasoning

    Alignment

Experiences

  • Microsoft, Redmond, May 2025 - Aug 2025
    Research Intern, working on Web Agents under the supervision of Xuchao Zhang and Qianhui Wu.
  • Tencent/WeChat AI, Beijing, June 2023 - Sept 2023
    Research Intern, working on the RLHF stage of WeLM under the supervision of Liwen Zhu and Xiao Zhou.
  • Microsoft, Beijing, Jan 2023 - June 2023
    Research Intern, working on LLM Reasoning under the supervision of Shaohan Huang, Minghui Song and Zihan Zhang.



Last updated on May 19, 2026
Website credits to Minimal Light template