Zhihao Lin

Ph.D. in Autonomous Systems & Connectivity, University of Glasgow.

prof_pic.jpg

Zhihao Lin, Ph.D.

University of Glasgow

Glasgow, U.K.

I received my Ph.D. in Autonomous Systems & Connectivity from the University of Glasgow in June 2026, supervised by Dr. Jianglin Lan. πŸ“¬ Open for postdoctoral positions from Sept. 2026.

My research centres on representation learning for reinforcement learning, asking: what should an RL agent learn to see, so that it can act well?

This question grew out of my early work on autonomous driving, where I kept running into the same quiet puzzle: better perception did not automatically lead to better decisions. The gap between seeing and acting never felt like something more data or bigger models would simply close β€” it seemed to point at something more fundamental about how an agent’s understanding of the world becomes the way it chooses to act. That gap is what I keep returning to.

My Ph.D. work approaches it from three angles:

  • Geometric policy optimisation (GAC, ICLR 2026): treating bounded action spaces as a geometric constraint rather than an afterthought β€” replacing Gaussian policies and their ad-hoc squashing with an efficient spherical formulation that decomposes each action into a direction vector and a learnable concentration parameter.
  • Action Manifold Smoothing (AMS, ICML 2026): stabilising high-dimensional continuous control by replacing point-wise temporal-difference targets with orthogonally-sampled neighbourhood averages, taming the multiplicative Lipschitz-pathway error amplification that makes algorithms like TD3 and SAC collapse.
  • World-model-guided representation learning (NeurIPS 2026, under review): using a world model not as a simulator but as a structured supervision tool, shaping an encoder whose representations are simultaneously predictive and value-aware.

On the side, I have a deep personal interest in theoretical physics, particularly the thermodynamic and information-theoretic foundations of gravity and cosmology.

I am always happy to chat about RL, world models, embodied intelligence, or the physics of spacetime. Feel free to reach out.

news

Jun 24, 2026 Three papers accepted πŸŽ‰ β€” Dual-Mode SPL-SLAM (co-first author) in IEEE Transactions on Intelligent Transportation Systems, and two first-author papers in IEEE Transactions on Vehicular Technology: Hierarchical Multi-Agent MCTS for Safety-Critical Coordination in Mixed-Autonomy Roundabouts and A Two-Stage Spatiotemporal Trajectory Optimization Framework for Autonomous Lane Changing With Dynamic Risk Fields.
May 15, 2026 My sole-authored paper Action Manifold Smoothing: A Lipschitz Pathway Perspective on High-Dimensional Reinforcement Learning has been accepted to ICML 2026 (CORE A*). πŸŽ‰
Dec 15, 2025 My sole-authored paper Beyond Distributions: Geometric Action Control for Continuous Reinforcement Learning has been accepted to ICLR 2026 (CORE A*). πŸŽ‰
Dec 10, 2025 Our paper Scalable and Safe Multi-Agent Coordination with Reconstructed Level-k Monte Carlo Tree Search has been accepted to AAMAS 2026 (CORE A*). πŸŽ‰
Apr 01, 2025 πŸ† Awarded a Β£1,800 Research Mobility Fund from the University of Glasgow College of Science and Engineering for international research collaboration.

selected publications

  1. ICML
    icml2026_ams.png
    Action Manifold Smoothing: A Lipschitz Pathway Perspective on High-Dimensional Reinforcement Learning
    Zhihao Lin
    In International Conference on Machine Learning (ICML), 2026
    CORE A*
  2. ICLR
    iclr2026_gac.png
    Beyond Distributions: Geometric Action Control for Continuous Reinforcement Learning
    Zhihao Lin
    In International Conference on Learning Representations (ICLR), 2026
    CORE A*
  3. AAMAS
    aamas2026.gif
    Scalable and Safe Multi-Agent Coordination with Reconstructed Level-k Monte Carlo Tree Search
    Zhihao Lin, Lin Wu, Zhen Tian, and 2 more authors
    In International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), 2026
    CORE A*
  4. IEEE TVT
    tvt2026_twostage.png
    A Two-Stage Spatiotemporal Trajectory Optimization Framework for Autonomous Lane Changing With Dynamic Risk Fields
    Zhihao Lin, Zhen Tian, Xianxian Zhao, and 3 more authors
    IEEE Transactions on Vehicular Technology, 2026
  5. IEEE TITS
    tits_contingency.png
    Contingency-Aware Spatiotemporal Optimization for Safe Autonomous Vehicle Trajectory Planning
    Zhihao Lin, Jianglin Lan, Anh-Tu Nguyen, and 1 more author
    IEEE Transactions on Intelligent Transportation Systems, 2025
  6. Pattern Recognit.
    slam2_fig.png
    SLAM2: Simultaneous Localization and Multimode Mapping for Indoor Dynamic Environments
    Zhihao Lin, Qi Zhang, Zhen Tian, and 4 more authors
    Pattern Recognition, 2025