Homepage - Zihan Wang

Zihan Wang

Ph.D. Student

Board Member

I am a Ph.D. student at McGill University and Mila – Quebec AI Institute, co-advised by Prof. Doina Precup and Prof. Xiao-Wen Chang. My research focuses on continual reinforcement learning, aiming to develop AI systems that can learn and adapt in non-stationary environments. I am also a Board Member of ContinualAI, a nonprofit organization advancing research in continual learning. I co-organize the Mila Tea Talk, a weekly seminar series at Mila.

Previously, I completed my M.Sc. in Electrical and Computer Engineering at McGill and Mila, under the supervision of Prof. Narges Armanfard and Prof. Samira Ebrahimi Kahou, where I worked on anomaly detection. I also hold a B.Sc. in Electrical and Computer Engineering, with a Minor in Mathematics, from the University of Alberta.

Curriculum Vitae

zihan.wang(at)mila.quebec Google Scholar GitHub LinkedIn

Education

McGill University & Mila – Quebec AI Institute
School of Computer Science
Ph.D. Student
Supervisors: Doina Precup, Xiao-Wen Chang
Sep. 2025 - present

McGill University & Mila – Quebec AI Institute
M.Sc. in Electrical and Computer Engineering
Supervisors: Narges Armanfard, Samira Ebrahimi Kahou
Sep. 2023 - Aug. 2025

University of Alberta
B.Sc. in Electrical and Computer Engineering
Minor in Mathematics
Sep. 2018 - Jun. 2023

Experience

Mila – Quebec AI Institute / Linarite AI
Scientist in Residence
Sep. 2025 – Apr. 2026

ContinualAI
Board Member
May 2024 – present

Huawei Noah's Ark Lab, Montreal, Canada
Researcher Intern
Jul. 2024 – Sep. 2024

Honors & Awards

Flight PS752 Commemorative Scholarship
2025–2026

FRQNT Doctoral Research Scholarship
2025–2029

GREAT Scholarship
2025

FRQNT Master’s Research Scholarship
2024–2025

McGill University Graduate Excellence Fellowship
2023–2026

University of Alberta Academic Scholarship (×3)
2019–2023

University of Alberta Dean’s Research Award (×2)
2021, 2022

Graduated with Distinction, University of Alberta
2023

News

2026

Abstract accepted as a poster at MIPCC 2026 — 'Finding the Words: Exploring AI-Supported Simulation Practice to Address Language Inequities in Pediatric Serious Illness Communication'. See you in MTL in October!

Oct 07

Our paper 'Unsupervised Continual Clustering via Forward-Backward Knowledge Distillation' has been accepted to ECML PKDD 2026!

Sep 06

Organizing the Finding the Frame Workshop at RLC 2026 in Montréal. See you in MTL!

Aug 15

Our paper 'GitChameleon 2.0: Evaluating AI Code Generation Against Python Library Version Incompatibilities' has been accepted to the ACL 2026 main conference!

Jul 01

Our paper 'Perceived Regret: Evaluating Agents in Any World' has been accepted to the Continual RL Workshop @ RLC 2026!

Jun 16

Submitted 'SIC-Agents: Benchmarking and Building Adaptive Simulator for Pediatric Serious Illness Communication Training' to EMNLP 2026.

May 25

Abstract accepted at EAPC 2026 — 'From Prototype to Practice: Iterative Design and Calibration of an AI Platform for Pediatric Serious-Illness Communication'.

May 03

Joining as a co-organizer of the Mila Tea Talk, a weekly seminar series at Mila.

Jan 01

2025

Awarded the GREAT Scholarship!

Oct 01

Our paper 'Incorporating Spatial Information into Goal-Conditioned Hierarchical Reinforcement Learning via Graph Representations' has been accepted to TMLR!

Sep 23

Awarded the Flight PS752 Commemorative Scholarship.

Sep 15

Started as a Ph.D. student in the School of Computer Science at McGill University and Mila in Fall 2025.

Sep 01

Our paper 'Zero-Shot Anomaly Detection with Dual-Branch Prompt Learning' was selected for an oral presentation at BMVC 2025! See you in Sheffield!

Jul 25

Submitted my M.Sc. thesis—ending a wonderful two years at iSMART Lab.

Jun 30

Awarded the FRQNT Doctoral Research Scholarship.

Apr 30

2024

Serving as a reviewer for TMLR.

Oct 01

Joined ContinualAI as a Board Member.

May 02

Awarded the FRQNT Master's Research Scholarship.

Apr 30

2023

Started M.Sc. thesis program in Electrical and Computer Engineering at McGill University and Mila in Fall 2023.

Sep 01

Graduated with a B.Sc. from the University of Alberta after five wonderful years. Capstone project: Guess a Sketch

Apr 30

Selected Publications

Perceived Regret: Evaluating Agents in Any World

Wesley Chung*, Zihan Wang*, Xiao-Wen Chang, David Meger, Doina Precup (* equal contribution)

Continual RL Workshop, Reinforcement Learning Conference (RLC) 2026

Continual reinforcement learning considers an agent that receives and learns from a stream of experience, aiming to maximize its accumulated reward. A fundamental problem is to evaluate such an agent using only its stream of experience, without assuming a particular structure of the world. We define an examiner that observes the same stream and, at every timestep, computes a perceived regret representing the agent's suboptimality gap from the examiner's perspective. We demonstrate empirically that perceived regret is a useful performance measure applicable to arbitrary streams of experience, and prove that no universal examiner can accurately assess all agents in any world, though specific modeling biases enable success in associated environments.

[Paper]

SIC-Agents: Benchmarking and Building Adaptive Simulator for Pediatric Serious Illness Communication Training

Zihan Wang, Anita Slominska, Rennie Bimman, Elizabeth Di Flumeri, Amanda Mayappo-Neeposh, Conall Francoeur, Tamara Ellen Carver, Xiao-Wen Chang, Doina Precup, Esin Darici Haritaoglu, Ismail Haritaoglu, Akshatha Arodi, Naomi Goloff

Under review at EMNLP 2026

Pediatric serious illness communication (SIC) is critically important, yet scalable communication training for clinicians remains limited. Compared with other dialogue simulation settings, pediatric SIC poses additional challenges, including multi-party interactions, response to parental distress, and strong dependence on feedback dynamics. In collaboration with educators and pediatric clinicians, we introduce the first benchmark suite and simulation framework tailored to pediatric SIC training. Our benchmarks, PitfallBench and DialogueBench, evaluate simulators both at the turn level and across full dialogues. We further propose SIC-Agents, a self-improving framework that generates a clinician-editable skill document to guide simulator behavior. Our experiments show that SIC-Agents outperforms static expert prompting. To support future research, we release our benchmarks and a training interface for parent simulation in pediatric SIC.

Zero-shot Anomaly Detection with Dual-Branch Prompt Learning

Zihan Wang, Samira Ebrahimi Kahou, Narges Armanfard

Proceedings of the British Machine Vision Conference (BMVC) 2025 Oral

Zero-shot anomaly detection (ZSAD) aims to identify and localize unseen defects without requiring any labeled anomalies, but existing methods struggle to generalize under domain shifts. We propose PILOT, a framework combining a dual-branch prompt learning mechanism with label-free test-time adaptation, enabling dynamic adaptation to new distributions using only unlabeled data. PILOT achieves state-of-the-art performance on 13 industrial and medical benchmarks for both anomaly detection and localization under domain shift.

[Paper]

SPGNet: Spatial Projection Guided 3D Human Pose Estimation in Low Dimensional Space

Zihan Wang, Ruimin Chen, Mengxuan Liu, Guanfang Dong, Anup Basu

International Conference on Smart Multimedia 2022

We propose SPGNet, a method for 3D human pose estimation that integrates multi-dimensional re-projection into supervised learning. Our approach enforces kinematic constraints and jointly optimizes both 2D and 3D pose consistency, leading to improved accuracy. Experiments on the Human3.6M dataset show that SPGNet outperforms many state-of-the-art methods.

[Paper]
