saurav jha | सौरभ झा

I am an ivado postdoctoral researcher at MILA, working with Sarath Chandar. My research focuses on building adaptive and reliable generative AI systems with a recurring theme of understanding what internal representations make models useful beyond static prediction. Few of my recent works include:

World models for reasoning and control: Latent-space design (Robotics world models), Test-time scaling (WMW)
Model compression: MoE-LLMs (work with Samsung), stable diffusion (iclr'25 work with Sony)
Adaptation under uncertainty: clip model (neurips'23), neural processes (neurips'24)

I completed my PhD at UNSW Sydney in August 2025, where I was advised by Lina Yao and Dong Gong. During the latter half of my PhD, I worked as an applied research scientist at openstream.ai, and as a research intern at Sony (hosted by Shiqi Yang and Shusuke Takahashi ) and Tencent (hosted by Shengju Qian).

Prior to my PhD, I worked on continual learning with Joost van de Weijer, and did my Erasmus Mundus Joint Master's Degree (EMJMD) in Advanced Systems Dependability at the University of St Andrews, the UK and l'Université de Lorraine, France. During my master's, I interned in Emmanuel Vincent's group at Inria Nancy.

At Mila, besides research, I also serve in the organizing committee of:

The CoLLAS monthly seminar series, that aims to bring together researchers from different domains of lifelong learning.
The Mila AI Colloquium, Mila's flagship seminar series that aims to host globally renowned guest researchers in the institute.

In my free time, I enjoy learning to travel, snorkelling, hiking, and binge-watching. I grew up in eastern Nepal, and every couple of years, I like to plan week-long treks in the Nepalese Himalayas.

news

May 2026: New work on evaluating robotics world models: preprint, webpage.
Apr 2026: New work with Samsung on model merging: preprint + models, blog, and subreddit.
Apr 2026: We are organizing the Workshop on Efficient Visual Generation at ECCV 2026!
Dec 2025: I will be serving in the steering committee of a **new** mila talk series led by Hugo Larochelle.
Oct 2025: Quoted in this MIT news story for perspectives on personalized object localization.
Aug 2025: My PhD thesis was recommended for the Dean's Award for outstanding PhD theses at UNSW Sydney.
May 2025: Mentoring session + Talk on continual learning in Bishesh Khanal's group, NAAMII, Nepal.
Jul 2024: Presented my paper "CLAP4CLIP" at the 2nd Bayes duality workshop, RIKEN AIP, Tokyo.
Apr 2024: Won a travel grant for presenting my paper "NPCL" at the EEML summer school, Novi Sad, Serbia.

talks & seminars

Mar 2026: "Introspective updates for Continual learning", Visual Computing Seminar, MIT CSAIL.
Apr 2025: "Uncertainty-aware Continual learning", Marinka Zitnik's group, Harvard University [Remote].
Jun 2025: Presented my paper "Mining Your Own Secrets" at the Sydney AI meetup .

Experience

IVADO postdoctoral fellow (Sep 2025 - Present)

MILA • Montréal, Canada 🇨🇦

Research aligned with advancing Canada's R3AI initiative.

Applied AI Scientist (May 2025 - Aug 2025)

OpenStream.ai • Melbourne, Australia 🇦🇺

Infra-focus: Developed production-grade conversational LLM agents for enterprise clients.

ML-focus: Implemented & shipped a POC for neuro-symbolic verification of multi-agent systems.

AI Research Intern (Sep 2024 - Mar 2025)

LightSpeed Studios, Tencent • Sydney, Australia 🇦🇺

Worked on controllable image generation and preference optimization for multi-modal LLMs.

Research Scientist Intern (May 2024 - Aug 2024)

Creative AI Lab, Sony Group Corporation • Tokyo, Japan 🇯🇵

Worked on continual personalization of pre-trained text-to-image diffusion models.

Research Assistant (Sep 2021 - Jan 2022)

Computer Vision Centre, Universitat Autònoma de Barcelona • Barcelona, Spain 🇪🇸

Worked on rehearsal-free continual learning for Vision Transformers (ViTs).

Research Intern (Mar 2021 - Jul 2021)

Multispeech group, Inria Nancy • Nancy, France 🇫🇷

Worked on learning domain-specific language models for speech recognition.

Machine Learning Engineer (Jun 2018 - Jul 2019)

FactSet Research Systems Inc. • Hyderabad, India 🇮🇳

Worked on improving FactSet's named entity recognition service with acronym disambiguation and neural topic modeling.

Awards & Recognition

IVADO Postdoctoral Fellowship: Among the 11 recipients of the 2025 cohort.
CSE writing fellowship: From UNSW Sydney, in recognition of a strong PhD publication record (2025).
Tiktok-sponsored Best Student Presentation awardee at the Sydney AI meetup (2024).
Travel grant awardee for Eastern European Machine Learning (EEML) summer school, Serbia (2024).
Best runner-up paper awardee at the CVPR 2022 workshop on Continual Learning.
University International Postgraduate Award (UIPA) recipient for PhD studies at UNSW Sydney (2022).
Best master’s thesis award for Erasmus+ DEPEND 2019-21 cohort.
Best students’ poster at Digital Ethics4EU 2021 workshop, TU Dublin.
Winner of Barclays chatbot challenge at Hack the Burgh 2020, the University of Edinburgh.
Erasmus Mundus scholarship for joint Master’s degree studies in the UK and France (2019).

Selected Works

Reconstruction or Semantics? What Makes a Latent Space Useful for Robotic World Models

Nilaksh^*, Saurav Jha^*, Artem Zholus^*, Sarath Chandar

Preprint arXiv 2026

Paper Code Project page (demos)

We train and evaluate several robotic world model variants, contrasting reconstruction-driven and semantics-driven latent spaces across multi-view inputs, policy rollouts, and out-of-distribution distractor settings.

REAM: Merging Improves Pruning of Experts in LLMs

Saurav Jha^*, Maryam Hashemzadeh, Ali Saheb Pasand, Ali Parviz, Min-Joong Lee, Boris Knyazev^*

Work done with Samsung SAIT AI Lab, Montréal

Preprint arXiv 2026

Paper Code HF Models

REAM shows that pseudo-pruning, rather than merging or pruning, better preserves performance when compressing Mixture-of-Experts LLMs across multiple benchmarks.

Probing the Effectiveness of World Models for Spatial Reasoning through Test-Time Scaling

Saurav Jha, M. Jehanzeb Mirza, Wei Lin, Shiqi Yang, Sarath Chandar

World Modeling Workshop 2026

Paper Code

We propose Verification through Spatial Assertions (ViSA), a proposer-solver method that enables faithful test-time verification of world model views for enhancing the spatial reasoning in existing VLMs.

Mining Your Own Secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models

Saurav Jha, Shiqi Yang, Masaki Ishii, Meng Zhao, Christian Simon, Jehanzeb Mirza, Dong Gong, Lina Yao, Shusuke Takahashi, Yuki Mitsufuji

Work done with Sony AI

ICLR 2025

Paper Project page

We propose using diffusion classifier scores for regularizing the parameter-space and function-space of text-to-image diffusion models, to achieve continual personalization.

CLAP4CLIP: Continual LeArning with Probabilistic finetuning for Vision-Language Models

Saurav Jha, Dong Gong, Lina Yao

NeurIPS 2024

Paper Code

Our work proposes Continual LeArning with Probabilistic finetuning (CLAP) - a probabilistic modeling frame- work over visual-guided text features per task, thus providing more calibrated CL finetuning.

NPCL: Neural Processes for Uncertainty-Aware Continual Learning

Saurav Jha, Dong Gong, He Zhao, Lina Yao

NeurIPS 2023

Paper Code

We propose a neural process-based continual learning approach with task-specific modules arranged in a hierarchical latent variable model. We tailor regularizers on the learned latent distributions to alleviate forgetting.

Towards Exemplar-Free Continual Learning in Vision Transformers: an Account of Attention, Functional and Weight Regularization

Francesco Pelosin^*, Saurav Jha^*, Andrea Torsello, Bogdan Raducanu, Joost van de Weijer

CVPR 2022 Workshop on Continual Learning (CLVision)

Paper Code

We investigate the continual learning of Vision Transformers (ViT) for the challenging exemplar-free scenario, with special focus on how to efficiently distill the knowledge of its crucial self-attention mechanism.

news

talks & seminars

Experience

IVADO postdoctoral fellow (Sep 2025 - Present)

MILA • Montréal, Canada 🇨🇦

Applied AI Scientist (May 2025 - Aug 2025)

OpenStream.ai • Melbourne, Australia 🇦🇺

AI Research Intern (Sep 2024 - Mar 2025)

LightSpeed Studios, Tencent • Sydney, Australia 🇦🇺

Research Scientist Intern (May 2024 - Aug 2024)

Creative AI Lab, Sony Group Corporation • Tokyo, Japan 🇯🇵

Research Assistant (Sep 2021 - Jan 2022)

Computer Vision Centre, Universitat Autònoma de Barcelona • Barcelona, Spain 🇪🇸

Research Intern (Mar 2021 - Jul 2021)

Multispeech group, Inria Nancy • Nancy, France 🇫🇷

Machine Learning Engineer (Jun 2018 - Jul 2019)

FactSet Research Systems Inc. • Hyderabad, India 🇮🇳

Awards & Recognition

Academic Services

tutoring at UNSW

Selected Works

Reconstruction or Semantics? What Makes a Latent Space Useful for Robotic World Models

REAM: Merging Improves Pruning of Experts in LLMs

Probing the Effectiveness of World Models for Spatial Reasoning through Test-Time Scaling

Mining Your Own Secrets: Diffusion Classifier Scores for Continual Personalization of Text-to-Image Diffusion Models

CLAP4CLIP: Continual LeArning with Probabilistic finetuning for Vision-Language Models

NPCL: Neural Processes for Uncertainty-Aware Continual Learning

Towards Exemplar-Free Continual Learning in Vision Transformers: an Account of Attention, Functional and Weight Regularization