Mandana Samiei

I am a PhD candidate in Computer Science at Mila - Quebec AI Institute and the Reasoning and Learning Lab at McGill University in Montréal, where I am advised by Prof. Doina Precup and Prof. Blake Richards. I am interested in designing AI systems that learn and adapt continually: rather than mastering specific tasks, they learn general skills and features that can be reused for a new task, dataset, or environment. I study the underlying models of such adaptive and robust generalization.

Just as humans can continuously adapt, can AI agents achieve similar abilities?

My research centers on exploring how AI agents can efficiently learn abstractions of intricate sensory observations of the world, enabling them to reason, plan, and make decisions. Recently, I have become increasingly interested in understanding the role of structure in designing adaptable agents that can receive observations or images and output actions, policies, or human-readable textual content.

Email CV GitHub Scholar BlueSky Twitter LinkedIn

Finally, I would like to share a quote from Jean Piaget, whose research has inspired me: "Scientific knowledge is in perpetual evolution; it finds itself changed from one day to the next".

Highlights & News

May 2025 Our work Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists? has been accepted to the Conference on Language Modeling (COLM) 2025.

May 2025 We are organizing an Early-Career Spotlight Program at CoLLAs 2025; this is a joint effort with Martin Mundt, Amal Rannen-Triki, Razvan Pascanu, and Sarath Chandar.

Mar 2025 I am going to give a tutorial on Large Language Models at the Mediterranean Machine Learning Summer School (M2L) in Split, Croatia, 8-12 September 2025.

Feb 2025 My abstract submission on "The Role of Schemas in Reinforcement Learning: Insights and Implications for Generalization" has been accepted at RLDM 2025; I will be in Dublin June 10-13.

Jan 2025 Serving as a Diversity and Inclusion (D&I) Chair for the Fourth Conference on Lifelong Learning Agents - CoLLAs 2025.

Dec 2024 Our work on Testing Causal Hypotheses Through Hierarchical Reinforcement Learning will be presented as a poster at the NeurIPS 2024 Workshop on Intrinsically Motivated Open-ended Learning (IMOL).

Oct 2024 Possible Principles for Aligned Structure Learning Agents is on arXiv.

May 2024 Serving as a senior organizer for the Women in Machine Learning Workshop at NeurIPS 2024.

Apr 2024 Serving as a teaching assistant at the Eastern European Machine Learning Summer School 2024 in Serbia.

Feb 2024 Recognized as a senior reviewer at the First Edition of the Reinforcement Learning Conference - RLC 2024.

Jan 2024 Serving as a senior organizer for the Women in Machine Learning Symposium at ICML 2024.

Sep 2023 Serving as an organizer for the Machine Learning Reproducibility Challenge - MLRC 2023.

Aug 2023 Have been awarded the Fonds de Recherche du Québec Nature et technologies (FRQNT) for my proposal on Efficient Reinforcement Learning using Episodic Memory.

Apr 2023 Serving as a lead organizer for the Women in Machine Learning Un-workshop at ICML 2023.

Mar 2023 Have been selected to participate at the Bellairs Invitational Workshop on Time, Input, and Action Abstraction in Reinforcement Learning. 🏝

Feb 2023 Have been selected as a member of the board of directors of Women in Machine Learning.

Nov 2022 Serving as a local chair at the 2nd Conference on Lifelong Learning Agents - CoLLAs 2023.

Mar 2022 Serving as an organizer at the 1st Conference on Lifelong Learning Agents - CoLLAs 2022.

Feb 2022 Have been awarded the Women in AI Excellence Doctoral Scholarship at Mila - Quebec AI Institute.

Jan 2022 Serving as a teaching assistant for the COMP 767: Reinforcement Learning course at McGill University.

Sep 2021 Invited talk on Towards Efficient Generalization in Continual RL using Episodic Memory at Microsoft Research Summit 2021.

Jul 2021 Serving as a content creator for the RL for Games tutorial at Neuromatch Academy 2021.

Mar 2021 Started working on Memory for Reinforcement Learning as part of the Microsoft Research Grant, in collaboration with Mehdi Fatemi, Ida Momennejad, and John Langford.

Jan 2021 Have been awarded the Unifying Neuroscience and Artificial Intelligence (UNIQUE) PhD Excellence Scholarship.

Nov 2020 Co-organizing an ICLR 2021 workshop on Rethinking ML Papers.

Nov 2020 Our work on Mimicking mammalian navigation in Watermaze using brain-inspired representations has been selected for poster presentation at the Workshop on Biological and Artificial Reinforcement Learning, to be held during NeurIPS 2020.

Apr 2020 Have been selected as an organizer for the Women in Machine Learning Un-workshop at ICML 2020.

Jan 2020 Yay! Started my PhD at McGill University and Mila - Quebec AI Institute.

Dec 2019 Successfully defended my Master's dissertation.

Research

The Role of Schemas in Reinforcement Learning: Insights and Implications for Generalization

In cognitive psychology, schemas are considered to be “building blocks” of cognition, shaping how people view the world and interact with it. The goal of this paper is to propose a method for learning schemas in RL. We argue that, by representing tasks through schemas, agents can more effectively generalize from past experiences and adapt to new, unseen environments with minimal data.

with Doina Precup and Blake Richards

Beyond Multitask learning in Continual Learning

Continual learning solutions often treat multitask learning as an upper bound on what the learning process can achieve. This is a natural assumption, given that this objective directly addresses the catastrophic forgetting problem, which was a central focus of early work. However, depending on the nature of the distributional shift in the data, the multitask solution is not always optimal for the broader continual learning problem. We draw on principles from online learning to formalize the limitations of multitask objectives.

with Giulia Lanzillotta, Claire Vernade, and Razvan Pascanu

Testing Causal Hypotheses Through Hierarchical Reinforcement Learning

We propose hierarchical reinforcement learning (HRL) as a key ingredient in building agents that can systematically generate and test hypotheses, enabling transferable learning about the world, and we discuss potential implementation strategies.

Presented as a poster at the Intrinsically Motivated Open-ended Learning (IMOL) Workshop at NeurIPS 2024.

with Dongyan Lin and Anthony Chen

Towards Efficient Generalization in Continual RL using Episodic Memory

Part of a collaboration between Microsoft Research and Mila; work done with Blake Richards, Guillaume Lajoie, Mehdi Fatemi, Ida Momennejad, Geoffrey J. Gordon, and John Langford. Gave an invited talk at Microsoft Research Summit 2021. Slides.

Possible Principles for Aligned Structure Learning Agents

This paper offers a roadmap for the development of scalable aligned artificial intelligence (AI) from first-principles descriptions of natural intelligence. Read More.

with Lancelot Da Costa and Cristian Dragos-Manta

The Cancer Genome Atlas Program (TCGA) Clinical Benchmark

This paper provides a benchmark for studying the relationship between cancer and gene expression. Each task combines a clinical feature with a cancer study. Example tasks include predicting gender, age, documented alcohol history, family history of stomach cancer, the disease code, and other clinical attributes for different types of patients based on their gene expression. Read More. GitHub.

with Joseph Paul Cohen and Thomas Fevens

Torchmeta: A meta-learning library for PyTorch

We introduce Torchmeta, a library built on top of PyTorch that enables seamless and consistent evaluation of meta-learning algorithms on multiple datasets, by providing data-loaders for most of the standard benchmarks in few-shot classification and regression, with a new meta-dataset abstraction. Read More. GitHub.

with Tristan Deleu and Yoshua Bengio

Organizing Committee

D&I chair at Fourth Conference on Lifelong Learning Agents - CoLLAs 2025

Local chair at Second Conference on Lifelong Learning Agents - CoLLAs 2023

Organizer at First Conference on Lifelong Learning Agents - CoLLAs 2022

Senior organizer with Eda Okur - 19th Women in Machine Learning Workshop NeurIPS 2024

Senior organizer with Caroline Weis - Women in Machine Learning Symposium ICML 2024

Senior organizer - Women in Machine Learning Un-workshop ICML 2023

Organizer - Women in Machine Learning Un-workshop ICML 2020

Organizer at Machine Learning Reproducibility Challenge - MLRC 2023

Teaching

EEML PyTorch and Colab intro, Summer 2024

Taught by Nemanja Rakićević and Matko Bošnjak

Teaching Assistant: COMP 767 Reinforcement Learning, Winter 2021

Taught by Prof. Doina Precup

Teaching Assistant: COMP 417 Intro to Robotics & Intelligent Systems, Fall 2020

Taught by Prof. Dave Meger

Teaching Assistant: IFT 6390 Fundamentals of Machine Learning, Fall 2021

Taught by Prof. Ioannis Mitliagkas and Prof. Guillaume Rabusseau

Reviewing

Conference on Neural Information Processing Systems - NeurIPS 2025
Reinforcement Learning Conference - RLC 2024 & 2025
Conference on Lifelong Learning Agents - CoLLAs 2022 & 2024 & 2025
The International Conference on Learning Representations - ICLR 2023 & 2024
Transactions on Machine Learning Research - TMLR 2023 & 2024 & 2025
Conference on Neural Information Processing Systems - NeurIPS 2020 & 2023
Generative Models for Decision Making workshop at ICLR 2024
Decision Awareness in Reinforcement Learning (DARL) at ICML 2022

Conference Photo Gallery

A selection of photos from various conferences I have attended, including the Women in ML workshop at NeurIPS 2024 in Vancouver, the EEML 2024 Summer School in Novi Sad, Serbia, and CoLLAs 2024 in Pisa, Italy.

Credit to Jon Barron for the template.
Last updated in Jan 2025.