Home

FLAIR Lab · Mila & Université de Montréal

Machine learning for the language of life.

We develop methods across the language modeling pipeline for biological sequences such as proteins, genomes, and transcriptomes, with applications in drug discovery.

Recent news
May 11, 2026 New on the blog: A Guide to Scientific Writing. The advice I wish I had before my first paper, learned from the writers and scientists who showed me what good looks like.
Apr 23, 2026 Proud of Lola Le Breton, who presented NeoBERT: A Next Generation BERT at ICLR 2026 as part of the TMLR journal track. With John X. Morris, Mariam El Mezouar, and Sarath Chandar.
Feb 27, 2026 New preprint: CoPeP: Benchmarking Continual Pretraining for Protein Language Models. A benchmark for studying how protein language models can keep up with new data without retraining from scratch. With Darshan Patil, Pranshu Malviya, Mathieu Reymond, and Sarath Chandar. In collaboration with Genentech.
May 22, 2025 New preprint: Structure-Aligned Protein Language Model. We bring 3D structural information into protein language models via lightweight post-training. With Can Chen, David Heurtel-Depeiges, Robert M. Vernon, Christopher James Langmead, and Yoshua Bengio. In collaboration with Amgen.

Open positions

We are recruiting one PhD student and one graduate or undergraduate intern. Get in touch.