Research Papers
Organized by date. I bolded myself as an author for clarity. Google Scholar might be more up-to-date if you are looking for very recent work.
BioBench: A Blueprint to Move Beyond ImageNet for Scientific
ML Benchmarks
Samuel Stevens (Third
Workshop for Imageomics at NeurIPS 2025) [paper] [website] [code]
BioCLIP 2: Emergent Properties from Scaling Hierarchical
Contrastive Learning
Jianyang Gu, Samuel
Stevens, Elizabeth G Campolongo, Matthew J Thompson, Net Zhang,
Jiaman Wu, Andrei Kopanev, Zheda Mai, Alexander E. White, James Balhoff,
Wasila Dahdul, Daniel Rubenstein, Hilmar Lapp, Tanya Berger-Wolf,
Wei-Lun Chao, Yu Su (NeurIPS 2025, Spotlight)
[paper] [website] [demo]
Optimizing image capture for computer vision-powered
taxonomic identification and trait recognition of biodiversity
specimens
Alyson East, Elizabeth G. Campolongo, Luke
Meyers, S. M. Rayeed, Samuel Stevens, Iuliia
Zarubiieva, Isadora E. Fluck, Jennifer C. Giron, Maximiliane Jousse,
Scott Lowe, Kayla I. Perry, Isabelle Betancourt, Noah Charney, Evan
Donoso, Nathan Fox, Kim J. Landsbergen, Ekaterina Nepovinnykh, Michelle
Ramirez, Parkash Singh, Khum Thapa-Magar, Matthew Thompson, Evan Waite,
Tanya Berger-Wolf, Hilmar Lapp, Paula Mabee, Charles Stewart, Graham
Taylor, Sydne Record (Methods in Ecology and Evolution) [paper]
Mind the (Data) Gap: Evaluating Vision Systems in Small Data
Applications
Samuel Stevens, S M Rayeed,
Jenna Kline (Third Workshop for Imageomics at NeurIPS 2025) [paper]
Interpretable and Testable Vision Features via Sparse
Autoencoders
Samuel Stevens, Wei-Lun
Chao, Tanya Berger-Wolf, Yu Su (arXiv Preprint) [paper] [website] [demos] [models]
The Cool and the Cruel: Separating Hard Parts of LWE
Secrets
Niklas Nolte, Mohamed Malhou, Emily Wenger,
Samuel Stevens, Cathy Li, François Charton, Kristin
Lauter (International Conference on Cryptology in Africa,
AFRICACRYPT) [paper] [code]
Salsa Fresca: Angular Embeddings and Pre-Training for ML
Attacks on Learning with Errors
Samuel
Stevens, Emily Wenger, Cathy Yuanchen Li, Niklas Nolte, Eshika
Saxena, François Charton, Kristin Lauter (TMLR 2025) [paper] [OpenReview]
BioCLIP: A Vision Foundation Model for the Tree of
Life
Samuel Stevens*, Jiaman Wu*,
Matthew J Thompson, Elizabeth G Campolongo, Chan Hee Song, David Edward
Carlyn, Li Dong, Wasila M Dahdul, Charles Stewart, Tanya Berger-Wolf,
Wei-Lun Chao, Yu Su (* equal contribution) (CVPR 2024, Best Student
Paper) [paper] [website] [demo]
MMMU: A Massive Multi-Discipline Multimodal Understanding and
Reasoning Benchmark for Expert AGI
Xiang Yue, Yuansheng
Ni, Kai Zhang, Tianyu Zheng, Ruoqi Liu, Ge Zhang, Samuel
Stevens, Dongfu Jiang, Weiming Ren, Yuxuan Sun, Cong Wei, Botao
Yu, Ruibin Yuan, Renliang Sun, Ming Yin, Boyuan Zheng, Zhenzhu Yang,
Yibo Liu, Wenhao Huang, Huan Sun, Yu Su, Wenhu Chen (CVPR 2024,
Best Paper Finalist) [paper] [website]
A Simple Interpretable Transformer for Fine-Grained Image
Classification and Analysis
Dipanjyoti Paul, Arpita
Chowdhury, Xinqi Xiong, Feng-Ju Chang, David Carlyn, Samuel
Stevens, Kaiya Provost, Anuj Karpatne, Bryan Carstens, Daniel
Rubenstein, Charles Stewart, Tanya Berger-Wolf, Yu Su, Wei-Lun Chao
(ICLR 2024) [paper] [code]
A Framework for Autonomic Computing for In Situ
Imageomics
Jenna Kline, Christopher Stewart, Tanya
Berger-Wolf, Michelle Ramirez, Samuel Stevens, Reshma
Ramesh Babu, Namrata Banerji, Alec Sheets, Sowbaranika Balasubramaniam,
Elizabeth Campolongo, Matthew Thompson, Charles V Stewart, Maksim
Kholiavchenko, Daniel I Rubenstein, Nina Van Tiel, Jackson Miliko
(IEEE International Conference on Autonomic Computing and
Self-Organizing Systems) [paper]
Roll Up Your Sleeves: Working with a Collaborative and
Engaging Task-Oriented Dialogue System
Lingbo Mo, Shijie
Chen, Ziru Chen, Xiang Deng, Ashley Lewis, Sunit Singh, Samuel
Stevens, Chang-You Tai, Zhen Wang, Xiang Yue, Tianshu Zhang, Yu
Su, Huan Sun (24th Meeting of the Special Interest Group on
Discourse and Dialogue, SIGDIAL 2023) [paper]
Mind2Web: Towards a Generalist Agent for the Web
Xiang Deng, Yu Gu, Boyuan Zheng, Shijie Chen, Samuel
Stevens, Boshi Wang, Huan Sun, Yu Su (NeurIPS 2023, Spotlight)
[paper] [website]
Memorization for Good: Encryption with Autoregressive
Language Models
Samuel Stevens, Yu Su (arXiv Preprint) [paper] [website]
arXivEdits: Understanding the human revision process in
scientific writing
Chao Jiang, Wei Xu, Samuel
Stevens (EMNLP 2022) [paper] [code+data]
Bootstrapping a User-Centered Task-Oriented Dialogue
System
Shijie Chen, Ziru Chen, Xiang Deng, Ash Lewis,
Lingbo Mo, Samuel Stevens, Zhen Wang, Xiang Yue,
Tianshu Zhang, Yu Su, Huan Sun (Alexa Prize TaskBot Challenge
Proceedings 2022) [paper] [website]
An Investigation of Language Model Interpretability via
Sentence Editing
Samuel Stevens, Yu Su (EMNLP BlackboxNLP 2021) [paper] [code]
SalsaBot: Towards a Robust and Generalizable Embodied
Agent
Chan Hee Song, Jiaman Wu, Ju-Seung Byeon, Zexin Xu,
Vardaan Pahuja, Goonmeet Bajaj, Samuel Stevens, Ziru
Chen, Yu Su (short paper at Embodied AI Workshop at CVPR 2023, long
paper in Alexa Prize SimBot Challenge Proceedings 2023) [short paper] [long paper]
Sam Stevens, 2024