news

Jan 22, 2025 Excited to share that DeLLMa has been accepted to ICLR 2025 as a Spotlight presentation. See you in Singapore 🇸🇬
Jan 6, 2025 Introducing METAGENE-1🧬, a 7B parameter metagenomic foundation model capable of pandemic monitoring, pathogen detection, and multi-species genomics.
Oct 7, 2024 Visiting Philadelphia 🥪 to attend the Conference on Language Modeling and present IsoBench!
Sep 3, 2024 Visiting Polymathic AI at the Flatiron Institute to work on foundation models for multi-disciplinary sciences.
Jul 10, 2024 IsoBench has been accepted to the inaugural Conference on Language Modeling. Dataset preview now available on Hugging Face 🤗
May 20, 2024 Started an internship at Microsoft Research with the AI Frontiers Team.
Apr 18, 2024 I gave a talk on DeLLMa at the Information Sciences Institute NLG Seminar. Check out the video here ✌️
Apr 1, 2024 We introduce IsoBench🔥, an evaluation suite that benchmarks multimodal foundation models on isomorphic representations!
Mar 13, 2024 Our work, On Retrieval Augmentation and the Limitations of Language Model Training, has been accepted to NAACL 2024 🇲🇽
Feb 6, 2024 New preprint available! We introduce DeLLMa🤔, a large language model-based framework for making rational decisions under uncertainty.
Jan 16, 2024 Our paper Interpretable Diffusion via Information Decomposition has been accepted for poster presentation at ICLR 2024! First time traveling to Vienna ✈️🇦🇹
Dec 10, 2023 Flying over to NOLA to attend NeurIPS 2023. Let’s meet up! Michael and I will be presenting our work on the mechanistic interpretability of how GPT-2 implements a mathematical reasoning task.
Nov 17, 2023 I will present two papers on mechanistic interpretability and information-theoretic diffusion at SoCalNLP 2023. See you at UCLA!