news
Oct 08, 2025 | New preprint out: Boomerang Distillation Enables Zero-Shot Model Size Interpolation. We show that given a teacher and a single distilled student model, you can create models of intermediate sizes without any additional training! |
---|---|
May 08, 2025 | I was selected to receive a Kempner Institute Graduate Fellowship! |
May 01, 2025 | Three papers accepted to ICML 2025: Universal Neural Optimal Transport (main conference), Entropy-Driven Pre-Tokenization for Byte-Pair Encoding (Tokenization Workshop), and Guided Speculative Inference for Efficient Test-Time Alignment of LLMs (Spotlight at ES-FoMo Workshop). |