On Retrieval Augmentation and the Limitations of Language Model Training Paper • 2311.09615 • Published Nov 16, 2023 • 1
DeLLMa: A Framework for Decision Making Under Uncertainty with Large Language Models Paper • 2402.02392 • Published Feb 4 • 5
IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representations Paper • 2404.01266 • Published Apr 1 • 2
Transformer-Based Models Are Not Yet Perfect At Learning to Emulate Structural Recursion Paper • 2401.12947 • Published Jan 23 • 2
Only-IF:Revealing the Decisive Effect of Instruction Diversity on Generalization Paper • 2410.04717 • Published Oct 7 • 17
From Symbolic Tasks to Code Generation: Diversification Yields Better Task Performers Paper • 2405.19787 • Published May 30 • 1
PLUM: Preference Learning Plus Test Cases Yields Better Code Language Models Paper • 2406.06887 • Published Jun 11 • 1