Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints Paper • 2309.16240 • Published Sep 28, 2023
ReCode: Robustness Evaluation of Code Generation Models Paper • 2212.10264 • Published Dec 20, 2022 • 1
Equipping Transformer with Random-Access Reading for Long-Context Understanding Paper • 2405.13216 • Published May 21, 2024
Efficient Shapley Values Estimation by Amortization for Text Classification Paper • 2305.19998 • Published May 31, 2023
Word-level Textual Adversarial Attacking as Combinatorial Optimization Paper • 1910.12196 • Published Oct 27, 2019
Narrative Question Answering with Cutting-Edge Open-Domain QA Techniques: A Comprehensive Study Paper • 2106.03826 • Published Jun 7, 2021
Weakly-Supervised Methods for Suicide Risk Assessment: Role of Related Domains Paper • 2106.02792 • Published Jun 5, 2021
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation? Paper • 2407.04842 • Published Jul 5, 2024 • 53
Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints Paper • 2309.16240 • Published Sep 28, 2023
SOTOPIA-$π$: Interactive Learning of Socially Intelligent Language Agents Paper • 2403.08715 • Published Mar 13, 2024 • 20
WebArena: A Realistic Web Environment for Building Autonomous Agents Paper • 2307.13854 • Published Jul 25, 2023 • 24
Don't Copy the Teacher: Data and Model Challenges in Embodied Dialogue Paper • 2210.04443 • Published Oct 10, 2022
COBRA Frames: Contextual Reasoning about Effects and Harms of Offensive Statements Paper • 2306.01985 • Published Jun 3, 2023 • 1
Learning to Incentivize Information Acquisition: Proper Scoring Rules Meet Principal-Agent Model Paper • 2303.08613 • Published Mar 15, 2023
SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents Paper • 2310.11667 • Published Oct 18, 2023 • 2
FewRel: A Large-Scale Supervised Few-Shot Relation Classification Dataset with State-of-the-Art Evaluation Paper • 1810.10147 • Published Oct 24, 2018
FewRel 2.0: Towards More Challenging Few-Shot Relation Classification Paper • 1910.07124 • Published Oct 16, 2019