LLM Comparative Assessment: Zero-shot NLG Evaluation through Pairwise Comparisons using Large Language Models • Paper • 2307.07889 • Published Jul 15, 2023 • 1
⚓️ Sailor Language Models • Collection • Sailor: Open Language Models tailored for South-East Asia (SEA), released by Sea AI Lab. • 18 items • Updated Jul 26 • 16
Journal Club • Collection • Candidate papers to read in the H4 journal club • 54 items • Updated Apr 21 • 26
Awesome feedback datasets • Collection • A curated list of datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO. • 19 items • Updated Apr 12 • 65
MQAG: Multiple-choice Question Answering and Generation for Assessing Information Consistency in Summarization • Paper • 2301.12307 • Published Jan 28, 2023 • 3
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models • Paper • 2303.08896 • Published Mar 15, 2023 • 4