LLM Comparative Assessment: Zero-shot NLG Evaluation through Pairwise Comparisons using Large Language Models Paper • 2307.07889 • Published Jul 15, 2023 • 1
CrossCheckGPT: Universal Hallucination Ranking for Multimodal Foundation Models Paper • 2405.13684 • Published May 22
LLM Comparative Assessment: Zero-shot NLG Evaluation through Pairwise Comparisons using Large Language Models Paper • 2307.07889 • Published Jul 15, 2023 • 1