Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators • Paper • 2403.16950 • Published Mar 25
Function Calling v3 • Collection • Models fine-tuned for function-calling • 14 items • Updated Apr 27