grounded-ai/phi3.5-hallucination-judge

Jlonge4 committed on Sep 8

Commit ad19736 • 1 Parent(s): 930af00

Update README.md

Files changed (1)

README.md +1 -1

@@ -93,7 +93,7 @@ We compared our merged model's performance on the hallucination detection benchm
 Scores from arize/phoenix
-As shown in the table, our merged model achieves competitive performance, with an F1 score of 0.82, matching or outperforming several state-of-the-art language models on this hallucination detection task.
+As shown in the table, our merged model achieves competitive performance, with an F1 score of 0.83, matching or outperforming several state-of-the-art language models on this hallucination detection task.
 ## Model description