wenbopan/Faro-Yi-9B


wenbopan committed on Mar 28

Commit 837b0fd • 1 Parent(s): d8b4f73

Update results

Files changed (1)

README.md +8 -8

README.md CHANGED

@@ -20,10 +20,10 @@ Fi-9B enhances its ability compared to Yi-9B-200K in most dimensions, especially
 ### Fact-based Evaluation (Open LLM Leaderboard)
-| **Metric**      | **winogrande** | **hellaswag** | **truthfulqa** | **ai2_arc** |
-|-----------------|----------------|---------------|----------------|-------------|
-| **Yi-9B-200K**  | 71.67          | 56.72         | 33.80          | 69.25       |
-| **Fi-9B-200K**  | 71.11          | **57.28**     | **40.86**      | **72.58**   |
+| **Metric**     | **MMLU**  | GSM8K     | **HellaSwag** | **TruthfulQA** | **Arc** | **Winogrande** |
+| -------------- | --------- | --------- | ------------- | -------------- | ----------- | -------------- |
+| **Yi-9B-200K** | 65.73     | 50.49     | 56.72         | 33.80          | 69.25       | 71.67          |
+| **Fi-9B-200K** | **68.80** | **63.08** | **57.28**     | **40.86**      | **72.58**   | 71.11          |
 ### Long-context Modeling (LongBench)
@@ -46,10 +46,10 @@ Fi-9B enhances its ability compared to Yi-9B-200K in most dimensions, especially
 ### Bilingual Ability (CMMLU & MMLU)
-| **Name**       | **CMMLU** |
-|----------------|-----------|
-| **Yi-9B-200K** | 71.97     |
-| **Fi-9B-200K** | 73.28     |
+| **Name**       | MMLU      | **CMMLU** |
+| -------------- | --------- | --------- |
+| **Yi-9B-200K** | 65.73     | 71.97     |
+| **Fi-9B-200K** | **68.80** | **73.28** |
 ## Current Limitations