jondurbin commited on
Commit
d0211f7
1 Parent(s): 43f209d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -0
README.md CHANGED
@@ -76,6 +76,7 @@ I didn't run any sort of comprehensive set of benchmarks, but here are a couple
76
  | model | score |
77
  | --- | --- |
78
  | bagel-dpo-8x7b-v0.2 | __0.7242__ |
 
79
  | bagel-8x7b-v0.2 | 0.5921 |
80
 
81
  ### GSM8K
@@ -100,6 +101,12 @@ index ccf6a5a3..df0b7422 100644
100
  filter_list:
101
  ```
102
 
 
 
 
 
 
 
103
  ### Data sources
104
 
105
  *Yes, you will see benchmark names in the list, but this only uses the train splits, and a decontamination by cosine similarity is performed at the end as a sanity check*
 
76
  | model | score |
77
  | --- | --- |
78
  | bagel-dpo-8x7b-v0.2 | __0.7242__ |
79
+ | mixtral-8x7b-instruct-v0.1 | 0.6498 |
80
  | bagel-8x7b-v0.2 | 0.5921 |
81
 
82
  ### GSM8K
 
101
  filter_list:
102
  ```
103
 
104
+ | model | score |
105
+ | --- | --- |
106
+ | bagel-dpo-8x7b-v0.2 | |
107
+ | mixtral-8x7b-instruct-v0.1 | |
108
+ | bagel-8x7b-v0.2 | 0.5360 |
109
+
110
  ### Data sources
111
 
112
  *Yes, you will see benchmark names in the list, but this only uses the train splits, and a decontamination by cosine similarity is performed at the end as a sanity check*