Nicohst commited on
Commit
cc66773
1 Parent(s): ec848cb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +63 -7
README.md CHANGED
@@ -22,17 +22,73 @@ This model is a fine-tuned version of [unsloth/meta-llama-3.1-8b-instruct-bnb-4b
22
 
23
  ## Model description
24
 
25
- More information needed
26
-
27
- ## Intended uses & limitations
28
-
29
- More information needed
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
30
 
31
  ## Training and evaluation data
32
 
33
- More information needed
34
 
35
- ## Training procedure
36
 
37
  ### Training hyperparameters
38
 
 
22
 
23
  ## Model description
24
 
25
+ This model was trained on Successful episodes of the top 3 model similar to [D20002](clembench-playpen/meta-llama-Meta-Llama-3.1-8B-Instruct_SFT_E1_D20002) but instead of using the whole episode as input,
26
+ each episode was split into conversation pieces.
27
+
28
+ e.g.
29
+ ```json
30
+ [
31
+ {
32
+ role: 'user'
33
+ content: '...'
34
+ },
35
+ {
36
+ role: 'assistant'
37
+ content: '...'
38
+ },
39
+ {
40
+ role: 'user'
41
+ content: '...'
42
+ },
43
+ {
44
+ role: 'assistant'
45
+ content: '...'
46
+ },
47
+ ]
48
+ ```
49
+
50
+ ```json
51
+ is split int:
52
+
53
+ [
54
+ {
55
+ role: 'user'
56
+ content: '...'
57
+ },
58
+ {
59
+ role: 'assistant'
60
+ content: '...'
61
+ },
62
+ ```
63
+
64
+ and
65
+
66
+ ```json
67
+ [
68
+ {
69
+ role: 'user'
70
+ content: '...'
71
+ },
72
+ {
73
+ role: 'assistant'
74
+ content: '...'
75
+ },
76
+ {
77
+ role: 'user'
78
+ content: '...'
79
+ },
80
+ {
81
+ role: 'assistant'
82
+ content: '...'
83
+ },
84
+ ]
85
+ ```
86
 
87
  ## Training and evaluation data
88
 
89
+ After splitting, the dataset contains about 4122 conversation bits accross all games.
90
 
91
+ The Dataset ID is D30003
92
 
93
  ### Training hyperparameters
94