checkpoint-2700
Browse files- README.md +8 -14
- model-00001-of-00004.safetensors +1 -1
- model-00002-of-00004.safetensors +1 -1
- model-00003-of-00004.safetensors +1 -1
- model-00004-of-00004.safetensors +1 -1
README.md
CHANGED
@@ -1,23 +1,21 @@
|
|
1 |
---
|
2 |
-
base_model: meta-llama/Meta-Llama-3-8B
|
3 |
language:
|
4 |
- sv
|
5 |
- da
|
6 |
- 'no'
|
|
|
|
|
|
|
|
|
|
|
|
|
7 |
pipeline_tag: text-generation
|
8 |
inference:
|
9 |
parameters:
|
10 |
temperature: 0.6
|
11 |
-
tags:
|
12 |
-
- pytorch
|
13 |
-
- llama
|
14 |
-
- llama-3
|
15 |
-
- ai-sweden
|
16 |
---
|
17 |
|
18 |
-
# AI-Sweden-Models/Llama-3-8B (checkpoint-
|
19 |
-
![](https://huggingface.co/AI-Sweden-Models/Llama-3-8B/resolve/main/l3swe.png?download=true)
|
20 |
-
|
21 |
|
22 |
### Intended usage:
|
23 |
This is a base model, it can be finetuned to a particular use case.
|
@@ -60,8 +58,4 @@ A total of 92 A100 GPUs were used, and roughly 250GB of data was processed.
|
|
60 |
|
61 |
## Benchmarks
|
62 |
|
63 |
-
Coming soon.
|
64 |
-
|
65 |
-
## Checkpoints
|
66 |
-
* 2700 (20/5/2024)
|
67 |
-
* 1500 (13/5/2024)
|
|
|
1 |
---
|
|
|
2 |
language:
|
3 |
- sv
|
4 |
- da
|
5 |
- 'no'
|
6 |
+
tags:
|
7 |
+
- pytorch
|
8 |
+
- llama
|
9 |
+
- llama-3
|
10 |
+
- ai-sweden
|
11 |
+
base_model: meta-llama/Meta-Llama-3-8B
|
12 |
pipeline_tag: text-generation
|
13 |
inference:
|
14 |
parameters:
|
15 |
temperature: 0.6
|
|
|
|
|
|
|
|
|
|
|
16 |
---
|
17 |
|
18 |
+
# AI-Sweden-Models/Llama-3-8B (checkpoint-1500)
|
|
|
|
|
19 |
|
20 |
### Intended usage:
|
21 |
This is a base model, it can be finetuned to a particular use case.
|
|
|
58 |
|
59 |
## Benchmarks
|
60 |
|
61 |
+
Coming soon.
|
|
|
|
|
|
|
|
model-00001-of-00004.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 4976698672
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:fabc1bfaf757d63c542928748d3852440a71d69140b2763f299f9c3b20d5afe0
|
3 |
size 4976698672
|
model-00002-of-00004.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 4999802720
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:069881fe4241e8b98384b91022dc673d970f379ca7be2357873777024a519204
|
3 |
size 4999802720
|
model-00003-of-00004.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 4915916176
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:65aa0ac9ff9ae7f219e11f53414afa7ec9da841f3bab4f505a6d96878fe3e7ca
|
3 |
size 4915916176
|
model-00004-of-00004.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 1168138808
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:adc4b5b064a042e322b802bd1f17e68eb937fe021435a9e242903b9100cd6bf7
|
3 |
size 1168138808
|