Arthur Zucker

ArthurZ

AI & ML interests

None yet

Articles

Organizations

ArthurZ's activity

New activity in meta-llama/Meta-Llama-3.1-405B-Instruct-FP8 about 1 month ago

8-kv-heads

8
#14 opened about 1 month ago by ArthurZ
New activity in meta-llama/Meta-Llama-3.1-405B-FP8 about 1 month ago

Update config.json

#17 opened about 1 month ago by ArthurZ

Config KV Heads should be 8 now?

1
#16 opened about 1 month ago by tanmaylaud
New activity in meta-llama/Meta-Llama-3.1-405B-Instruct-FP8 about 1 month ago

8 kv heads

2
#13 opened about 1 month ago by kkokkie2360
New activity in meta-llama/Meta-Llama-3.1-405B-FP8 about 1 month ago

8-kv-heads

#15 opened about 2 months ago by ArthurZ
New activity in meta-llama/Meta-Llama-3.1-405B about 1 month ago

8-kv-heads

3
#21 opened about 2 months ago by ArthurZ
New activity in meta-llama/Meta-Llama-3.1-405B-Instruct about 1 month ago

8-kv-heads

4
#17 opened about 2 months ago by ArthurZ
New activity in meta-llama/Meta-Llama-3.1-405B-FP8 about 2 months ago
New activity in meta-llama/Meta-Llama-3.1-70B about 2 months ago

Update tokenizer to prepend special token

1
#11 opened about 2 months ago by lysandre
New activity in meta-llama/Meta-Llama-3.1-8B-Instruct about 2 months ago

Upload tokenizer

2
#29 opened about 2 months ago by ArthurZ

Upload tokenizer

#28 opened about 2 months ago by ArthurZ
New activity in meta-llama/Meta-Llama-3.1-405B-Instruct-FP8 about 2 months ago

Upload tokenizer

1
#9 opened about 2 months ago by ArthurZ
New activity in meta-llama/Meta-Llama-3.1-8B about 2 months ago

Update tokenizer to prepend special token

1
#12 opened about 2 months ago by lysandre
New activity in meta-llama/Meta-Llama-3.1-405B-Instruct about 2 months ago

Upload tokenizer

1
#9 opened about 2 months ago by ArthurZ
New activity in meta-llama/Meta-Llama-3.1-70B-Instruct about 2 months ago

Upload tokenizer

1
#12 opened about 2 months ago by ArthurZ
New activity in ArthurZ/new-t5-base about 2 months ago

Upload tokenizer

#1 opened about 2 months ago by ArthurZ
New activity in meta-llama/Meta-Llama-3.1-8B-Instruct about 2 months ago

Upload tokenizer

#27 opened about 2 months ago by ArthurZ
New activity in meta-llama/Meta-Llama-3.1-70B-Instruct about 2 months ago

Upload tokenizer

#11 opened about 2 months ago by ArthurZ
New activity in meta-llama/Meta-Llama-3.1-8B-Instruct about 2 months ago

DO NOT MERGE test for vllm

2
#11 opened about 2 months ago by ArthurZ
New activity in meta-llama/Meta-Llama-3.1-70B about 2 months ago
New activity in meta-llama/Llama-Guard-3-8B-INT8 about 2 months ago

Update config.json

#6 opened about 2 months ago by ArthurZ
New activity in meta-llama/Llama-Guard-3-8B about 2 months ago

Update config.json

#9 opened about 2 months ago by ArthurZ
New activity in meta-llama/Meta-Llama-3.1-70B about 2 months ago

Update config.json

#9 opened about 2 months ago by ArthurZ
New activity in meta-llama/Meta-Llama-3.1-70B-Instruct about 2 months ago

Update config.json

#6 opened about 2 months ago by ArthurZ
New activity in meta-llama/Meta-Llama-3.1-70B about 2 months ago

Update config.json

#8 opened about 2 months ago by ArthurZ
New activity in meta-llama/Meta-Llama-3.1-8B about 2 months ago

Update config.json

#10 opened about 2 months ago by ArthurZ
New activity in google/gemma-2-9b 3 months ago
New activity in microsoft/Florence-2-large 3 months ago
New activity in mistralai/Mistral-7B-Instruct-v0.3 4 months ago

Slow tokenizer problem.

4
#22 opened 4 months ago by bradhutchings
New activity in mistralai/Mistral-7B-Instruct-v0.3 4 months ago

Upload tokenizer

#6 opened 4 months ago by ArthurZ

Upload tokenizer

#5 opened 4 months ago by ArthurZ
New activity in mistralai/Mistral-7B-v0.3 4 months ago

Update README.md

#4 opened 4 months ago by ArthurZ

Update README.md

#3 opened 4 months ago by ArthurZ
New activity in mistralai/Mistral-7B-Instruct-v0.3 4 months ago

Update README.md

#4 opened 4 months ago by ArthurZ

Update config.json

1
#3 opened 4 months ago by ArthurZ
New activity in mistralai/Mistral-7B-v0.3 4 months ago

Upload MistralForCausalLM

#2 opened 4 months ago by ArthurZ
New activity in mistralai/Mistral-7B-Instruct-v0.3 4 months ago

Upload MistralForCausalLM

#2 opened 4 months ago by ArthurZ
New activity in mistralai/Mistral-7B-v0.3 4 months ago

Upload tokenizer

1
#1 opened 4 months ago by ArthurZ
New activity in mistralai/Mistral-7B-Instruct-v0.3 4 months ago

Upload tokenizer

#1 opened 4 months ago by ArthurZ
New activity in 01-ai/Yi-9B 4 months ago
New activity in meta-llama/Meta-Llama-3-8B-Instruct 4 months ago

Update config.json

1
#105 opened 4 months ago by ArthurZ
New activity in meta-llama/Meta-Llama-3-8B-Instruct 4 months ago

How to use EOT_ID

4
#54 opened 5 months ago by saksham-lamini
New activity in meta-llama/Meta-Llama-3-70B-Instruct 4 months ago

Update config.json

4
#33 opened 5 months ago by ArthurZ

Update README.md

1
#31 opened 5 months ago by kimseungho
New activity in meta-llama/Meta-Llama-3-8B-Instruct 4 months ago

Update tokenizer_config.json

16
#60 opened 5 months ago by Navanit-AI
New activity in meta-llama/Meta-Llama-3-8B-Instruct 5 months ago

Update config.json

1
#71 opened 5 months ago by ArthurZ