Hugging Face
144.5 TFLOPS
672
15
189
Arthur Zucker
ArthurZ
298 followers · 17 following
art_zucker
ArthurZucker
AI & ML interests
None yet
Recent Activity
Reacted to Xenova's post with 🔥 7 days ago
Have you tried out Transformers.js v3? Here are the new features: WebGPU support (up to 100x faster than WASM); new quantization formats (dtypes); 120 supported architectures in total; 25 new example projects and templates; over 1200 pre-converted models; Node.js (ESM + CJS), Deno, and Bun compatibility; a new home on GitHub and NPM. Get started with `npm i @huggingface/transformers`. Learn more in our blog post: https://huggingface.co/blog/transformersjs-v3
Reacted to davidberenstein1957's post 7 days ago
For anyone who struggles with NER or information extraction with LLM. We showed an efficient workflow for token classification including zero-shot suggestions and model fine-tuning with Argilla, GliNER, the NuMind NuExtract LLM and SpanMarker. @argilla Video: https://youtu.be/JvLpaYgNd84?feature=shared Notebooks and slides included to try it yourself
Reacted to LukeNeumann's post with 🤯 7 days ago
Nine years ago, I uploaded the first 8K resolution video to YouTube and I've been stockpiling 8K footage ever since: https://www.youtube.com/watch?v=sLprVF6d7Ug&t Should @Overlaiapp release the first open-source 8K video dataset? Could anyone even fine tune a model with this?
Articles
Fixing Gradient Accumulation, Oct 16 • 42
Improving Hugging Face Training Efficiency Through Packing with Flash Attention, Aug 21 • 22
Fine-Tuning Gemma Models in Hugging Face, Feb 23 • 24
Code Llama: Llama 2 learns to code, Aug 25, 2023 • 8
Organizations
ArthurZ's activity
New activity in mistralai/Pixtral-Large-Instruct-2411 (9 days ago)
Upload transformers version · 5 · #3 opened 9 days ago by ArthurZ
New activity in huggingface/documentation-images (12 days ago)
Upload Meta-Llama-3-8B-Instruct, seqlen = 512, python, w_ compile.png · 1 · #392 opened 13 days ago by kwen2501
New activity in mistral-community/pixtral-12b (about 1 month ago)
Update model weight · 8 · #13 opened about 1 month ago by nguyen-brat
Update hidden_act to silu · 2 · #14 opened about 1 month ago by ArthurZ
New activity in rhymes-ai/Aria (about 2 months ago)
llama.cpp support · 9 · #1 opened about 2 months ago by ayyylol
New activity in google/gemma-2-2b-jpn-it (about 2 months ago)
tokenizer_config.json is different from gemma-2-2b-it · 2 · #8 opened about 2 months ago by dahara1
New activity in mistral-community/pixtral-12b (2 months ago)
How can i use the full 24GB model instead of this separated safetensors files? · 1 · #8 opened 2 months ago by Valadaro
New activity in meta-llama/Llama-3.2-11B-Vision-Instruct (2 months ago)
hidden_activation vs hidden_act in config.json · 2 · #10 opened 2 months ago by heheda
New activity in mistral-community/pixtral-12b-240910 (2 months ago)
How to use safetensors? · 2 · #13 opened 2 months ago by prathi1729
New activity in mistral-community/pixtral-12b (2 months ago)
lamma cpp ht to gguf not working · 4 · #2 opened 2 months ago by RameshRajamani
New activity in meta-llama/Llama-3.1-405B-Instruct-FP8 (3 months ago)
8-kv-heads · 8 · #14 opened 4 months ago by ArthurZ
New activity in meta-llama/Llama-3.1-405B-FP8 (4 months ago)
Update config.json · #17 opened 4 months ago by ArthurZ
Config KV Heads should be 8 now? · 1 · #16 opened 4 months ago by tanmaylaud
New activity in meta-llama/Llama-3.1-405B-Instruct-FP8 (4 months ago)
8 kv heads · 2 · #13 opened 4 months ago by kkokkie2360
New activity in meta-llama/Llama-3.1-405B-FP8 (4 months ago)
8-kv-heads · #15 opened 4 months ago by ArthurZ
New activity in meta-llama/Llama-3.1-405B (4 months ago)
8-kv-heads · 3 · #21 opened 4 months ago by ArthurZ
New activity in meta-llama/Llama-3.1-405B-Instruct (4 months ago)
8-kv-heads · 4 · #17 opened 4 months ago by ArthurZ
New activity in meta-llama/Llama-3.1-405B-FP8 (4 months ago)
Updated eos_token to include multiple IDs · 1 · #14 opened 4 months ago by vontimitta
Update tokenizer to prepend special token · #12 opened 4 months ago by lysandre
New activity in meta-llama/Llama-3.1-70B (4 months ago)
Update tokenizer to prepend special token · 1 · #11 opened 4 months ago by lysandre