Lysandre

lysandre

AI & ML interests

I like open source.

lysandre's activity

New activity in THUDM/CogVideoX-5b 23 days ago

Update transformers version

#1 opened 23 days ago by lysandre
New activity in mistralai/Mistral-Large-Instruct-2407 about 2 months ago
New activity in meta-llama/Meta-Llama-3.1-405B about 2 months ago

Update tokenizer to prepend special token

#12 opened about 2 months ago by lysandre
New activity in meta-llama/Meta-Llama-3.1-70B about 2 months ago

Update tokenizer to prepend special token

1
#11 opened about 2 months ago by lysandre
New activity in meta-llama/Meta-Llama-3.1-405B-FP8 about 2 months ago

Update tokenizer to prepend special token

#12 opened about 2 months ago by lysandre
New activity in meta-llama/Meta-Llama-3.1-8B about 2 months ago

Update tokenizer to prepend special token

1
#12 opened about 2 months ago by lysandre
New activity in meta-llama/Meta-Llama-3.1-405B-Instruct about 2 months ago

Upload tokenizer

1
#9 opened about 2 months ago by ArthurZ
New activity in meta-llama/Meta-Llama-3.1-70B-Instruct about 2 months ago

Upload tokenizer

1
#12 opened about 2 months ago by ArthurZ
New activity in meta-llama/Meta-Llama-3.1-8B-Instruct about 2 months ago

Upload tokenizer

2
#29 opened about 2 months ago by ArthurZ
New activity in meta-llama/Meta-Llama-3.1-405B-Instruct-FP8 about 2 months ago

Upload tokenizer

1
#9 opened about 2 months ago by ArthurZ
New activity in meta-llama/Meta-Llama-3.1-70B-Instruct about 2 months ago

configuration-changes

#1 opened about 2 months ago by lysandre
New activity in meta-llama/Meta-Llama-3.1-405B-Instruct about 2 months ago

Update original/mp16/README.md

#1 opened about 2 months ago by marcsun13

Update original/mp8/README.md

#2 opened about 2 months ago by marcsun13
New activity in meta-llama/Meta-Llama-3.1-405B about 2 months ago

Update original/mp16/README.md

#5 opened about 2 months ago by marcsun13

Update original/mp8/README.md

#4 opened about 2 months ago by marcsun13
New activity in meta-llama/Meta-Llama-3.1-8B about 2 months ago
New activity in meta-llama/Meta-Llama-3.1-70B about 2 months ago
New activity in meta-llama/Meta-Llama-3.1-405B about 2 months ago
New activity in meta-llama/Meta-Llama-3.1-405B-FP8 about 2 months ago
New activity in yentinglin/Llama-3-Taiwan-8B-Instruct-128k 2 months ago

TGI model serving errors

6
#4 opened 3 months ago by wennycooper
New activity in shenzhi-wang/Gemma-2-27B-Chinese-Chat 3 months ago

Default to eager attention

2
#1 opened 3 months ago by lysandre
New activity in google/gemma-2-27b-it 3 months ago
New activity in google/gemma-2-27b 3 months ago
New activity in google/gemma-2-27b-it 3 months ago

Default to eager implementation

#21 opened 3 months ago by lysandre
New activity in google/gemma-2-27b 3 months ago
New activity in google/gemma-2-9b-it 3 months ago
New activity in google/gemma-2-27b 3 months ago
New activity in huggingface/cookbook-images 4 months ago

Upload agents_db5.png

1
#15 opened 4 months ago by m-ric
New activity in facebook/blenderbot-3B 4 months ago
New activity in microsoft/Phi-3-mini-128k-instruct 4 months ago

About Transformers version

2
#58 opened 4 months ago by AllenChai
New activity in openai-community/gpt2 5 months ago

model output

2
#86 opened 6 months ago by foxsilverfox

🚩 Report

#87 opened 6 months ago by beerbubbles
New activity in facebook/wav2vec2-xls-r-1b-21-to-en 6 months ago

Incorrect config file

4
#5 opened 6 months ago by shrey-jasuja
New activity in facebook/xlm-roberta-xl 6 months ago
New activity in lysandre/bert-test 6 months ago

shhhhh

#3 opened 6 months ago by SFconvertbot

nononon

#2 opened 6 months ago by SFconvertbot
New activity in open-source-metrics/stars 7 months ago

Fix splits

#2 opened 7 months ago by lhoestq