2 3 1

Ofir Zafrir

ofirzaf

AI & ML interests

Sparsity, Qunatization, Model Compression

Recent Activity

authored a paper 6 days ago

Q8BERT: Quantized 8Bit BERT

authored a paper 6 days ago

FastDraft: How to Train Your Draft

upvoted a paper 8 days ago

FastDraft: How to Train Your Draft

View all activity

Articles

A Chatbot on your Laptop: Phi-2 on Intel Meteor Lake

Mar 20

• 5

Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding

Jan 30

• 4

Organizations

ofirzaf's activity

authored 2 papers 6 days ago

Q8BERT: Quantized 8Bit BERT

Paper • 1910.06188 • Published Oct 14, 2019 • 1

FastDraft: How to Train Your Draft

Paper • 2411.11055 • Published 11 days ago • 9

upvoted a paper 8 days ago

FastDraft: How to Train Your Draft

Paper • 2411.11055 • Published 11 days ago • 9

upvoted a paper 4 months ago

RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation

Paper • 2408.02545 • Published Aug 5 • 33

New activity in microsoft/Phi-3-mini-4k-instruct 7 months ago

Changed instruction/chat template

#54 opened 7 months ago by

ofirzaf

authored a paper over 1 year ago

An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs

Paper • 2306.16601 • Published Jun 28, 2023 • 4

liked a Space over 1 year ago

Running on CPU Upgrade

11.9k

🏆

Open LLM Leaderboard 2

Track, rank and evaluate open LLMs and chatbots

updated 2 models about 2 years ago

Intel/distilbert-base-uncased-squadv1.1-sparse-80-1x4-block-pruneofa

Question Answering • Updated Sep 20, 2022 • 21

Intel/distilbert-base-uncased-sparse-80-1x4-block-pruneofa

Fill-Mask • Updated Aug 28, 2022 • 3

updated 5 models over 2 years ago

updated a model almost 3 years ago

Intel/bert-large-uncased-squadv1.1-sparse-90-unstructured

Question Answering • Updated Dec 5, 2021 • 39

updated 4 models over 3 years ago

Intel/bert-base-uncased-mnli-sparse-70-unstructured-no-classifier

Fill-Mask • Updated Jun 29, 2021 • 7

Intel/bert-base-uncased-sparse-1_2

Updated Jun 24, 2021 • 7

Intel/bert-base-uncased-mnli-sparse-70-unstructured

Text Classification • Updated May 24, 2021 • 33

Intel/bert-base-uncased-sparse-70-unstructured

Fill-Mask • Updated May 24, 2021 • 6