Ilyas Moutawwakil

IlyasMoutawwakil

AI & ML interests

Optimization, LLMs, Hardware, Backends, ...

Recent Activity

updated a dataset about 10 hours ago

optimum-benchmark/cpu

updated a dataset about 10 hours ago

optimum-benchmark/cuda

Articles

AMD + 🤗: Large Language Models Out-of-the-Box Acceleration with AMD GPU

Overview of natively supported quantization schemes in 🤗 Transformers

IlyasMoutawwakil's activity

upvoted an article 3 months ago

The 5 Most Under-Rated Tools on Hugging Face

Article • Aug 22 • 85

upvoted a paper 6 months ago

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Paper • 2406.06525 • Published Jun 10 • 65

upvoted an article 6 months ago

CPU Optimized Embeddings with 🤗 Optimum Intel and fastRAG

Article • Mar 15 • 8

upvoted a collection 6 months ago

Fast-RAG Inference Endpoints

An extremely easy-to-deploy RAG pipeline using Inference Endpoints • 3 items • Updated Jun 3 • 1

upvoted an article 7 months ago

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

Article • Mar 22 • 66

upvoted a collection about 1 year ago

Neural Network Compression & Quantization

Tracks papers and links about neural network compression and quantization techniques • 4 items • Updated Sep 22, 2023 • 1

upvoted 2 papers about 1 year ago

BitNet: Scaling 1-bit Transformers for Large Language Models

Paper • 2310.11453 • Published Oct 17, 2023 • 96

LeanDojo: Theorem Proving with Retrieval-Augmented Language Models

Paper • 2306.15626 • Published Jun 27, 2023 • 17