Ahmad's picture

Ahmad

AhmadHakami

·

AI & ML interests

None yet

Organizations

AhmadHakami's activity

upvoted 2 collections 6 days ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 8 items • Updated 5 days ago • 160

AMD-OLMo

AMD-OLMo are a series of 1 billion parameter language models trained by AMD on AMD Instinct™ MI250 GPUs based on OLMo. • 4 items • Updated 9 days ago • 16

upvoted an article 13 days ago

Article

Visually Multilingual: Introducing mcdse-2b

By

•

13 days ago

• 37

upvoted 2 collections 16 days ago

Stable Diffusion 3.5

6 items • Updated 11 days ago • 83

C4AI Aya Expanse

Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 3 items • Updated 16 days ago • 25

upvoted a paper 18 days ago

Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages

Paper • 2410.16153 • Published 19 days ago • 42

upvoted an article 22 days ago

Article

Open-source LLMs as LangChain Agents

Jan 24

• 34

upvoted a collection 24 days ago

Model2Vec base models

These are the Minishlab Model2Vec base models. Load them and use them with model2vec (https://github.com/MinishLab/model2vec) or sentence-transformers • 7 items • Updated 12 days ago • 8

upvoted an article about 1 month ago

Article

Llama can now see and run on your device - welcome Llama 3.2

Sep 25

• 164

upvoted a paper about 1 month ago

Robust Speech Recognition via Large-Scale Weak Supervision

Paper • 2212.04356 • Published Dec 6, 2022 • 23

upvoted 2 collections about 1 month ago

NVLM 1.0

A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks. • 1 item • Updated Oct 1 • 48

Parakeet

NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 8 items • Updated Oct 1 • 20

upvoted 4 collections about 2 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 16 days ago • 452

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 22 items • Updated 1 day ago • 91

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated Sep 18 • 215

Core ML Segment Anything 2

8 items • Updated Oct 4 • 26

upvoted 3 papers about 2 months ago

PiTe: Pixel-Temporal Alignment for Large Video-Language Model

Paper • 2409.07239 • Published Sep 11 • 11

Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents

Paper • 2408.07199 • Published Aug 13 • 20

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Paper • 2409.01704 • Published Sep 3 • 80

upvoted a collection about 2 months ago

DataGemma Release

A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated Sep 12 • 77