Nikolay Kozlov's picture

Nikolay Kozlov

NikolayKozloff

·

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

kaitchup/Qwen2.5-72B-Instruct-AutoRound-GPTQ-2bit

liked a model 1 day ago

kaitchup/Qwen2.5-72B-Instruct-AutoRound-GPTQ-4bit

liked a model 1 day ago

HuggingFaceTB/SmolVLM-Instruct

View all activity

Organizations

None yet

NikolayKozloff's activity

upvoted a collection 9 days ago

NeMo Curator - Classifier Models

Classifier models that can be used in NeMo Curator for labelling/filtering datasets. • 2 items • Updated Oct 1 • 2

upvoted 2 collections 12 days ago

LLäMmlein Chat Preview 🐑

https://www.informatik.uni-wuerzburg.de/datascience/projects/nlp/llammlein/ • 8 items • Updated 5 days ago • 9

LLäMmlein 🐑

https://www.informatik.uni-wuerzburg.de/datascience/projects/nlp/llammlein/ • 5 items • Updated 8 days ago • 7

upvoted 3 collections 16 days ago

Qwen2.5-Coder

Code-specific model series based on Qwen2.5 • 40 items • Updated 3 days ago • 231

🍓 Ichigo v0.4

The experimental family designed to train LLMs to understand sound natively. • 2 items • Updated 17 days ago • 6

Open LLM Leaderboard best models ❤️‍🔥

A daily uploaded list of models with best evaluations on the LLM leaderboard: • 60 items • Updated about 2 hours ago • 446

upvoted a collection 19 days ago

OpenCoder

OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated 5 days ago • 74

upvoted a collection 22 days ago

OS-Atlas

OS-Atlas series models • 7 items • Updated 10 days ago • 12

upvoted 2 collections 27 days ago

QTIP Quantized Models

See https://github.com/Cornell-RelaxML/qtip • 27 items • Updated 6 days ago • 5

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated about 12 hours ago • 181

upvoted a collection 28 days ago

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated about 23 hours ago • 97

upvoted 7 collections about 1 month ago

INTELLECT-1 Dataset

INTELLECT-1 Training dataset • 5 items • Updated Oct 8 • 14

steiner-preview

Reasoning models trained on synthetic data using reinforcement learning. • 3 items • Updated Oct 20 • 24

Granite 3.0 Language Models

A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated 23 days ago • 92

v4

18 items • Updated Oct 20 • 24

Arch-Function

6 items • Updated 29 days ago • 8

ApolloMoE & Apollo2

English, Chinese, French, Hindi, Spanish, Arabic, Russian, Japanese, Korean, German, Italian, Portuguese and 38 Minor Languages • 7 items • Updated Oct 15 • 3

LoLCATS

Linearizing LLMs with high quality and efficiency. We linearize the full Llama 3.1 model family -- 8b, 70b, 405b -- for the first time! • 4 items • Updated Oct 14 • 14

upvoted 2 collections about 2 months ago

DCLM

DCLM Models + Datasets • 7 items • Updated Jul 22 • 41

Qwen2

Qwen2 language models, instruction-tuned models of 3 sizes: 0.5B, 1.5B, 7B. • 3 items • Updated Jun 13 • 1