Teach Multimodal LLMs to Comprehend Electrocardiographic Images Paper • 2410.19008 • Published Oct 21, 2024 • 22
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report Paper • 2405.00732 • Published Apr 29, 2024 • 118
Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA Article • Published May 24, 2023 • 94
Understanding LLMs: A Comprehensive Overview from Training to Inference Paper • 2401.02038 • Published Jan 4, 2024 • 62
Mixture-of-Depths: Dynamically allocating compute in transformer-based language models Paper • 2404.02258 • Published Apr 2, 2024 • 104
Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs Paper • 2403.20041 • Published Mar 29, 2024 • 34
Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction Paper • 2403.18795 • Published Mar 27, 2024 • 18
The case for 4-bit precision: k-bit Inference Scaling Laws Paper • 2212.09720 • Published Dec 19, 2022 • 3
Simple and Scalable Strategies to Continually Pre-train Large Language Models Paper • 2403.08763 • Published Mar 13, 2024 • 49
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training Paper • 2403.09611 • Published Mar 14, 2024 • 124
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling Paper • 2401.16380 • Published Jan 29, 2024 • 48