Jenish-23's picture

6 8 38

Jenish-23

Jenish-23

·

jenish2014

AI & ML interests

Personal and Study

Organizations

None yet

Jenish-23's activity

upvoted 4 papers 9 months ago

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Paper • 2402.14905 • Published Feb 22 • 126

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

Paper • 2402.14658 • Published Feb 22 • 82

Neural Network Diffusion

Paper • 2402.13144 • Published Feb 20 • 94

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset

Paper • 2402.10176 • Published Feb 15 • 34

upvoted a collection 10 months ago

Pretrained Text-Generation Models Below 250M Parameters

Great candidates for fine-tuning targeting Transformers.js, ordered by number of parameters. • 8 items • Updated Aug 10 • 7

upvoted a paper 10 months ago

TinyLlama: An Open-Source Small Language Model

Paper • 2401.02385 • Published Jan 4 • 89

upvoted 2 collections 11 months ago

Small_Language_Models

23 items • Updated Feb 1 • 1

Trained Models 🏋️

They may be small, but they're training like giants! • 8 items • Updated May 13 • 16