Shyam Peri's picture

24 108

Shyam Peri

shyamperi

·

AI & ML interests

None yet

Recent Activity

upvoted a collection about 19 hours ago

liked a Space 9 days ago

vespa-engine/colpali-vespa-visual-retrieval

upvoted a collection about 1 month ago

View all activity

Organizations

shyamperi's activity

upvoted a collection about 19 hours ago

OLMo 2

Artifacts for the second set of OLMo models. • 17 items • Updated about 7 hours ago • 29

upvoted 2 collections about 1 month ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Oct 24 • 506

MedEmbed: Embedding Models for Medical Domain

GitHub -> https://github.com/abhinand5/MedEmbed • 4 items • Updated Oct 21 • 7

upvoted a paper about 2 months ago

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1 • 144

upvoted a paper 2 months ago

Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts

Paper • 2409.16040 • Published Sep 24 • 13

upvoted a collection 2 months ago

DataGemma Release

A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated Sep 12 • 78

upvoted 2 collections 3 months ago

LLM2Encoder

Collection of initial models and models that use converted decoders to encoders as backbones • 11 items • Updated Sep 10 • 6

GLiNER bi-encoders

Bi-encoder and poly-encoder architectures of GLiNER • 5 items • Updated Sep 10 • 12

upvoted 3 articles 4 months ago

Article

Welcome FalconMamba: The first strong attention-free 7B model

Aug 12

• 103

Article

LAVE: Zero-shot VQA Evaluation on Docmatix with LLMs - Do We Still Need Fine-Tuning?

Jul 25

• 18

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 273

upvoted a collection 7 months ago

NuNerZero - Zero Shot NER

The best compact Zero-Shot NER models with MIT license • 4 items • Updated Jul 3 • 19

upvoted 2 articles 7 months ago

Article

How to Finetune phi-3 on MacBook Pro

By

•

Apr 24

• 63

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22

• 227

upvoted a paper 9 months ago

Nemotron-4 15B Technical Report

Paper • 2402.16819 • Published Feb 26 • 42

upvoted 5 papers about 1 year ago

An Emulator for Fine-Tuning Large Language Models using Small Language Models

Paper • 2310.12962 • Published Oct 19, 2023 • 14

Table-GPT: Table-tuned GPT for Diverse Table Tasks

Paper • 2310.09263 • Published Oct 13, 2023 • 39

LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models

Paper • 2309.12307 • Published Sep 21, 2023 • 87

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Paper • 2309.12284 • Published Sep 21, 2023 • 18

Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)

Paper • 2309.08968 • Published Sep 16, 2023 • 22