Umut Hope YILDIRIM PRO

umuthopeyildirim

https://umutyildirim.com

umuthopeyildirim's activity

upvoted an article 11 days ago

Decoding Strategies in Large Language Models

Article • 11 days ago • 34

upvoted a paper about 1 month ago

HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale

Paper • 2409.16299 • Published Sep 9 • 9

upvoted a paper 3 months ago

Gemma 2: Improving Open Language Models at a Practical Size

Paper • 2408.00118 • Published Jul 31 • 73

upvoted 2 papers 4 months ago

GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression

Paper • 2407.12077 • Published Jul 16 • 54

Searching for Best Practices in Retrieval-Augmented Generation

Paper • 2407.01219 • Published Jul 1 • 11

upvoted a collection 4 months ago

Noisy OCR

4 items • Updated Jul 8 • 1

upvoted a paper 5 months ago

GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices

Paper • 2406.08451 • Published Jun 12 • 23

upvoted 2 collections 5 months ago

PEFT

200 items • Updated Jul 8 • 14

Model Merging

Model merging is a very popular technique in LLMs nowadays. Here is a chronological list of papers in the space that will help you get started with it! • 30 items • Updated Jun 12 • 217

upvoted a collection 7 months ago

MoEs papers reading list

60 items • Updated 5 days ago • 134

upvoted a collection 8 months ago

DBRX

DBRX is a mixture-of-experts (MoE) large language model trained from scratch by Databricks. • 3 items • Updated Mar 27 • 91

upvoted a paper 8 months ago

Matryoshka Representation Learning

Paper • 2205.13147 • Published May 26, 2022 • 9

upvoted a collection 9 months ago

🔍 Daily Picks in Interpretability & Analysis of LMs

Outstanding research in interpretability and evaluation of language models, summarized • 80 items • Updated 11 days ago • 90

upvoted a paper 9 months ago

In-Context Language Learning: Architectures and Algorithms

Paper • 2401.12973 • Published Jan 23 • 4

upvoted a collection 10 months ago

Fin-RWKV-V1

Attention-free financial expert model - RWKV V4 • 6 items • Updated Feb 2 • 1

upvoted 5 papers about 1 year ago

StarCoder: may the source be with you!

Paper • 2305.06161 • Published May 9, 2023 • 29

MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models

Paper • 2310.11954 • Published Oct 18, 2023 • 24

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Paper • 2310.11511 • Published Oct 17, 2023 • 74

Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts

Paper • 2310.11784 • Published Oct 18, 2023 • 10

Context-Aware Meta-Learning

Paper • 2310.10971 • Published Oct 17, 2023 • 16