Models
Datasets
Spaces
Posts
Docs
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2410.17243

📑 Trending Papers - October 🔟

Differential Transformer

Paper • 2410.05258 • Published Oct 7 • 165
Baichuan-Omni Technical Report

Paper • 2410.08565 • Published 28 days ago • 82
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published 16 days ago • 86
FrugalNeRF: Fast Convergence for Few-shot Novel View Synthesis without Learned Priors

Paper • 2410.16271 • Published 17 days ago • 80

about 18 hours ago

HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems

Paper • 2411.02959 • Published 3 days ago • 44
"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published 3 days ago • 41
CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation

Paper • 2410.23090 • Published 8 days ago • 52
RARe: Retrieval Augmented Retrieval with In-Context Examples

Paper • 2410.20088 • Published 13 days ago • 5

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published 16 days ago • 86

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published 16 days ago • 86

Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published 16 days ago • 86

MiniPLM: Knowledge Distillation for Pre-Training Language Models

Paper • 2410.17215 • Published 16 days ago • 12
LOGO -- Long cOntext aliGnment via efficient preference Optimization

Paper • 2410.18533 • Published 15 days ago • 42
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published 16 days ago • 86
LongReward: Improving Long-context Large Language Models with AI Feedback

Paper • 2410.21252 • Published 10 days ago • 16

The corresponding demos/checkpoints/papers/datasets of Inf-CL.

DAMO-NLP-SG/LiT-B-32_CC12M

Updated 18 days ago • 1
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published 16 days ago • 86

about Transformer

What Matters in Transformers? Not All Attention is Needed

Paper • 2406.15786 • Published Jun 22 • 27
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss

Paper • 2410.17243 • Published 16 days ago • 86

Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection

Paper • 2409.08513 • Published Sep 13 • 10
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale

Paper • 2409.08264 • Published Sep 12 • 43
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published Sep 18 • 73
LLMs + Persona-Plug = Personalized LLMs

Paper • 2409.11901 • Published Sep 18 • 30

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21 • 53
TableBench: A Comprehensive and Complex Benchmark for Table Question Answering

Paper • 2408.09174 • Published Aug 17 • 51
To Code, or Not To Code? Exploring Impact of Code in Pre-training

Paper • 2408.10914 • Published Aug 20 • 40
Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

Paper • 2408.11878 • Published Aug 20 • 50

Previous
1
2
Next

Company

© Hugging Face

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs