O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? (arXiv:2411.16489, published Nov 25, 2024)
O1 Replication Journey: A Strategic Progress Report -- Part 1 (arXiv:2410.18982, published Oct 8, 2024)
Test-Time Training with Self-Supervision for Generalization under Distribution Shifts (arXiv:1909.13231, published Sep 29, 2019)
The Surprising Effectiveness of Test-Time Training for Abstract Reasoning (arXiv:2411.07279, published Nov 11, 2024)
Simplifying, Stabilizing and Scaling Continuous-Time Consistency Models (arXiv:2410.11081, published Oct 14, 2024)
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context (arXiv:2403.05530, published Mar 8, 2024)
A Moral Imperative: The Need for Continual Superalignment of Large Language Models (arXiv:2403.14683, published Mar 13, 2024)
nGPT: Normalized Transformer with Representation Learning on the Hypersphere (arXiv:2410.01131, published Oct 1, 2024)
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models (arXiv:2410.05229, published Oct 7, 2024)
Aligning Machine and Human Visual Representations across Abstraction Levels (arXiv:2409.06509, published Sep 10, 2024)
Training Language Models to Self-Correct via Reinforcement Learning (arXiv:2409.12917, published Sep 19, 2024)
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters (arXiv:2408.03314, published Aug 6, 2024)
Large Language Monkeys: Scaling Inference Compute with Repeated Sampling (arXiv:2407.21787, published Jul 31, 2024)
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking (arXiv:2403.09629, published Mar 14, 2024)
Prompt Cache: Modular Attention Reuse for Low-Latency Inference (arXiv:2311.04934, published Nov 7, 2023)