Ber666's picture

5 4

Ber666

SDSB

ber66666

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Offline Reinforcement Learning for LLM Multi-Step Reasoning

upvoted a paper 16 days ago

Training Large Language Models to Reason in a Continuous Latent Space

View all activity

Organizations

None yet

SDSB's activity

upvoted a paper 4 days ago

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published 6 days ago • 33

upvoted a paper 16 days ago

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published 17 days ago • 62

upvoted 2 papers 6 months ago

Flow of Reasoning: Efficient Training of LLM Policy with Divergent Thinking

Paper • 2406.05673 • Published Jun 9 • 3

Pandora: Towards General World Model with Natural Language Actions and Video States

Paper • 2406.09455 • Published Jun 12 • 15

liked a model 7 months ago

maitrix-org/Pandora

Updated Jun 18 • 61

upvoted a paper over 1 year ago

On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes

Paper • 2306.13649 • Published Jun 23, 2023 • 17

liked a model almost 2 years ago

facebook/opt-iml-30b

Text Generation • Updated Jan 24, 2023 • 772 • 74

liked 2 models about 2 years ago

google/flan-t5-xxl

Text2Text Generation • Updated Jul 27, 2023 • 822k • 1.22k

google/t5-xxl-lm-adapt

Text2Text Generation • Updated Jan 24, 2023 • 272 • 8