Bo Liu

Benjamin-eecs

https://benjamin-eecs.github.io/

AI & ML interests

Reinforcement Learning, Reasoning, Machine Learning Systems

Recent Activity

updated a model 6 days ago

Benjamin-eecs/Llama-3.1-8B-Instruct-NLRL-TicTacToe-Policy

updated a model 6 days ago

Benjamin-eecs/Llama-3.1-8B-Instruct-NLRL-TicTacToe-Value

updated a collection 6 days ago

Natural Language Reinforcement Learning

Benjamin-eecs's activity

updated 2 models 6 days ago

Benjamin-eecs/Llama-3.1-8B-Instruct-NLRL-TicTacToe-Policy

Feature Extraction • Updated 6 days ago • 13

Benjamin-eecs/Llama-3.1-8B-Instruct-NLRL-TicTacToe-Value

Feature Extraction • Updated 6 days ago • 14

updated a collection 6 days ago

Natural Language Reinforcement Learning

Collection

4 items • Updated 6 days ago

upvoted a paper 8 days ago

Natural Language Reinforcement Learning

Paper • 2411.14251 • Published 9 days ago • 25

authored a paper 9 days ago

Natural Language Reinforcement Learning

Paper • 2411.14251 • Published 9 days ago • 25

commented on a paper 9 days ago

Natural Language Reinforcement Learning

Paper • 2411.14251 • Published 9 days ago • 25

authored a paper 4 months ago

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Paper • 2408.08152 • Published Aug 15 • 52

updated 2 models 6 months ago

deepseek-ai/DeepSeek-V2-Chat

Text Generation • Updated Jun 8 • 1.3k • 444

deepseek-ai/DeepSeek-V2

Text Generation • Updated Jun 8 • 5.87k • 286

authored a paper 6 months ago

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Paper • 2405.04434 • Published May 7 • 14

upvoted a paper 6 months ago

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Paper • 2405.04434 • Published May 7 • 14

authored a paper 6 months ago

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Paper • 2405.14333 • Published May 23 • 35

New activity in deepseek-ai/DeepSeek-VL-7B 8 months ago

Update app.py

#1 opened 8 months ago by minhdang

New activity in deepseek-ai/deepseek-vl-7b-chat 8 months ago

abb

#8 opened 8 months ago by perpE

upvoted a paper 9 months ago

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Paper • 2403.05525 • Published Mar 8 • 39

authored a paper 9 months ago

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Paper • 2403.05525 • Published Mar 8 • 39

liked a Space 9 months ago

🐬 Chat with DeepSeek VL 7B

Space • Running on Zero • 275

authored a paper 11 months ago

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Paper • 2401.02954 • Published Jan 5 • 41