Dong Hai Phuong Nguyen
phuong-d-h-nguyen
AI & ML interests
LLM, RL, CV
Recent Activity
liked a dataset 4 days ago
data-is-better-together/10k_prompts_ranked
liked a model 20 days ago
google/paligemma-3b-ft-ocrvqa-896-jax
updated a collection 26 days ago
CoT
Collections
9
PERL: Parameter Efficient Reinforcement Learning from Human Feedback
Paper • 2403.10704 • Published • 57
HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
Paper • 2403.13447 • Published • 18
Self-Discover: Large Language Models Self-Compose Reasoning Structures
Paper • 2402.03620 • Published • 109
RAFT: Adapting Language Model to Domain Specific RAG
Paper • 2403.10131 • Published • 67
spaces
1
models
None public yet
datasets
None public yet