-
Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement
Paper • 2411.06558 • Published • 34 -
SlimLM: An Efficient Small Language Model for On-Device Document Assistance
Paper • 2411.09944 • Published • 12 -
Look Every Frame All at Once: Video-Ma^2mba for Efficient Long-form Video Understanding with Multi-Axis Gradient Checkpointing
Paper • 2411.19460 • Published • 10 -
MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale
Paper • 2412.05237 • Published • 47
Siyeol Kim
ississssi
·
AI & ML interests
None yet
Recent Activity
updated
a collection
5 days ago
Interestings
updated
a collection
14 days ago
Interestings
updated
a collection
25 days ago
Interestings
Organizations
None yet
Collections
1
models
None public yet
datasets
None public yet