VLM-RLAIF
Collection
Respository for ACL 2024 paper "Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI feedback"
โข
10 items
โข
Updated
This Hub repository contains a HuggingFace's transformers
implementation of VLM-RLAIF model of SNUMPR lab.