AuroraCap - a Reself Collection

Reself's Collections

updated 1 day ago

A Detailed Captioning Baseline and Benchmark for Video

Reself/AuroraCap-7B-IMG

Updated 1 day ago • 2 • 2

Note: The image caption model offers a better performance-cost trade-off.

Reself/AuroraCap-7B-VID

Updated about 19 hours ago • 1 • 2

Note: The video caption model offers a better performance-cost trade-off.

Reself/Video-Detailed-Caption

Updated about 8 hours ago

Note: The VDC benchmark contains 1,027 videos with captions averaging over 500 words.

Reself/AuroraCap-trainset

Updated 3 days ago

Note: A collection of over 20M image and video samples for AuroraCap training, pre-tokenized for Vicuna and Llama-3.