CBT-Bench: Evaluating Large Language Models on Assisting Cognitive Behavior Therapy Paper • 2410.13218 • Published 20 days ago • 4
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models Paper • 2410.10818 • Published 23 days ago • 14
Scaling Up Your Kernels: Large Kernel Design in ConvNets towards Universal Representations Paper • 2410.08049 • Published 27 days ago • 8
Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation Paper • 2410.05363 • Published 30 days ago • 44
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second Paper • 2410.02073 • Published Oct 2 • 40
Learning the Latent Rules of a Game from Data: A Chess Story Paper • 2410.02426 • Published Oct 3 • 5
Self-Supervised Any-Point Tracking by Contrastive Random Walks Paper • 2409.16288 • Published Sep 24 • 5
Evaluating Multiview Object Consistency in Humans and Image Models Paper • 2409.05862 • Published Sep 9 • 8