Image-to-Text
Transformers
PyTorch
phi3_v
text-generation
latex
custom_code
Edit model card

Model Summary

Cephalo is a series of multimodal materials science focused vision large language models (V-LLMs) designed to integrate visual and linguistic data for advanced understanding and interaction in human-AI or multi-agent AI frameworks.

image/png

Model Capabilities

This version of Cephalo, lamm-mit/Cephalo-LaTeX-Phi-3-vision-128k-4b-alpha, is trained to convert images of equations to LaTeX code.

Downloads last month
42
Inference Examples
Inference API (serverless) does not yet support model repos that contain custom code.

Datasets used to train lamm-mit/Cephalo-LaTeX-Phi-3-vision-128k-4b-alpha

Collection including lamm-mit/Cephalo-LaTeX-Phi-3-vision-128k-4b-alpha