Image-to-Text
Transformers
PyTorch
phi3_v
text-generation
latex
custom_code

Model Summary

Cephalo is a series of multimodal materials science focused vision large language models (V-LLMs) designed to integrate visual and linguistic data for advanced understanding and interaction in human-AI or multi-agent AI frameworks.

image/png

Model Capabilities

This version of Cephalo, lamm-mit/Cephalo-LaTeX-Phi-3-vision-128k-4b-alpha, is trained to convert images of equations to LaTeX code.

Downloads last month
29
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API does not yet support model repos that contain custom code.

Datasets used to train lamm-mit/Cephalo-LaTeX-Phi-3-vision-128k-4b-alpha

Collection including lamm-mit/Cephalo-LaTeX-Phi-3-vision-128k-4b-alpha