---
library_name: transformers
license: mit
datasets:
- array/SAT
---

# Model Card for Model ID

See https://github.com/arijitray1993/SAT for instructions on running inference with this model.

If you use this model, please cite:
```
@misc{ray2024satspatialaptitudetraining,
  title={SAT: Spatial Aptitude Training for Multimodal Language Models},
  author={Arijit Ray and Jiafei Duan and Reuben Tan and Dina Bashkirova and Rose Hendrix and Kiana Ehsani and Aniruddha Kembhavi and Bryan A. Plummer and Ranjay Krishna and Kuo-Hao Zeng and Kate Saenko},
  year={2024},
  eprint={2412.07755},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2412.07755},
}
```