Computer Use Agent [SFT] – December 15, 2025
Collection
Localization, Tool Calling, and Navigation
•
5 items
•
Updated
•
2
Gliese-CUA-Tool-Call-8B is a Computer Use Agent (CUA) multimodal model based on Qwen2.5-VL-7B-Instruct, designed for GUI understanding, UI localization, and action execution across web, desktop, and mobile environments. It focuses on visual grounding, intent driven actioning, and UI based question answering (VQA), enabling reliable interaction with real world software interfaces. The model is optimized for agentic tool calling, producing structured actions that can be directly executed by downstream systems.
| File Name | Quant Type | File Size | File Link |
|---|---|---|---|
| Gliese-CUA-Tool-Call-8B.BF16.gguf | BF16 | 15.2 GB | Download |
| Gliese-CUA-Tool-Call-8B.F16.gguf | F16 | 15.2 GB | Download |
| Gliese-CUA-Tool-Call-8B.Q8_0.gguf | Q8_0 | 8.1 GB | Download |
| Gliese-CUA-Tool-Call-8B.mmproj-bf16.gguf | mmproj-bf16 | 1.36 GB | Download |
| Gliese-CUA-Tool-Call-8B.mmproj-f16.gguf | mmproj-f16 | 1.35 GB | Download |
| Gliese-CUA-Tool-Call-8B.mmproj-q8_0.gguf | mmproj-q8_0 | 856 MB | Download |
(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)
Here is a handy graph by ikawrakow comparing some lower-quality quant types (lower is better):
8-bit
16-bit
Base model
Qwen/Qwen2.5-VL-7B-Instruct