arxiv:2507.01006
Wenyi Hong
wenyi
AI & ML interests
multi-modal, pretrain
Recent Activity
upvoted a paper 3 days ago
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents submitted a paper 3 days ago
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents upvoted a paper about 1 month ago
Vision2Web: A Hierarchical Benchmark for Visual Website Development with Agent Verification