Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation Paper • 2504.17207 • Published 5 days ago • 27
Step1X-Edit: A Practical Framework for General Image Editing Paper • 2504.17761 • Published 4 days ago • 78
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated 13 minutes ago • 453
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 600