Even Small Reasoners Should Quote Their Sources: Introducing the Pleias-RAG Model Family Paper • 2504.18225 • Published 4 days ago • 4 • 2
Towards Understanding Camera Motions in Any Video Paper • 2504.15376 • Published 8 days ago • 131 • 2
DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency Paper • 2504.12080 • Published 13 days ago • 7 • 2
Subject-driven Video Generation via Disentangled Identity and Motion Paper • 2504.17816 • Published 6 days ago • 10 • 2
VideoVista-CulturalLingo: 360$^\circ$ Horizons-Bridging Cultures, Languages, and Domains in Video Comprehension Paper • 2504.17821 • Published 6 days ago • 20 • 2
DianJin-R1: Evaluating and Enhancing Financial Reasoning in Large Language Models Paper • 2504.15716 • Published 7 days ago • 7 • 2
Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning Paper • 2504.16656 • Published 6 days ago • 45 • 2
Can Large Language Models Help Multimodal Language Analysis? MMLA: A Comprehensive Benchmark Paper • 2504.16427 • Published 6 days ago • 11 • 2
BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs Paper • 2504.18415 • Published 4 days ago • 31 • 2
The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs Paper • 2504.17768 • Published 5 days ago • 9 • 3
Optimizing LLMs for Italian: Reducing Token Fertility and Enhancing Efficiency Through Vocabulary Adaptation Paper • 2504.17025 • Published 6 days ago • 3 • 1