view article Article CinePile 2.0 - making stronger datasets with adversarial refinement Oct 23, 2024 • 16
ZClip: Adaptive Spike Mitigation for LLM Pre-Training Paper • 2504.02507 • Published 25 days ago • 76
SANA-1.5 Collection SANA-1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer • 6 items • Updated 12 days ago • 4
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM Mar 12 • 403
view article Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models Jun 24, 2024 • 193
Remote VAE Inference Endpoints Collection Models and handler code used in https://huggingface.co./blog/remote_vae • 5 items • Updated Mar 10 • 4
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published Feb 20 • 143
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 32 items • Updated 25 days ago • 146