Diffusers Pipelines Library for Stable Diffusion

community

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

rizavelioglu authored a paper 7 days ago

Enhancing Person-to-Person Virtual Try-On with Multi-Garment Virtual Try-Off

Paper99 authored a paper 15 days ago

VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning

Paper99 authored a paper 19 days ago

OmniCaptioner: One Captioner to Rule Them All

View all activity

sd-diffusers-pipelines-library's activity

Paper99

authored a paper 15 days ago

VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning

Paper • 2504.07960 • Published 18 days ago • 46

Paper99

authored a paper 19 days ago

OmniCaptioner: One Captioner to Rule Them All

Paper • 2504.07089 • Published 19 days ago • 20

pcuenq

authored a paper 21 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 21 days ago • 176

Paper99

authored 2 papers about 1 month ago

LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis

Paper • 2503.21749 • Published Mar 27 • 26

Lumina-Image 2.0: A Unified and Efficient Image Generative Framework

Paper • 2503.21758 • Published Mar 27 • 21

susunghong

authored a paper about 1 month ago

MusicInfuser: Making Video Diffusion Listen and Dance

Paper • 2503.14505 • Published Mar 18 • 11

Paper99

authored 2 papers 2 months ago

Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT

Paper • 2502.06782 • Published Feb 10 • 14

K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs

Paper • 2502.18461 • Published Feb 25 • 15

gyrojeff

authored a paper 2 months ago

Hyperstroke: A Novel High-quality Stroke Representation for Assistive Artistic Drawing

Paper • 2408.09348 • Published Aug 18, 2024 • 1

thethomasboyer

authored a paper 3 months ago

PhenDiff: Revealing Invisible Phenotypes with Conditional Diffusion Models

Paper • 2312.08290 • Published Dec 13, 2023 • 2

Paper99

authored a paper 3 months ago

IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models

Paper • 2501.13920 • Published Jan 23 • 17

thomagram

authored a paper 4 months ago

Normalizing Flows are Capable Generative Models

Paper • 2412.06329 • Published Dec 9, 2024 • 9

sunovivid

authored a paper 5 months ago

Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance

Paper • 2403.17377 • Published Mar 26, 2024 • 2

thomagram

authored a paper 5 months ago

World-consistent Video Diffusion with Explicit 3D Modeling

Paper • 2412.01821 • Published Dec 2, 2024 • 4

noamrot

authored a paper 5 months ago

Pathways on the Image Manifold: Image Editing via Video Generation

Paper • 2411.16819 • Published Nov 25, 2024 • 37

annadeichler

authored 2 papers 5 months ago

Fake it to make it: Using synthetic data to remedy the data shortage in joint multimodal speech-and-gesture synthesis

Paper • 2404.19622 • Published Apr 30, 2024 • 2

MM-Conv: A Multi-modal Conversational Dataset for Virtual Humans

Paper • 2410.00253 • Published Sep 30, 2024

Paper99

authored a paper 7 months ago

Emu3: Next-Token Prediction is All You Need

Paper • 2409.18869 • Published Sep 27, 2024 • 95

AgainstEntropy

authored a paper 8 months ago

Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models

Paper • 2408.15518 • Published Aug 28, 2024 • 43

susunghong

authored a paper 9 months ago

Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention

Paper • 2408.00760 • Published Aug 1, 2024 • 7