Hugging Face
Xizhou Zhu
Einsiedler
AI & ML interests
None yet
Recent Activity
Authored a paper (4 days ago):
VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models
Authored a paper (about 1 month ago):
Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy
Authored a paper (about 2 months ago):
VisualPRM: An Effective Process Reward Model for Multimodal Reasoning
Organizations
None yet
Papers
11
arXiv:2504.15279
arXiv:2503.19757
arXiv:2503.10291
arXiv:2501.07783
Models
0 (none public yet)
Datasets
0 (none public yet)