Model and data for ReflectiVA: Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering [CVPR 2025]
Federico Cocchi
fede97
AI & ML interests
Multimodal LLM - Computer Vision
Recent Activity
updated
a model
5 days ago
aimagelab/LLaVA_MORE-llama_3_1-8B-S2-siglip-finetuning
updated
a model
5 days ago
aimagelab/LLaVA_MORE-llama_3_1-8B-S2-siglip-pretrain
updated
a model
5 days ago
aimagelab/LLaVA_MORE-llama_3_1-8B-S2-finetuning
Organizations
Collections
5
models
0
None public yet
datasets
5
fede97/external_test_set_v1
Viewer
•
Updated
•
340
•
43
fede97/external_data_test_example_v3
Updated
•
3
fede97/external_data_test_example
Viewer
•
Updated
•
410
•
82
fede97/external_data_test_example_v2
Viewer
•
Updated
•
410
•
104
fede97/dpo_demo
Viewer
•
Updated
•
148k
•
91