Multiple tags do not work

by mhassanch - opened 7 days ago

For the same input image:

caption1 = "a cat. a remote control."
→ Only detects cat; the confidence score for remote control is very low.
caption2 = "a remote control. a cat."
→ Only detects remote control; the confidence score for cat becomes very low.

This behavior suggests that the ONNX model only gives high confidence to the first object in the caption, regardless of what's actually present in the image.
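
To make the report easier to reproduce, here is a minimal sketch of an order-sensitivity check. It is not tied to any particular runtime: `score_fn` is a hypothetical callable you would supply yourself, wrapping your ONNX inference and returning the best confidence per phrase for a given caption. The 0.3 threshold is also just an assumption; use whatever box threshold you normally apply.

```python
from itertools import permutations
from typing import Callable, Dict, List

# Hypothetical signature: caption string -> {phrase: best confidence score}.
ScoreFn = Callable[[str], Dict[str, float]]


def build_caption(phrases: List[str]) -> str:
    """Join phrases into one caption, each phrase terminated by a period."""
    return " ".join(f"{p.strip().rstrip('.')}." for p in phrases)


def order_sensitivity(phrases: List[str], score_fn: ScoreFn) -> Dict[str, Dict[str, float]]:
    """Score every ordering of the phrases and collect the results per caption."""
    results = {}
    for order in permutations(phrases):
        caption = build_caption(list(order))
        results[caption] = score_fn(caption)
    return results


def first_phrase_bias(results: Dict[str, Dict[str, float]], threshold: float = 0.3) -> bool:
    """Return True if, for every ordering, only the first phrase reaches the threshold."""
    for caption, scores in results.items():
        ordered = [p.strip() for p in caption.split(".") if p.strip()]
        passed = [p for p in ordered if scores.get(p, 0.0) >= threshold]
        if passed != ordered[:1]:
            return False
    return True
```

Calling `order_sensitivity(["a cat", "a remote control"], score_fn)` on the same image and passing the result to `first_phrase_bias` should return `True` if the behavior described above is present, and `False` if both objects are detected in at least one ordering.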
