Last updated 11 days ago
A natural idea is to threshold based detection over CLIP score.
Two Sided thresholding
CIFAR 100
CLIP Score
99.94
95.71
97.78
95.70
PQ Score
99.93
96.71
98.30
96.67
CLIPScore: A Reference-free Evaluation Metric for Image Captioning
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation