Complement Object Direct Examples

Mitigating Object Hallucination in Large Vision-Language Models via Visual Attention Direct Preference Optimization

Abstract: Large Vision-Language Models (LVLMs) suffer from severe object hallucinations, leading them to frequently generate outputs that do not correspond to the image content, significantly reducing ...

GitHub

Cut and Learn for Unsupervised Image & Video Object Detection and Instance Segmentation

We propose MaskCut approach to generate pseudo-masks for multiple objects in an image. CutLER can learn unsupervised object detectors and instance segmentors solely on ImageNet-1K. CutLER exhibits ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Mitigating Object Hallucination in Large Vision-Language Models via Visual Attention Direct Preference Optimization

Cut and Learn for Unsupervised Image & Video Object Detection and Instance Segmentation

Trending now