Apple proposes MAD-Bench benchmark to solve multi-modal large language model hallucination problem
Apple Research proposed the MAD-Bench benchmark to solve the problem of vulnerability of multi-modal large language models (MLLMs) in handling misleading information. This study consisted of 850 image-cue pairs and evaluated the ability of MLLMs to handle
2025-01-05