AI models often give the right answers but point to the wrong sources
Leading AI models like GPT and Gemini frequently cite irrelevant text passages in document analyses, even when their answers are correct.

["Leading AI models, including GPT and Gemini, are capable of providing accurate answers to complex questions, but a new study reveals a concerning trend: these models often rely on text passages that do not actually support their responses. This phenomenon, dubbed 'attribution hallucination' by researchers at Peking University, poses significant risks in highly regulated fields such as law and medicine, where accuracy and reliability are paramount.", 'The researchers behind the study have developed a novel benchmark called CiteVQA, which systematically tests AI models for attribution hallucination. This benchmark is the first of its kind, and it aims to evaluate the ability of AI models to provide accurate citations for their answers.
According to the researchers, even when AI models produce correct answers, the cited evidence often turns out to be incorrect or irrelevant.', 'The implications of attribution hallucination are far-reaching, particularly in fields where professionals rely on accurate information to make informed decisions. In law, for instance, incorrect citations can lead to flawed arguments and unjust outcomes. Similarly, in medicine, reliance on inaccurate information can compromise patient care and safety.', 'The development of CiteVQA marks an important step towards addressing the issue of attribution hallucination in AI models.
By systematically evaluating the performance of leading AI models, researchers hope to raise awareness about this problem and drive the development of more accurate and reliable AI systems. As AI continues to play an increasingly important role in various industries, ensuring the accuracy and reliability of these systems is crucial.', "The study's findings serve as a reminder that AI models, no matter how advanced, are not infallible. While AI has the potential to revolutionize numerous fields, it is essential to approach its development and deployment with caution, prioritizing accuracy, reliability, and transparency."]
Source: The Decoder