The Blind Spot of AI Search Agents: Confirmation Over Research
Top AI search agents like GPT-5.4 and Kimi K2.6 rely on prior knowledge rather than actual web research, according to a new study.

Leading AI search agents, such as GPT-5.4 and Kimi K2.6, are not living up to their potential as research tools. Instead of scouring the web for new information, they often rely on what they already know, using the web to confirm pre-existing knowledge rather than conducting genuine research. This is the finding of a recent study conducted by researchers at the Harbin Institute of Technology.
The researchers developed a new benchmark called LiveBrowseComp, which tests AI models on events from the last 90 days. This approach allows for a more accurate assessment of a model's ability to perform actual research, rather than simply drawing on its training data. The results were striking: once the models were unable to fall back on their prior knowledge, their performance faltered significantly.
The existing rankings of AI search agents were reshuffled when evaluated using the LiveBrowseComp benchmark, suggesting that the current leaders may not be as effective as they seem. According to the researchers, this highlights a significant blind spot in the development of AI search agents, which are increasingly being relied upon for information gathering and research tasks. The study's findings have significant implications for the development of AI search agents, highlighting the need for more advanced models that can truly learn and adapt in real-time.
As it stands, the current crop of AI search agents appears to be more focused on confirmation than discovery. The article, which first appeared on The Decoder, underscores the importance of continued research and development in the field of AI, with a focus on creating more effective and efficient search tools. By acknowledging the limitations of current AI search agents, researchers can work towards creating more sophisticated models that are capable of genuine research and discovery.
Source: The Decoder