AI Research

41 articles in this category

Mozilla's AI-Powered Pipeline Uncovers 271 Hidden Firefox Vulnerabilities

Anthropic's Claude Mythos Preview AI model discovers 271 previously unknown security vulnerabilities in Firefox 150, including bugs dating back 20 years.

The Decoder

May 08, 2026·1 min read

AI Research

Venom and Hot Peppers Offer a Key to Killing Resistant Bacteria

Researchers from the National Autonomous University of Mexico develop new antibiotics from scorpion venom and habanero peppers to combat tuberculosis and reduce bacterial resistance.

Wired

May 08, 2026·1 min read

AI Research

Anthropic Introduces Natural Language Autoencoders That Convert Claude’s Internal Activations Directly into Human-Readable Text Explanations

Anthropic introduces Natural Language Autoencoders (NLAs), a technique that directly converts a model's internal activations into human-readable text explanations.

MarkTechPost

May 08, 2026·1 min read

AI Research

The Human Touch: Can AI and Translators Coexist in Europe's Publishing Industry?

The rise of AI technology has disrupted translation jobs in publishing, but human translators may still have a role to play.

The Guardian Technology

May 07, 2026·1 min read

AI Research

The AI Jailbreakers: Uncovering the Dark Side of Chatbots

Journalist Jamie Bartlett explores the world of AI jailbreaking, where individuals intentionally manipulate chatbots to reveal their vulnerabilities and improve safety.

The Guardian Technology

May 07, 2026·1 min read

AI Research

Anthropic Unveils 'Dreaming' AI System That Lets Agents Learn From Their Own Mistakes

Anthropic introduces 'dreaming,' a system that enables AI agents to learn from past sessions and improve over time, along with updates to its Claude Managed Agents platform.

VentureBeat

May 07, 2026·1 min read

AI Research

UK schools urged to remove pupil photos over AI blackmail threat

Experts warn UK schools to remove online photos of pupils' faces due to rising threat of AI-generated sexually explicit images being used for blackmail.

The Guardian Technology

May 07, 2026·1 min read

AI Research

How Sakana Trained a 7B Model to Orchestrate GPT, Claude, and Gemini LLMs

Sakana AI's RL Conductor, a small language model trained via reinforcement learning, automatically orchestrates a diverse pool of worker LLMs to achieve state-of-the-art results on difficult reasoning and coding benchmarks.

VentureBeat

May 07, 2026·1 min read

AI Research

Mozilla Verifies 271 Vulnerabilities with AI-Powered Mythos Model

Mozilla's use of Anthropic's Mythos AI model yields 271 verified Firefox vulnerabilities with minimal false positives.

Ars Technica

May 07, 2026·1 min read

AI Research

Europe's answer to AI regulation complexity is to just delay most of it

The EU has agreed on simplified AI rules, easing requirements for small businesses and delaying deadlines for high-risk AI.

The Decoder

May 07, 2026·1 min read

AI Research

Meta AI Releases NeuralBench: A Unified Open-Source Framework to Benchmark NeuroAI Models Across 36 EEG Tasks and 94 Datasets

Meta AI has released NeuralBench, a unified, open-source framework for benchmarking AI models of brain activity across 36 EEG tasks and 94 datasets.

MarkTechPost

May 07, 2026·1 min read

AI Research

Why AI breaks without context — and how to fix it

The gap between what AI promises and what it delivers is not subtle, and the issue often lies not with the model, but with the context in which it's deployed.

VentureBeat

May 07, 2026·1 min read

AI Research

Correctness Before Corrections: The vLLM V0 to V1 Migration

PipelineRL's vLLM inference engine upgrade from V0 to V1 required fixing backend behavior to match training dynamics.

Hugging Face

May 06, 2026·1 min read

AI Research

Building an agentic AI strategy that pays off - without risking business failure

As AI strategy task forces present executives with daring options, experts warn that a solid strategy is crucial to avoid failure and unlock $3 trillion in annual productivity gains.

ZDNet

May 04, 2026·1 min read

AI Research

Tailoring AI Solutions for Health Care Needs

The AI market is full of big promises of grand transformation.

MIT Technology Review

May 04, 2026·1 min read

AI Research

OpenAI says human attention is the bottleneck, so it built a system to let agents manage themselves

OpenAI introduces Symphony, a system that enables AI agents to manage themselves, eliminating the need for human oversight in coding workflows.

The Decoder

May 04, 2026·1 min read

AI Research

Flaws in Kenya’s AI-driven health reforms driving up costs for the poorest

An investigation finds Kenya's AI-driven healthcare system favours the rich, contradicting President William Ruto's promise of universal access.

The Guardian Technology

May 04, 2026·1 min read

AI Research

A Developer's Guide to Systematic Prompting: Mastering Negative Constraints, Structured JSON Outputs, and Multi-Hypothesis Verbalized Sampling

Developers are formalizing prompting techniques to address specific failure modes in large language models, ensuring reliability and consistency in production systems.

MarkTechPost

May 03, 2026·1 min read

AI Research

Sakana AI Introduces KAME: A Tandem Speech-to-Speech Architecture That Injects LLM Knowledge in Real Time

Sakana AI's KAME architecture combines the speed of direct speech-to-speech models with the knowledge of large language models, achieving near-zero latency and improved conversational AI.

MarkTechPost

May 03, 2026·1 min read

AI Research

What is Tokenization Drift and How to Fix It?

Tokenization drift occurs when small changes in input formatting cause unpredictable shifts in model behavior, and it can be addressed through automated prompt optimization.

MarkTechPost

May 03, 2026·1 min read

AI Research

Research roundup: 6 cool science stories we almost missed

Every month, a handful of fascinating scientific stories slip through the cracks – here are six that nearly went unnoticed.

Ars Technica

May 02, 2026·1 min read

AI Research

200,000 MCP servers expose a command execution flaw that Anthropic calls a feature

A security flaw in the Model Context Protocol (MCP) affects 200,000 servers, allowing for arbitrary command execution due to insecure default settings.

VentureBeat

May 01, 2026·1 min read

AI Research

Operationalizing AI for Scale and Sovereignty

Companies are taking control of their own data to tailor AI for their needs, balancing ownership with safe and trusted data flow.

MIT Technology Review

May 01, 2026·1 min read

AI Research

The Download: a new Christian phone network, and debugging LLMs

A new Christian phone network launches with strict content controls, while a startup aims to make AI model development more transparent and controllable.

MIT Technology Review

May 01, 2026·1 min read

AI Research

With $1 Cyberattacks on the Rise, Durable Defenses Pay Off

As generative AI makes cyberattacks cheaper and more accessible, robust defenses are crucial to prevent vulnerabilities from being exploited.

IEEE Spectrum

Apr 30, 2026·1 min read

AI Research

The AI Evaluation Bottleneck: How Cost is Redefining the Field

The cost of evaluating AI models has skyrocketed, making it a new bottleneck in the field, with some evaluations costing tens of thousands of dollars.

Hugging Face

Apr 29, 2026·1 min read

AI Research

Granite 4.1 LLMs: A Technical Walkthrough of Data Engineering and Training

A detailed look at the data engineering, pre-training, supervised fine-tuning, and reinforcement learning behind the Granite 4.1 LLMs.

Hugging Face

Apr 29, 2026·1 min read

AI Research

The Download: Storing Nuclear Waste and Orchestrating Agents

Today's tech news: nuclear waste storage becomes urgent as nuclear energy gains traction, and AI agents are being developed to work together to tackle complex tasks.

MIT Technology Review

Apr 29, 2026·1 min read

AI Research

Meet the AI Jailbreakers: Hackers Pushing Chatbots to the Dark Side

Hackers are manipulating AI chatbots into breaking their own safety rules to test their security and push the boundaries of what they can do.

The Guardian Technology

Apr 29, 2026·1 min read

$How to build custom reasoning agents with a fraction of the compute$

AI Research

How to build custom reasoning agents with a fraction of the compute

A new training paradigm called Reinforcement Learning with Verifiable Rewards with Self-Distillation (RLSD) allows enterprises to build custom reasoning models at a lower cost.

VentureBeat

Apr 28, 2026·1 min read

AI Research

Over 80% of US government agencies already use AI agents - and it's only the beginning

More than 80% of US government agencies have already adopted AI agents, with many planning to increase their use in the coming years, according to a new IDC study.

ZDNet

Apr 28, 2026·1 min read

AI Research

Researchers find AI text is making the internet more uniform and weirdly cheerful

A large-scale analysis of websites from the Internet Archive reveals the pervasive influence of AI-generated text on the web.

The Decoder

Apr 28, 2026·1 min read

AI Research

The Human Touch: Zine Creators Push Back Against AI Influence

Zine artists and writers argue that the handmade nature of self-published booklets is incompatible with artificial intelligence.

The Guardian Technology

Apr 28, 2026·1 min read

AI Research

The missing step between hype and profit

The development of AI has reached a crucial point where companies have built the technology and promised transformation, but the path to get there remains unclear.

MIT Technology Review

Apr 27, 2026·1 min read

AI Research

Rebuilding the Data Stack for AI

Many enterprises are discovering that the biggest obstacle to meaningful AI adoption is the state of their data infrastructure.

MIT Technology Review

Apr 27, 2026·1 min read

AI Research

Engineering Collisions: How NYU Is Remaking Health Research

New York University's Institute for Engineering Health is revolutionizing the approach to health research by assembling teams from various disciplines to tackle specific disease states.

IEEE Spectrum

Apr 27, 2026·1 min read

AI Research

Anthropic Tests AI-Powered Marketplace with Real-World Transactions

Anthropic's experimental marketplace, Project Deal, enables AI agents to buy and sell goods for real money, yielding surprising results on agent performance.

TechCrunch

Apr 26, 2026·1 min read

AI Research

AI-Powered Cybercrime on the Rise: Supercharged Scams and AI Healthcare

The increasing use of AI by cybercriminals is leading to a surge in sophisticated scams and phishing attacks, while AI is also being used in healthcare to improve patient outcomes, but its effectiveness is still uncertain.

MIT Technology Review

Apr 24, 2026·1 min read