Category

AI Research

41 articles in this category

Mozilla's AI-Powered Pipeline Uncovers 271 Hidden Firefox Vulnerabilities
AI Research

Anthropic's Claude Mythos Preview AI model discovers 271 previously unknown security vulnerabilities in Firefox 150, including bugs dating back 20 years.

The Decoder
May 08, 2026·1 min read
Venom and Hot Peppers Offer a Key to Killing Resistant Bacteria
AI Research

Researchers from the National Autonomous University of Mexico develop new antibiotics from scorpion venom and habanero peppers to combat tuberculosis and reduce bacterial resistance.

Wired
May 08, 2026·1 min read
Anthropic Introduces Natural Language Autoencoders That Convert Claude’s Internal Activations Directly into Human-Readable Text Explanations
AI Research

Anthropic introduces Natural Language Autoencoders (NLAs), a technique that directly converts a model's internal activations into human-readable text explanations.

MarkTechPost
May 08, 2026·1 min read
The Human Touch: Can AI and Translators Coexist in Europe's Publishing Industry?
AI Research

The rise of AI technology has disrupted translation jobs in publishing, but human translators may still have a role to play.

The Guardian Technology
May 07, 2026·1 min read
The AI Jailbreakers: Uncovering the Dark Side of Chatbots
AI Research

Journalist Jamie Bartlett explores the world of AI jailbreaking, where individuals intentionally manipulate chatbots to reveal their vulnerabilities and improve safety.

The Guardian Technology
May 07, 2026·1 min read
Anthropic Unveils 'Dreaming' AI System That Lets Agents Learn From Their Own Mistakes
AI Research

Anthropic introduces 'dreaming,' a system that enables AI agents to learn from past sessions and improve over time, along with updates to its Claude Managed Agents platform.

VentureBeat
May 07, 2026·1 min read
UK schools urged to remove pupil photos over AI blackmail threat
AI Research

Experts warn UK schools to remove online photos of pupils' faces due to rising threat of AI-generated sexually explicit images being used for blackmail.

The Guardian Technology
May 07, 2026·1 min read
How Sakana Trained a 7B Model to Orchestrate GPT, Claude, and Gemini LLMs
AI Research

Sakana AI's RL Conductor, a small language model trained via reinforcement learning, automatically orchestrates a diverse pool of worker LLMs to achieve state-of-the-art results on difficult reasoning and coding benchmarks.

VentureBeat
May 07, 2026·1 min read
Mozilla Verifies 271 Vulnerabilities with AI-Powered Mythos Model
AI Research

Mozilla's use of Anthropic's Mythos AI model yields 271 verified Firefox vulnerabilities with minimal false positives.

Ars Technica
May 07, 2026·1 min read
Europe's answer to AI regulation complexity is to just delay most of it
AI Research

The EU has agreed on simplified AI rules, easing requirements for small businesses and delaying deadlines for high-risk AI.

The Decoder
May 07, 2026·1 min read
Meta AI Releases NeuralBench: A Unified Open-Source Framework to Benchmark NeuroAI Models Across 36 EEG Tasks and 94 Datasets
AI Research

Meta AI has released NeuralBench, a unified, open-source framework for benchmarking AI models of brain activity across 36 EEG tasks and 94 datasets.

MarkTechPost
May 07, 2026·1 min read
Why AI breaks without context — and how to fix it
AI Research

The gap between what AI promises and what it delivers is not subtle, and the issue often lies not with the model, but with the context in which it's deployed.

VentureBeat
May 07, 2026·1 min read
Correctness Before Corrections: The vLLM V0 to V1 Migration
AI Research

PipelineRL's vLLM inference engine upgrade from V0 to V1 required fixing backend behavior to match training dynamics.

Hugging Face
May 06, 2026·1 min read
Building an agentic AI strategy that pays off - without risking business failure
AI Research

As AI strategy task forces present executives with daring options, experts warn that a solid strategy is crucial to avoid failure and unlock $3 trillion in annual productivity gains.

ZDNet
May 04, 2026·1 min read
Tailoring AI Solutions for Health Care Needs
AI Research

The AI market is full of big promises of grand transformation.

MIT Technology Review
May 04, 2026·1 min read
OpenAI says human attention is the bottleneck, so it built a system to let agents manage themselves
AI Research

OpenAI introduces Symphony, a system that enables AI agents to manage themselves, eliminating the need for human oversight in coding workflows.

The Decoder
May 04, 2026·1 min read
Flaws in Kenya’s AI-driven health reforms driving up costs for the poorest
AI Research

An investigation finds Kenya's AI-driven healthcare system favours the rich, contradicting President William Ruto's promise of universal access.

The Guardian Technology
May 04, 2026·1 min read
A Developer's Guide to Systematic Prompting: Mastering Negative Constraints, Structured JSON Outputs, and Multi-Hypothesis Verbalized Sampling
AI Research

Developers are formalizing prompting techniques to address specific failure modes in large language models, ensuring reliability and consistency in production systems.

MarkTechPost
May 03, 2026·1 min read
Sakana AI Introduces KAME: A Tandem Speech-to-Speech Architecture That Injects LLM Knowledge in Real Time
AI Research

Sakana AI's KAME architecture combines the speed of direct speech-to-speech models with the knowledge of large language models, achieving near-zero latency and improved conversational AI.

MarkTechPost
May 03, 2026·1 min read
What is Tokenization Drift and How to Fix It?
AI Research

Tokenization drift occurs when small changes in input formatting cause unpredictable shifts in model behavior, and it can be addressed through automated prompt optimization.

MarkTechPost
May 03, 2026·1 min read
Research roundup: 6 cool science stories we almost missed
AI Research

Every month, a handful of fascinating scientific stories slip through the cracks – here are six that nearly went unnoticed.

Ars Technica
May 02, 2026·1 min read
200,000 MCP servers expose a command execution flaw that Anthropic calls a feature
AI Research

A security flaw in the Model Context Protocol (MCP) affects 200,000 servers, allowing for arbitrary command execution due to insecure default settings.

VentureBeat
May 01, 2026·1 min read
Operationalizing AI for Scale and Sovereignty
AI Research

Companies are taking control of their own data to tailor AI for their needs, balancing ownership with safe and trusted data flow.

MIT Technology Review
May 01, 2026·1 min read
The Download: a new Christian phone network, and debugging LLMs
AI Research

A new Christian phone network launches with strict content controls, while a startup aims to make AI model development more transparent and controllable.

MIT Technology Review
May 01, 2026·1 min read
With $1 Cyberattacks on the Rise, Durable Defenses Pay Off
AI Research

As generative AI makes cyberattacks cheaper and more accessible, robust defenses are crucial to prevent vulnerabilities from being exploited.

IEEE Spectrum
Apr 30, 2026·1 min read
The AI Evaluation Bottleneck: How Cost is Redefining the Field
AI Research

The cost of evaluating AI models has skyrocketed, making it a new bottleneck in the field, with some evaluations costing tens of thousands of dollars.

Hugging Face
Apr 29, 2026·1 min read
Granite 4.1 LLMs: A Technical Walkthrough of Data Engineering and Training
AI Research

A detailed look at the data engineering, pre-training, supervised fine-tuning, and reinforcement learning behind the Granite 4.1 LLMs.

Hugging Face
Apr 29, 2026·1 min read
The Download: Storing Nuclear Waste and Orchestrating Agents
AI Research

Today's tech news: nuclear waste storage becomes urgent as nuclear energy gains traction, and AI agents are being developed to work together to tackle complex tasks.

MIT Technology Review
Apr 29, 2026·1 min read
Meet the AI Jailbreakers: Hackers Pushing Chatbots to the Dark Side
AI Research

Hackers are manipulating AI chatbots into breaking their own safety rules, both to test the chatbots' security and to push the limits of what the models can do.

The Guardian Technology
Apr 29, 2026·1 min read
How to build custom reasoning agents with a fraction of the compute
AI Research

A new training paradigm called Reinforcement Learning with Verifiable Rewards with Self-Distillation (RLSD) allows enterprises to build custom reasoning models at a lower cost.

VentureBeat
Apr 28, 2026·1 min read
Over 80% of US government agencies already use AI agents - and it's only the beginning
AI Research

More than 80% of US government agencies have already adopted AI agents, with many planning to increase their use in the coming years, according to a new IDC study.

ZDNet
Apr 28, 2026·1 min read
Researchers find AI text is making the internet more uniform and weirdly cheerful
AI Research

A large-scale analysis of websites from the Internet Archive reveals the pervasive influence of AI-generated text on the web.

The Decoder
Apr 28, 2026·1 min read
The Human Touch: Zine Creators Push Back Against AI Influence
AI Research

Zine artists and writers argue that the handmade nature of self-published booklets is incompatible with artificial intelligence.

The Guardian Technology
Apr 28, 2026·1 min read
The missing step between hype and profit
AI Research

The development of AI has reached a crucial point where companies have built the technology and promised transformation, but the path to get there remains unclear.

MIT Technology Review
Apr 27, 2026·1 min read
Rebuilding the Data Stack for AI
AI Research

Many enterprises are discovering that the biggest obstacle to meaningful AI adoption is the state of their data infrastructure.

MIT Technology Review
Apr 27, 2026·1 min read
Engineering Collisions: How NYU Is Remaking Health Research
AI Research

New York University's Institute for Engineering Health is revolutionizing the approach to health research by assembling teams from various disciplines to tackle specific disease states.

IEEE Spectrum
Apr 27, 2026·1 min read
Anthropic Tests AI-Powered Marketplace with Real-World Transactions
AI Research

Anthropic's experimental marketplace, Project Deal, enables AI agents to buy and sell goods for real money, yielding surprising results on agent performance.

TechCrunch
Apr 26, 2026·1 min read
AI-Powered Cybercrime on the Rise: Supercharged Scams and AI Healthcare
AI Research

Cybercriminals' growing use of AI is driving a surge in sophisticated scams and phishing attacks. AI is also being used in healthcare to improve patient outcomes, though its effectiveness there remains uncertain.

MIT Technology Review
Apr 24, 2026·1 min read
Rocket Report: Artemis III Rocket Prepares for Launch; SpaceX Expands into AI
AI Research

The Rocket Report returns with updates on the Artemis III rocket, SpaceX's AI ambitions, and controversy surrounding Canada's spaceport plans.

Ars Technica
Apr 24, 2026·1 min read
Health-care AI is here. We don’t know if it actually helps patients.
AI Research

I don’t need to tell you that AI is everywhere.

MIT Technology Review
Apr 24, 2026·1 min read
QIMMA قِمّة ⛰: A Quality-First Arabic LLM Leaderboard
AI Research

QIMMA validates benchmarks before evaluating models, ensuring reported scores reflect genuine Arabic language capability in LLMs.

Hugging Face
Apr 21, 2026·1 min read