Category

AI Research

60 articles in this category

Building Supervised Fine-Tuning Data from NVIDIA Open-SWE-Traces: Trajectory Parsing, Patch Analysis, Token Budgets, and Tool-Use Metrics

In this tutorial, we explore the Open-SWE-Traces dataset as a practical resource for studying and preparing agentic software-engineering trajectories for fine-tuning.

MarkTechPost
Jun 27, 2026·1 min read
Cursor Study Finds Reward Hacking Inflates Coding-Agent Benchmark Scores on SWE-bench Pro

A new Cursor study reports that newer coding agents often retrieve known fixes instead of deriving them, inflating popular benchmark scores.

MarkTechPost
Jun 26, 2026·1 min read
New MRAgent Framework Uses 118K Tokens Per Query, Outperforms LangMem

MRAgent framework reduces token consumption and runtime costs for long-horizon reasoning tasks in AI agents.

VentureBeat
Jun 26, 2026·1 min read
OpenAI limits GPT-5.6 rollout after US government request

OpenAI limits the GPT-5.6 release to a small group of trusted partners at the US government's behest.

TechCrunch
Jun 26, 2026·1 min read
Autonomous Security Agents Require Complete Data: Is Yours Ready?

Incomplete data hampers autonomous security agents; experts stress verifying endpoint coverage before deployment.

VentureBeat
Jun 26, 2026·1 min read
Anthropic's Mythos Models Remain Offline Amid Ongoing Talks

Anthropic's Mythos-class models stay offline two weeks after Trump administration ultimatum.

The Verge
Jun 26, 2026·1 min read
Heat waves mess with your brain. Scientists are trying to figure out why.

It’s been hot in London this week.

MIT Technology Review
Jun 26, 2026·1 min read
Anthropic Accuses Alibaba of Largest Claude Cloning Attack

Anthropic alleges Alibaba launched largest attack to clone AI model Claude, violating terms of service and access restrictions.

Ars Technica
Jun 25, 2026·1 min read
Databricks’ former AI chief thinks he can cut AI’s power bill by 1,000x

The drive to discover the next big thing in AI has funded some pretty ambitious projects — but one company is taking it as a chance to rebuild computing architecture from the ground up.

TechCrunch
Jun 25, 2026·1 min read
Major AI chatbots skew left on political questions, even 'anti-woke' models

Most major AI chatbots lean left on political questions, according to a Washington Post investigation.

The Decoder
Jun 25, 2026·1 min read
Insurers turn to generative AI for catastrophe modeling

Insurers use diffusion models to generate plausible weather events for risk assessments, but face warnings about hallucinations.

The Decoder
Jun 25, 2026·1 min read
US Government and Colossal Biosciences Partner to Sequence Endangered Species Genomes

The US government and Colossal Biosciences partner to sequence genomes of all species on the Endangered Species list.

Ars Technica
Jun 25, 2026·1 min read
What it Means to Be a Mathematician When AI Does the Math

In the mid-noughties, when music by the Killers and Franz Ferdinand blared out of every pub and nightclub I passed, I spent my days and nights struggling through a Ph.D.

IEEE Spectrum
Jun 25, 2026·1 min read
Baidu Releases Unlimited OCR, a 3B Model That Keeps the KV Cache Flat for Long-Document Parsing

Most end-to-end OCR models slow down as output grows.

MarkTechPost
Jun 25, 2026·1 min read
AI wasn't supposed to kill engineering jobs, but data suggests they're thriving

New data contradicts the notion that AI is replacing engineering jobs, instead showing they're resilient and in high demand.

TechCrunch
Jun 24, 2026·1 min read
Top AI Researchers Leave Google for Rivals

AI researchers Jonas Adler and Alexander Pritzel leave Google for Anthropic, continuing a trend of top talent departing for rival companies.

TechCrunch
Jun 24, 2026·1 min read
Congresswoman Denies AI Involvement in Defense Funding Amendment

Rep. Anna Paulina Luna says staff used AI for 'spellcheck' in amendment summary, but denies AI wrote bill text.

The Verge
Jun 24, 2026·1 min read
Alibaba's Qwen-AgentWorld Improves Agent Performance Across Seven Benchmarks

Alibaba's Qwen team releases Qwen-AgentWorld, two models trained to predict environment returns, boosting agent performance in seven domains.

VentureBeat
Jun 24, 2026·1 min read
How to Design an OpenHarness Style Agent Runtime with Tools, Memory, Permissions, Skills, and Multi-Agent Coordination

In this tutorial, we build OpenHarness from scratch to better understand how a practical agent harness works.

MarkTechPost
Jun 24, 2026·1 min read
Xiaomi's HarnessX Boosts AI Performance by Evolving Software Scaffolding

Xiaomi's HarnessX framework autonomously improves AI software scaffolding, yielding significant performance gains across domains.

VentureBeat
Jun 24, 2026·1 min read
Stanford Researchers' Agentic AI Scientists Set to Reshape Drug Discovery

Stanford researchers develop agentic AI 'scientists' to streamline drug discovery, reducing inefficiency and high failure rates.

VentureBeat
Jun 24, 2026·1 min read
Amazon to present framework for engineering trustworthy AI agents at VB Transform 2026

Amazon's AGI autonomy lab develops framework for trustworthy AI agents, focusing on consistency, robustness, and safety.

VentureBeat
Jun 24, 2026·1 min read
AI helps read ancient papyrus scroll burnt by Vesuvius eruption

AI helps read 2,000-year-old charred scroll, revealing stoic philosophy on ethics, art, and human behavior.

The Guardian Technology
Jun 24, 2026·1 min read
AI Is Designing Radio Chips That Humans Couldn’t Even Imagine

RFIC design is a complex “dark art” that limits progress in wireless technologies like 5G, autonomous vehicles, and satellite communications.

IEEE Spectrum
Jun 24, 2026·1 min read
The emergence of the web data infrastructure layer for AI

AI is booming.

MIT Technology Review
Jun 24, 2026·1 min read
‘You can’t make billions without hurting people’: Cory Doctorow on Elon Musk, the AI bubble and bosses’ cruel fantasies

The writer who coined the word ‘enshittification’ tells us why AI will never deliver what it promises, and why it still appeals so much to those in power. A “centaur”, in automation theory, is a person assisted by a mac…

The Guardian Technology
Jun 24, 2026·1 min read
DFlash Speculative Decoding Drafts Whole Token Blocks in Parallel for Up to 15x Higher Throughput on NVIDIA Blackwell

Autoregressive large language models generate text one token at a time.

MarkTechPost
Jun 24, 2026·1 min read
Introducing the FFASR Leaderboard: Benchmarking ASR in the Real World

🚀 First open far-field ASR benchmark: community-driven evaluation across 14 simulated rooms, validated against real-world measurements: https://huggingface.co/spaces/treble-technologies/ffasr 📉 The gap is real and it is…

Hugging Face
Jun 24, 2026·1 min read
White House slashes deadline for quantum-resistant encryption adoption

The White House cuts deadline for government agencies to adopt quantum-resistant encryption to protect against quantum computer attacks.

Ars Technica
Jun 23, 2026·1 min read
Build real agentic apps using CUGA: two dozen working examples on a lightweight harness

TL;DR — Building an agent is mostly plumbing: tools, state, guardrails, scaling from one agent to many.

Hugging Face
Jun 23, 2026·1 min read
Cory Doctorow on bursting the AI bubble

Cory Doctorow's new book targets AI hype and explores its implications for society.

Ars Technica
Jun 23, 2026·1 min read
AI Is Learning to Read the Room

Imagine sitting down at your desk and logging in for a performance review, with an AI system analyzing the conversation.

IEEE Spectrum
Jun 23, 2026·1 min read
AI Data Delivery: The Key to Scaling Reliable Production Workloads

Enterprises struggle to scale AI workloads due to fragile data delivery infrastructure.

VentureBeat
Jun 23, 2026·1 min read
OpenAI Launches 'Patch the Planet' to Help Open-Source Community Fix Bugs

OpenAI teams up with Trail of Bits to help open-source maintainers secure their projects and improve cybersecurity.

TechCrunch
Jun 23, 2026·1 min read
Source of Mysterious Repeating Radio Signals from Space Identified

Researchers pinpoint origin of long-period radio transients, a mysterious phenomenon of strong radio signals arriving periodically from space.

Wired
Jun 22, 2026·1 min read
The AI world is getting ‘loopy’

On Friday, Claude Code creator Boris Cherny made an appearance at Meta’s @Scale conference and, surprisingly, the first question from the audience was about loops.

TechCrunch
Jun 22, 2026·1 min read
Commemorating 70 Years of Artificial Intelligence

Artificial intelligence is the transformative, strategic technology of the early 21st century.

IEEE Spectrum
Jun 22, 2026·1 min read
OpenAI Launches Effort to Patch Open-Source Bugs as It Takes on Anthropic’s Mythos

OpenAI announces cybersecurity-focused initiatives to combat AI hacking capabilities and partners with Trail of Bits to launch Patch the Planet.

Wired
Jun 22, 2026·1 min read
Agentic Enterprises Must Become Learning Systems to Stay Ahead

Organizations need to capture and reuse knowledge gained from daily operations to improve AI-driven decisions.

VentureBeat
Jun 22, 2026·1 min read
Researchers Introduce Self-Harness, a Framework for AI Agents to Rewrite Their Own Rules

Self-Harness lets AI agents systematically improve their operating rules, boosting performance up to 60% without relying on human engineers or stronger external models.

VentureBeat
Jun 22, 2026·1 min read
MoonMath AI Open-Sources a HIP Attention Kernel for AMD MI300X That Beats AITER v3 on Every Shape and Rounding Mode

MoonMath AI team has released a bf16 forward attention kernel for AMD’s MI300X GPU.

MarkTechPost
Jun 22, 2026·1 min read
AI hits memory wall, now needs new context tier

As AI inference workloads evolve, GPU availability is no longer the primary bottleneck; instead, context management has become a major challenge.

VentureBeat
Jun 22, 2026·1 min read
The 7 Types of Agent Memory: A Technical Guide for AI Engineers

Large language models are stateless by default.

MarkTechPost
Jun 21, 2026·1 min read
Cisco AI Introduces FAPO: Pipeline-Aware Prompt Optimization With Step-Level Failure Attribution and Claude Code Orchestration

Getting prompts right is still the hardest part of shipping reliable LLM applications.

MarkTechPost
Jun 20, 2026·1 min read
Signal President Warns Against Treating AI Chatbots as Friends

Signal President Meredith Whittaker cautions against treating AI chatbots like friends or sentient beings, citing serious privacy implications.

TechCrunch
Jun 20, 2026·1 min read
The Atlantic Creates Searchable Database of AI Music Training Data

The Atlantic's Alex Reisner uncovers four datasets of music used to train AI models, making them searchable for the public.

The Verge
Jun 20, 2026·1 min read
Viral doomsday scenario highlights Europe's AI complacency

A thought experiment about US AI ascendancy sparks concerns about Europe's preparedness.

The Guardian Technology
Jun 20, 2026·1 min read
NVIDIA AI Introduces SpatialClaw: A Training-Free Agent That Treats Code as the Action Interface for Spatial Reasoning

NVIDIA Research has released SpatialClaw, a training-free framework for spatial reasoning.

MarkTechPost
Jun 19, 2026·1 min read
VibeThinker-3B: A 3B Dense Reasoning Model Built on Qwen2.5-Coder-3B With the Spectrum-to-Signal Post-Training Pipeline

While recent breakthroughs in AI reasoning have largely been driven by massive scale, pouring in billions of parameters to cross complex cognitive thresholds, VibeThinker-3B is charting a completely different path.

MarkTechPost
Jun 19, 2026·1 min read
Nobel laureate John Jumper leaves Google DeepMind for Anthropic

Nobel Prize winner John Jumper leaves Google DeepMind for Anthropic after nearly nine years.

The Decoder
Jun 19, 2026·1 min read
AI Agents Stumble in Production: Can Hypernetworks Offer a Solution?

Enterprise AI agents often stall in production, requiring human oversight, but hypernetworks may offer a solution by generating task-specific models on demand.

VentureBeat
Jun 19, 2026·1 min read
AI chatbots increasingly used for news, but trust issues persist

10% of people worldwide now use AI chatbots for news weekly, up from 7% last year, but trust remains low.

The Decoder
Jun 19, 2026·1 min read
AI models struggle with real-world knowledge work tasks

Top AI model solves just 3% of realistic knowledge work tasks, highlighting significant challenges in handling complex projects.

The Decoder
Jun 19, 2026·1 min read
AI Bottleneck Debates and BCI Trials Take Off

Subquadratic claims to have solved a mathematical bottleneck in large language models, while BCI trials surge and a man with ALS uses a brain implant.

MIT Technology Review
Jun 19, 2026·1 min read
Perplexity Launches Brain, a Self-Improving Memory System That Builds a Context Graph of an Agent’s Work and Learns Overnight

Most AI memory remembers the user.

MarkTechPost
Jun 18, 2026·1 min read
MosaicLeaks: Can your research agent keep a secret?

Deep research agents increasingly combine private local documents with external tools like web retrieval, creating a privacy risk: an agent's external queries may leak sensitive information.

Hugging Face
Jun 18, 2026·1 min read
KV Cache Compression: TurboQuant, OSCAR, EpiCache Compete

KV cache compression methods, including TurboQuant, OSCAR, and EpiCache, aim to reduce memory usage in long-context large language models.

MarkTechPost
Jun 18, 2026·1 min read
The search for dark matter has been blown wide open

Underneath an Apennine massif, below the Jinping Mountains of Sichuan, and at the bottom of a South Dakota mine, there is a cosmic hunt afoot.

MIT Technology Review
Jun 18, 2026·1 min read
Google's Gemini Co-Lead Noam Shazeer Joins OpenAI

Noam Shazeer, co-author of 'Attention Is All You Need' and former Google Gemini co-lead, joins OpenAI after a two-year return to Google.

The Decoder
Jun 18, 2026·1 min read
OpenAI Releases LifeSciBench, a 750-Task Benchmark for Evaluating AI in Life-Science Research

OpenAI's LifeSciBench evaluates AI models on real life-science research tasks with expert-written rubrics.

MarkTechPost
Jun 18, 2026·1 min read