Category

AI Models

60 articles in this category

OpenAI Previews GPT-5.6 With Sol, Terra, and Luna: Tiered Models, New Reasoning Modes, Limited Access
AI Models

OpenAI has begun a limited preview of GPT-5.6, its next-generation model series.

MarkTechPost
Jun 26, 2026·1 min read
OpenAI unveils GPT-5.6 family with Sol, Terra, and Luna models
AI Models

OpenAI announces limited preview of GPT-5.6 family, including Sol, Terra, and Luna models for various enterprise needs.

VentureBeat
Jun 26, 2026·1 min read
OpenAI unveils GPT-5.6 model suite amid US AI regulatory drama
AI Models

OpenAI releases GPT-5.6 model suite, including Sol, Terra, and Luna, with improved coding, cybersecurity, and biology capabilities.

The Verge
Jun 26, 2026·1 min read
OpenAI Delays AI Model Release at US Government's Request
AI Models

OpenAI is staggering the release of GPT-5.6 after a US government request, echoing Anthropic's Mythos launch.

The Guardian Technology
Jun 26, 2026·1 min read
Liquid AI releases smallest AI model yet, LFM2.5-230M, for data extraction and local deployment
AI Models

Liquid AI's LFM2.5-230M model outperforms larger models in data extraction and can run on local devices.

VentureBeat
Jun 25, 2026·1 min read
Anthropic's Claude gains traction among paid AI consumers
AI Models

Anthropic's Claude is increasingly chosen by consumers who pay for AI, according to trend data from Indagari.

TechCrunch
Jun 25, 2026·1 min read
DeepReinforce Releases Ornith-1.0: An Open-Source Coding Model Family That Learns Its Own RL Scaffolds
AI Models

DeepReinforce has released Ornith-1.0, an open-source model family built for agentic coding.

MarkTechPost
Jun 25, 2026·1 min read
OpenAI's GPT-5.5 Instant Model Updated with Improved Shopping and Constraint Handling
AI Models

OpenAI updates GPT-5.5 Instant model, used in free ChatGPT version, with improved shopping results and complex constraint handling.

VentureBeat
Jun 25, 2026·1 min read
Mistral Launches OCR 4, Pushing Document Extraction Beyond Text
AI Models

Mistral AI releases OCR 4, a document intelligence model that extracts structured representations of documents, complete with bounding boxes and confidence scores.

VentureBeat
Jun 24, 2026·1 min read
Gradium Launches stt-translate and s2s-translate, Real-Time Speech Translation Models Beating gpt-realtime-translate on Accuracy and Latency
AI Models

Gradium today released two real-time speech translation models: stt-translate and s2s-translate.

MarkTechPost
Jun 24, 2026·1 min read
OpenAI and Broadcom unveil custom AI inference chip Jalapeño
AI Models

OpenAI and Broadcom partner on Jalapeño, a custom AI accelerator chip for large language model inference.

VentureBeat
Jun 24, 2026·1 min read
OpenAI Unveils Custom Chip Built with Broadcom
AI Models

OpenAI reveals its first custom-built inference processor, Jalapeño, designed with Broadcom to enhance AI model performance.

TechCrunch
Jun 24, 2026·1 min read
Mistral OCR 4 Brings Citation-Ready Structured Output to RAG, Agentic, and Enterprise Search Pipelines
AI Models

Today, Mistral AI released OCR 4, its latest document-understanding model.

MarkTechPost
Jun 23, 2026·1 min read
Datalab Releases lift: A 9B Open-Weights Vision Model That Extracts Structured JSON From PDFs Using Schemas
AI Models

Datalab has released lift, a 9B open-weights vision model for structured extraction.

MarkTechPost
Jun 23, 2026·1 min read
Cursor announces its own AI model, a new Git platform, and a mobile app
AI Models

Cursor has shared new details about its first fully self-trained AI model and announced two new products.

The Decoder
Jun 23, 2026·1 min read
GLM-5.2 OpenAI-Compatible API: A Hands-On Guide to Reasoning Effort, Function Calling, and Long-Context Retrieval
AI Models

In this tutorial, we work with GLM-5.2 and use its hosted, OpenAI-compatible API instead of running the full model locally.

MarkTechPost
Jun 23, 2026·1 min read
Sakana AI Launches Sakana Fugu: An Orchestration Model That Routes Tasks Across a Swappable Pool of Frontier LLMs
AI Models

Today, Sakana AI launched Sakana Fugu.

MarkTechPost
Jun 22, 2026·1 min read
We got local models to triage the OpenClaw repo for FREE!*
AI Models

*Free as in beer, excluding the cost of electricity, and assuming you already own the hardware. June 2026 will go down as the moment that people realized closed models can be taken away.

Hugging Face
Jun 22, 2026·1 min read
US Export Control on AI Models Falters
AI Models

The White House ordered Anthropic to restrict exports of AI models Fable and Mythos, citing national security concerns.

TechCrunch
Jun 19, 2026·1 min read
IEEE Rolls Out Large Language Models Virtual Training Course
AI Models

Large language models have moved out of the research lab and into engineers’ daily workflow.

IEEE Spectrum
Jun 19, 2026·1 min read
Liquid AI Introduces LFM2.5-Embedding-350M and LFM2.5-ColBERT-350M: Dense Bi-Encoder and Late-Interaction Models for Fast Multilingual Search Across 11 Languages
AI Models

This week, Liquid AI released two new retrieval models.

MarkTechPost
Jun 19, 2026·1 min read
OpenAI upgrades ChatGPT's healthcare capabilities with GPT-5.5 Instant
AI Models

OpenAI upgrades ChatGPT's healthcare capabilities with GPT-5.5 Instant, outperforming doctor-written answers.

The Decoder
Jun 18, 2026·1 min read
New AI optimization framework Arbor outperforms Claude Code and Codex by 2.5x
AI Models

Arbor framework automates AI-driven research and optimization, outperforming Claude Code and Codex by 2.5x on the same compute budget.

VentureBeat
Jun 18, 2026·1 min read
UK to Use Flawed Facial Age Estimation for Asylum Seekers
AI Models

UK to deploy facial age estimation for asylum seekers despite technology's known inaccuracy and bias.

Wired
Jun 18, 2026·1 min read
Anthropic Pulls AI Models After US Government Export Control Directive
AI Models

Anthropic takes AI models offline due to US export control directive barring foreign nationals from using services.

Ars Technica
Jun 17, 2026·1 min read
Zhipu AI's GLM-5.2 rivals closed-source leaders in coding
AI Models

Zhipu AI releases GLM-5.2 with 1-million-token context under MIT license, nearing closed-source models in coding tasks.

The Decoder
Jun 17, 2026·1 min read
GLM-5.2: Built for Long-Horizon Tasks
AI Models

We're introducing GLM-5.2, our latest flagship model for long-horizon tasks.

Hugging Face
Jun 17, 2026·1 min read
Z.ai's GLM-5.2 Open-Weights Model Beats GPT-5.5 on Coding Benchmarks at Fraction of Cost
AI Models

Chinese AI startup Z.ai releases GLM-5.2, a 753-billion parameter open-weights LLM that outperforms GPT-5.5 on multiple long-horizon coding benchmarks at 1/6th the cost.

VentureBeat
Jun 16, 2026·1 min read
Microsoft's Copilot Cowork switches to usage-based billing, considers DeepSeek model
AI Models

Microsoft moves Copilot Cowork to usage-based billing, mulls cheaper DeepSeek model.

The Decoder
Jun 16, 2026·1 min read
How easily can Russian propaganda fool AI models? A new benchmark finds out
AI Models

The Institute of the Estonian Language has released a benchmark measuring how susceptible AI language models are to Russian propaganda.

The Decoder
Jun 16, 2026·1 min read
Hermes Agent Adds Asynchronous Subagents, So Delegated Work No Longer Blocks the Parent Chat
AI Models

Nous Research has shipped a change to Hermes Agent.

MarkTechPost
Jun 16, 2026·1 min read
Roblox Age Verification Tech Evolves Beyond Simple Checkbox
AI Models

Roblox introduces facial age estimation tech to improve age verification.

The Verge
Jun 15, 2026·1 min read
Z.ai Launches GLM-5.2 With a Usable 1M-Token Context, Two Thinking-Effort Levels, and No Benchmarks at Launch
AI Models

GLM-5.2 is the latest large language model from Z.ai, becoming the third major release in the GLM-5 line.

MarkTechPost
Jun 15, 2026·1 min read
AI Model 'Count Anything' Accurately Counts Objects in Images Using Text Prompts
AI Models

New AI model 'Count Anything' counts objects in images using text prompts with a significantly reduced error rate.

The Decoder
Jun 13, 2026·1 min read
Anthropic blocks Fable 5 and Mythos 5 access globally amid government order
AI Models

Anthropic cuts off access to Fable 5 and Mythos 5 for all foreign nations due to national security concerns.

The Verge
Jun 13, 2026·1 min read
Anthropic to disable advanced AI models after US export control order
AI Models

Anthropic to disable its most advanced AI models for all users after US government order citing national security concerns.

The Guardian Technology
Jun 13, 2026·1 min read
Anthropic blocks public access to Claude Fable 5 and Mythos 5 models following US government order
AI Models

US government issues export control directive, citing national security, for Anthropic to suspend access to top-tier AI models for foreign nationals.

VentureBeat
Jun 13, 2026·1 min read
Anthropic's Claude Fable 5 outperforms GPT-5.5 in math accuracy
AI Models

Claude Fable 5 achieves 88% accuracy on FrontierMath's hardest tier, surpassing GPT-5.5 by 13 points.

The Decoder
Jun 13, 2026·1 min read
Moonshot AI Releases Kimi K2.7-Code: a Coding Model Reporting +21.8% on Kimi Code Bench v2 Over K2.6
AI Models

This week, Moonshot AI released Kimi K2.7-Code.

MarkTechPost
Jun 13, 2026·1 min read
Anthropic Disables AI Models After US Government Directive
AI Models

Anthropic shuts down Fable and Mythos AI models due to US export controls, just days after launch.

Ars Technica
Jun 13, 2026·1 min read
Moonshot AI's Kimi K2.7-Code update claims 30% reduction in thinking tokens
AI Models

Moonshot AI releases Kimi K2.7-Code, an open-source update to its K2 coding model family, with claimed performance gains and reduced thinking tokens.

VentureBeat
Jun 12, 2026·1 min read
Zyphra Releases Zamba2-VL: Hybrid Mamba2–Transformer Vision-Language Models That Cut Time-to-First-Token by About an Order of Magnitude
AI Models

Zyphra has released Zamba2-VL, a family of open vision-language models.

MarkTechPost
Jun 12, 2026·1 min read
Xiaomi's MiMo Code Outperforms Claude Code on Long-Horizon Coding Tasks
AI Models

Xiaomi's open-source MiMo Code V0.1.0 beats Anthropic's Claude Code on agentic coding benchmarks, especially on 200+ step tasks.

VentureBeat
Jun 11, 2026·1 min read
Google Releases DiffusionGemma, a Diffusion-Based Language Model
AI Models

Google's DiffusionGemma generates 256 tokens in parallel, self-correcting as it goes, with speeds up to 4x faster than standard models on GPUs.

VentureBeat
Jun 11, 2026·1 min read
Meet ‘North Mini Code’: Cohere’s 30B Open-Weight Mixture-of-Experts Model With 3B Active Parameters for Agentic Coding
AI Models

This week, Cohere's AI team shipped its first developer-facing coding model, named ‘North Mini Code’.

MarkTechPost
Jun 11, 2026·1 min read
Google DeepMind releases DiffusionGemma, a model that runs local AI 4x faster
AI Models

Another day, another AI model from Google.

Ars Technica
Jun 10, 2026·1 min read
Google AI Releases DiffusionGemma, a 26B MoE Open Model Using Text Diffusion for Up to 4x Faster Generation
AI Models

Google's AI team, including Google DeepMind researchers, has just released DiffusionGemma, an experimental open model for text generation.

MarkTechPost
Jun 10, 2026·1 min read
Google Defends Use of YouTube Music for AI Training
AI Models

Google faces lawsuit over allegedly using YouTube music uploads to train Lyria 3 AI model.

The Verge
Jun 10, 2026·1 min read
Instagram lets users tweak algorithm on main feed
AI Models

Instagram introduces 'Your Algorithm' feature to let users customize their main feed.

The Verge
Jun 10, 2026·1 min read
Anthropic Releases Claude Fable 5 and Claude Mythos 5: Same Underlying Model, Different Safeguards, New Mythos-Class Tier
AI Models

Anthropic released two models on June 9, 2026: Claude Fable 5 and Claude Mythos 5.

MarkTechPost
Jun 10, 2026·1 min read
Anthropic Releases Restricted Version of Claude Mythos AI Model
AI Models

Anthropic makes new AI model available to public while restricting use in sensitive areas.

The Guardian Technology
Jun 09, 2026·1 min read
AI industry faces cost pressure as demand shifts to cheaper models
AI Models

The AI boom's assumption that bigger models are more powerful is being tested as cost-conscious users turn to smaller, cheaper models.

TechCrunch
Jun 09, 2026·1 min read
Anthropic Unveils Claude Fable 5 and Mythos 5 with Major Coding and Science Gains
AI Models

Anthropic releases Claude Fable 5 and Mythos 5, claiming major improvements in coding and research over the Opus generation.

The Decoder
Jun 09, 2026·1 min read
Anthropic Launches Claude Fable 5 and Claude Mythos 5 AI Models
AI Models

Anthropic releases Claude Fable 5 and Claude Mythos 5, its most powerful generally available AI models, with enhanced performance and safeguards.

VentureBeat
Jun 09, 2026·1 min read
Anthropic Releases Upgraded AI Models with Enhanced Capabilities
AI Models

Anthropic launches Claude Fable 5 and Claude Mythos 5 AI models with improved capabilities and safeguards.

Wired
Jun 09, 2026·1 min read
Anthropic’s Claude Fable 5 is a version of Mythos the public can access today
AI Models

Anthropic is bringing its most powerful AI model to the general public for the first time, but it’s doing it with guardrails.

TechCrunch
Jun 09, 2026·1 min read
Introducing North Mini Code: Cohere’s First Model For Developers
AI Models

Today, we are releasing North Mini Code, a 30B-parameter Mixture-of-Experts model with 3B active parameters with powerful agentic coding capabilities, available on Hugging Face under the Apache 2.0 license.

Hugging Face
Jun 09, 2026·1 min read
Xiaomi MiMo and TileRT Achieve 1000 Tokens Per Second on 1-Trillion-Parameter Model
AI Models

Xiaomi's MiMo team and TileRT systems group release UltraSpeed, a high-speed serving mode that decodes over 1000 tokens per second on a 1-trillion-parameter model.

MarkTechPost
Jun 08, 2026·1 min read
Perplexity Unshackles AI Models with 'Search as Code' Architecture
AI Models

Perplexity introduces 'Search as Code', allowing AI models to write their own search routines in Python, outperforming rivals while slashing token costs.

The Decoder
Jun 07, 2026·1 min read
Meet Harness-1: A 20B Retrieval Subagent Trained With Reinforcement Learning Inside a Stateful Search Harness on gpt-oss-20b
AI Models

A team of researchers introduces Harness-1, a 20B retrieval subagent that uses reinforcement learning inside a stateful search harness to improve search decisions and evidence gathering.

MarkTechPost
Jun 07, 2026·1 min read