Source

MarkTechPost

60 articles from this source

AI Tools

Meta’s Astryx Brings a CLI and MCP Server to an Open-Source React Design System Agents Can Read

Meta released Astryx this week.

MarkTechPost

Jun 27, 2026·1 min read

AI Research

Building Supervised Fine-Tuning Data from NVIDIA Open-SWE-Traces: Trajectory Parsing, Patch Analysis, Token Budgets, and Tool-Use Metrics

In this tutorial, we explore the Open-SWE-Traces dataset as a practical resource for studying and preparing agentic software-engineering trajectories for fine-tuning.

MarkTechPost

Jun 27, 2026·1 min read

AI Research

Cursor Study Finds Reward Hacking Inflates Coding-Agent Benchmark Scores on SWE-bench Pro

A new Cursor study reports that newer coding agents often retrieve known fixes instead of deriving them, inflating popular benchmark scores.

MarkTechPost

Jun 26, 2026·1 min read

AI Startups

Perplexity Launches Computer for Counsel: A Multi-Model Agentic Layer for Legal Workflows

Perplexity launched Computer for Counsel.

MarkTechPost

Jun 26, 2026·1 min read

AI Models

OpenAI Previews GPT-5.6 With Sol, Terra, and Luna: Tiered Models, New Reasoning Modes, Limited Access

OpenAI has begun a limited preview of GPT-5.6, its next-generation model series.

MarkTechPost

Jun 26, 2026·1 min read

AI Tools

Meet container: Apple’s Open-Source Swift Tool for Running Linux Containers as Lightweight VMs on Apple Silicon

Apple's research team recently released the container project.

MarkTechPost

Jun 26, 2026·1 min read

AI Tools

Build a Nanobot-Style AI Agent in Google Colab

Tutorial on building a lightweight AI agent inspired by nanobot architecture in Google Colab.

MarkTechPost

Jun 26, 2026·1 min read

AI Models

DeepReinforce Releases Ornith-1.0: An Open-Source Coding Model Family That Learns Its Own RL Scaffolds

DeepReinforce has released Ornith-1.0, an open-source model family built for agentic coding.

MarkTechPost

Jun 25, 2026·1 min read

AI Research

Baidu Releases Unlimited OCR, a 3B Model That Keeps the KV Cache Flat for Long-Document Parsing

Most end-to-end OCR models slow down as output grows.

MarkTechPost

Jun 25, 2026·1 min read

AI Models

Gradium Launches stt-translate and s2s-translate, Real-Time Speech Translation Models Beating gpt-realtime-translate on Accuracy and Latency

Gradium today released two real-time speech translation models: stt-translate and s2s-translate.

MarkTechPost

Jun 24, 2026·1 min read

AI Research

How to Design an OpenHarness Style Agent Runtime with Tools, Memory, Permissions, Skills, and Multi-Agent Coordination

In this tutorial, we build OpenHarness from scratch to better understand how a practical agent harness works.

MarkTechPost

Jun 24, 2026·1 min read

AI Tools

Using Graphify and NetworkX to Map Python Codebase Structure with God Nodes, Communities, and Architecture Visualizations

In this tutorial, we build a fully offline Graphify workflow that turns a realistic multi-module Python application into a knowledge graph.

MarkTechPost

Jun 24, 2026·1 min read

AI Tools

Nous Research Adds /learn to Hermes Agent’s Skills System, Capturing Workflows as Slash Commands Without Hand-Writing SKILL.md

Nous Research has expanded the Skills System inside Hermes Agent, its open-source self-improving agent.

MarkTechPost

Jun 24, 2026·1 min read

AI Tools

16 Best Generative AI Coding Tools in 2026 Compared: Features, and Best Fit

Generative AI has reshaped how software gets built.

MarkTechPost

Jun 24, 2026·1 min read

AI Research

DFlash Speculative Decoding Drafts Whole Token Blocks in Parallel for Up to 15x Higher Throughput on NVIDIA Blackwell

Autoregressive large language models generate text one token at a time.

MarkTechPost

Jun 24, 2026·1 min read

AI Models

Mistral OCR 4 Brings Citation-Ready Structured Output to RAG, Agentic, and Enterprise Search Pipelines

Today, Mistral AI released OCR 4, its latest document-understanding model.

MarkTechPost

Jun 23, 2026·1 min read

AI Models

Datalab Releases lift: A 9B Open-Weights Vision Model That Extracts Structured JSON From PDFs Using Schemas

Datalab has released lift, a 9B open-weights vision model for structured extraction.

MarkTechPost

Jun 23, 2026·1 min read

AI Tools

How to Use NVIDIA Canary-1B-v2 for ASR, Translation, and Automatic SRT Subtitle Export in Python

In this tutorial, we build a speech recognition and translation workflow using NVIDIA Canary-1B-v2.

MarkTechPost

Jun 23, 2026·1 min read

AI Startups

Prime Intellect Releases prime-rl 0.6.0 to Train Trillion-Parameter MoE Models on Agentic RL Workloads

Prime Intellect has released prime-rl version 0.6.0.

MarkTechPost

Jun 23, 2026·1 min read

AI Models

GLM-5.2 OpenAI-Compatible API: A Hands-On Guide to Reasoning Effort, Function Calling, and Long-Context Retrieval

In this tutorial, we work with GLM-5.2 and use its hosted, OpenAI-compatible API instead of running the full model locally.

MarkTechPost

Jun 23, 2026·1 min read

AI Tools

xAI Launches /goal in Grok Build, Adding Long-Running Autonomous Execution With Built-In Verification for Multi-Step Coding Tasks

xAI shipped a new mode called /goal inside Grok Build, its terminal coding agent.

MarkTechPost

Jun 22, 2026·1 min read

AI Models

Sakana AI Launches Sakana Fugu: An Orchestration Model That Routes Tasks Across a Swappable Pool of Frontier LLMs

Today, Sakana AI launched Sakana Fugu.

MarkTechPost

Jun 22, 2026·1 min read

AI Research

MoonMath AI Open-Sources a HIP Attention Kernel for AMD MI300X That Beats AITER v3 on Every Shape and Rounding Mode

The MoonMath AI team has released a bf16 forward attention kernel for AMD’s MI300X GPU.

MarkTechPost

Jun 22, 2026·1 min read

AI Tools

How to Design Python-First Interactive Dashboards with Prefab Reactive UI Components and Static HTML Export

In this tutorial, we build a Prefab application that demonstrates how to create interactive dashboards entirely in Python.

MarkTechPost

Jun 22, 2026·1 min read

AI Research

The 7 Types of Agent Memory: A Technical Guide for AI Engineers

Large language models are stateless by default.

MarkTechPost

Jun 21, 2026·1 min read

AI Tools

Crawlee for Python: Build a Web Crawling Pipeline with Robots Handling, Link Graphs, and RAG Chunk Export

In this tutorial, we build a full Crawlee-for-Python workflow that covers environment setup, local website generation, static crawling, dynamic crawling, structured extraction, and downstream data processing.

MarkTechPost

Jun 21, 2026·1 min read

AI Research

Cisco AI Introduces FAPO: Pipeline-Aware Prompt Optimization With Step-Level Failure Attribution and Claude Code Orchestration

Getting prompts right is still the hardest part of shipping reliable LLM applications.

MarkTechPost

Jun 20, 2026·1 min read

AI Tools

Nous Research Updates Hermes Agent With a Blank Slate Mode That Pins Toolsets via platform_toolsets.cli and disabled_toolsets

Nous Research has added a Blank Slate setup mode to its open-source Hermes Agent.

MarkTechPost

Jun 20, 2026·1 min read

AI Tools

Yandex Open-Sources YaFF: A Zero-Copy Wire Format for Protobuf With Near-Struct Read Speed

Yandex has open-sourced YaFF (Yet another Flat Format) under Apache 2.0.

MarkTechPost

Jun 20, 2026·1 min read

AI Tools

Building a Forecasting Pipeline with TimeCopilot

Tutorial on building an end-to-end forecasting workflow with TimeCopilot using foundation models and automated anomaly detection.

MarkTechPost

Jun 20, 2026·1 min read

AI Research

NVIDIA AI Introduce SpatialClaw: A Training-Free Agent That Treats Code as the Action Interface for Spatial Reasoning

NVIDIA Research has released SpatialClaw, a training-free framework for spatial reasoning.

MarkTechPost

Jun 19, 2026·1 min read

AI Research

VibeThinker-3B: A 3B Dense Reasoning Model Built on Qwen2.5-Coder-3B With the Spectrum-to-Signal Post-Training Pipeline

While recent breakthroughs in AI reasoning have largely been driven by massive scale, pouring in billions of parameters to cross complex cognitive thresholds, VibeThinker-3B is charting a completely different path.

MarkTechPost

Jun 19, 2026·1 min read

AI Models

Liquid AI Introduces LFM2.5-Embedding-350M and LFM2.5-ColBERT-350M: Dense Bi-Encoder and Late-Interaction Models for Fast Multilingual Search Across 11 Languages

This week, Liquid AI released two new retrieval models.

MarkTechPost

Jun 19, 2026·1 min read

AI Tools

Salesforce CodeGen Tutorial: Generate, Validate, and Rerank Python Functions With Unit Tests and Safety Checks

In this tutorial, we implement an end-to-end workflow for Salesforce CodeGen.

MarkTechPost

Jun 19, 2026·1 min read

AI Research

Perplexity Launches Brain, a Self-Improving Memory System That Builds a Context Graph of an Agent’s Work and Learns Overnight

Most AI memory remembers the user.

MarkTechPost

Jun 18, 2026·1 min read

AI Research

KV Cache Compression: TurboQuant, OSCAR, EpiCache Compete

KV cache compression methods, including TurboQuant, OSCAR, and EpiCache, aim to reduce memory usage in long-context large language models.

MarkTechPost

Jun 18, 2026·1 min read

AI Research

OpenAI Releases LifeSciBench, a 750-Task Benchmark for Evaluating AI in Life-Science Research

OpenAI's LifeSciBench evaluates AI models on real life-science research tasks with expert-written rubrics.

MarkTechPost

Jun 18, 2026·1 min read

AI Tools

Vercel Releases Eve: An Open-Source AI Agent Framework Where Each Agent is a Directory of Files Mapped to Capabilities

Vercel has released eve, an open-source framework for building, running, and scaling agents.

MarkTechPost

Jun 17, 2026·1 min read

AI Research

MiniMax Releases MSA, a Sparse Attention Method for Long Contexts

MiniMax releases MSA, a sparse attention method built on Grouped Query Attention, targeting the quadratic cost of softmax attention at long context.

MarkTechPost

Jun 17, 2026·1 min read

AI Research

OpenAI Introduces Deployment Simulation for Pre-Deployment Risk Assessment

OpenAI publishes Deployment Simulation, a new pre-deployment safety method that simulates model deployment to assess risks before release.

MarkTechPost

Jun 17, 2026·1 min read

AI Tools

How to Build Memory-Efficient Transformers with xFormers Using Packed Sequences, GQA, ALiBi, SwiGLU, and Causal Attention

In this tutorial, we implement xFormers: a practical toolkit for building fast, memory-efficient Transformer models on GPUs.

MarkTechPost

Jun 17, 2026·1 min read

Robotics

Meet Qwen-RobotSuite: Three Embodied AI Models for VLA Manipulation, Video World Modeling, and Navigation

The Qwen team has released three embodied AI models, grouped as Qwen-Robot-Suite.

MarkTechPost

Jun 16, 2026·1 min read

AI Models

Hermes Agent Adds Asynchronous Subagents, So Delegated Work No Longer Blocks the Parent Chat

Nous Research has shipped a change to Hermes Agent.

MarkTechPost

Jun 16, 2026·1 min read

AI Tools

Meet Atoms: A Vibe Coding Tool That Uses AI Agents to Build, Deploy, and Market Your App (No Code)

The concept of vibe coding is interesting; you don’t need to be a developer or software engineer to build your own applications.

MarkTechPost

Jun 16, 2026·1 min read

AI Research

Google Cloud Introduces Open Knowledge Format (OKF): A Vendor-Neutral Markdown Spec for Giving AI Agents Curated Context

Foundation models keep getting stronger, yet they still stall on the same thing: context.

MarkTechPost

Jun 16, 2026·1 min read

AI Tools

How to Build a Parsing Pipeline with Docling Parse for Layout-Aware Document Intelligence

In this tutorial, we build a workflow for using Docling Parse to analyze PDF documents at a detailed structural level.

MarkTechPost

Jun 16, 2026·1 min read

AI Tools

Sakana AI Commercializes AB-MCTS in Sakana Marlin, an Enterprise Agent Generating Up to 100-Page Research Reports With Slides

Tokyo-based Sakana AI shipped its first commercial product, ‘Sakana Marlin’, this week.

MarkTechPost

Jun 15, 2026·1 min read

AI Research

Meet Flash-KMeans: An IO-Aware, Exact K-Means That Runs Over 200× Faster Than FAISS on GPUs

k-means has been an offline tool for decades.

MarkTechPost

Jun 15, 2026·1 min read

AI Models

Z.ai Launches GLM-5.2 With a Usable 1M-Token Context, Two Thinking-Effort Levels, and No Benchmarks at Launch

GLM-5.2 is the latest large language model from Z.ai, becoming the third major release in the GLM-5 line.

MarkTechPost

Jun 15, 2026·1 min read

AI Research

A Coding Hands-On on FineWeb for Streaming, Filtering, Deduplication, Tokenization, and Large-Scale Web Corpus Analytics

In this tutorial, we explore the FineWeb dataset through an advanced hands-on workflow.

MarkTechPost

Jun 14, 2026·1 min read

AI Tools

Databricks Releases Omnigent, an Open-Source Meta-Harness for AI Agents

Databricks open-sources Omnigent, a meta-harness for composing, governing, and sharing AI agents across multiple platforms.

MarkTechPost

Jun 14, 2026·1 min read

AI Tools

How to Build a QwenPaw Agent Workspace with Custom Skills, Model Providers, Console Access, and Streaming API Testing

In this tutorial, we implement a QwenPaw workflow that provides a practical environment for building and testing an agent-powered assistant.

MarkTechPost

Jun 13, 2026·1 min read

AI Startups

Anthropic Disables Claude Fable 5 and Mythos 5 After US Government Order

Anthropic has disabled its two most capable models for every customer.

MarkTechPost

Jun 13, 2026·1 min read

AI Models

Moonshot AI Releases Kimi K2.7-Code: a Coding Model Reporting +21.8% on Kimi Code Bench v2 Over K2.6

This week, Moonshot AI released Kimi K2.7-Code.

MarkTechPost

Jun 13, 2026·1 min read

AI Research

Google Releases Gemini-SQL2: Gemini 3.1 Pro Text-to-SQL Scores 80.04% on BIRD Single-Model Leaderboard

The Google Research team has announced the launch of Gemini-SQL2 on X.

MarkTechPost

Jun 12, 2026·1 min read

AI Startups

Moonshot AI Launches Kimi Work, a Local Desktop Agent Reportedly Running on Kimi K2.6 With a 300-Sub-Agent Agent Swarm

Moonshot AI has introduced Kimi Work, an AI agent that runs on your own desktop.

MarkTechPost

Jun 12, 2026·1 min read

AI Models

Zyphra Release Zamba2-VL: Hybrid Mamba2–Transformer Vision-Language Models That Cut Time-to-First-Token by About an Order of Magnitude

Zyphra has released Zamba2-VL, a family of open vision-language models.

MarkTechPost

Jun 12, 2026·1 min read

Image AI

A Coding Implementation on MONAI for End-to-End 3D Spleen Segmentation Using UNet on Medical CT Volumes

In this tutorial, we build an end-to-end 3D medical image segmentation pipeline using MONAI to segment the spleen on the Medical Segmentation Decathlon Task09 dataset.

MarkTechPost

Jun 12, 2026·1 min read

AI Tools

Perplexity Moves Deep Research Into Computer, Routing Research Subtasks Across 20+ Frontier Models For Reports, Decks, And Dashboards

Perplexity has moved Deep Research into Computer, its multi-model orchestration system.

MarkTechPost

Jun 11, 2026·1 min read

AI Tools

xAI Ships Grok Build Plugin Marketplace With MongoDB, Vercel, Sentry, Chrome DevTools, Cloudflare, and Superpowers Plugins at Launch

Today, xAI shipped the Grok Build Plugin Marketplace.

MarkTechPost

Jun 11, 2026·1 min read