Source

Hugging Face

10 articles from this source

MedQA: Fine-Tuning a Clinical AI on AMD ROCm — No CUDA Required

A complete walkthrough of LoRA fine-tuning Qwen3-1.7B on MedMCQA using AMD MI300X, built for the AMD Developer Hackathon on lablab.ai.

Hugging Face

May 08, 2026·1 min read

AI Research

Correctness Before Corrections: The vLLM V0 to V1 Migration

PipelineRL's vLLM inference engine upgrade from V0 to V1 required fixing backend behavior to match training dynamics.

Hugging Face

May 06, 2026·1 min read

AI Research

The AI Evaluation Bottleneck: How Cost is Redefining the Field

The cost of evaluating AI models has skyrocketed, making it a new bottleneck in the field, with some evaluations costing tens of thousands of dollars.

Hugging Face

Apr 29, 2026·1 min read

AI Research

Granite 4.1 LLMs: A Technical Walkthrough of Data Engineering and Training

A detailed look at the data engineering, pre-training, supervised fine-tuning, and reinforcement learning behind the Granite 4.1 LLMs.

Hugging Face

Apr 29, 2026·1 min read

AI Tools

DeepInfra Joins Hugging Face as Supported Inference Provider

DeepInfra is now a supported Inference Provider on the Hugging Face Hub, expanding serverless inference capabilities.

Hugging Face

Apr 28, 2026·1 min read

AI Models

Introducing NVIDIA Nemotron 3 Nano Omni: Long-Context Multimodal Intelligence for Documents, Audio and Video Agents

NVIDIA unveils Nemotron 3 Nano Omni, a cutting-edge AI model capable of processing and understanding long-context multimodal data, including documents, audio, and video.

Hugging Face

Apr 28, 2026·1 min read

AI Tools

Building Scalable Web Apps with OpenAI's Privacy Filter

OpenAI's open-source Privacy Filter enables developers to build scalable web apps that detect personally identifiable information (PII) in text.

Hugging Face

Apr 26, 2026·1 min read

AI Models

DeepSeek-V4: A Million-Token Context That Agents Can Actually Use

DeepSeek releases V4 with a 1M-token context window, competitive benchmark numbers, and innovative architecture for efficient large context length support.

Hugging Face

Apr 23, 2026·1 min read

AI Tools

How to Use Transformers.js in a Chrome Extension

A step-by-step guide on integrating Transformers.js into a Chrome extension, leveraging Gemma 4 E2B for enhanced web navigation.

Hugging Face

Apr 22, 2026·1 min read

AI Research

QIMMA قِمّة ⛰: A Quality-First Arabic LLM Leaderboard

QIMMA validates benchmarks before evaluating models, ensuring reported scores reflect genuine Arabic language capability in LLMs.

Hugging Face

Apr 21, 2026·1 min read