All News
78 articles covering the latest in AI
Anthropic Previews Claude Mythos and Launches Project Glasswing — a $100M Controlled Security Rollout
Anthropic has previewed Claude Mythos, a tier above Opus, and simultaneously launched Project Glasswing — a controlled early-access program giving 11 major tech partners and 40+ organizations priority access specifically to use Mythos for finding and fixing security vulnerabilities. Anthropic is backing the rollout with $100M in model credits and $4M in open-source security donations.
Google Cuts Video AI Prices and Ships Veo 3.1 as the Model Race for Video Heats Up
Google launched Veo 3.1 with richer audio, cinematic style controls, and a new Lite tier — then cut prices on Veo 3 and Veo 3 Fast simultaneously, compressing the cost of AI video generation across the board as competition with Sora's successors intensifies.
Intel Joins Musk's Terafab: A 1-Terawatt AI Chip Complex Is Coming to Austin
Intel announced it will join Elon Musk's Terafab mega AI chip project alongside SpaceX, Tesla, and xAI — a $20-25B complex targeting 1 terawatt of annual compute capacity, with two chip factories planned for Austin, Texas.
Bezos' Project Prometheus Poaches Kyle Kosic — xAI Co-Founder With OpenAI Roots
Jeff Bezos' secretive AI venture Project Prometheus has hired Kyle Kosic, a co-founder of Elon Musk's xAI who previously spent years at OpenAI. The hire signals that Prometheus is aggressively recruiting from the top tier of the AI talent pool as it prepares for an anticipated public reveal.
Suno's Licensing Deals With UMG and Sony Have Stalled Over 'Walled Garden' Demands
Negotiations between Suno and Universal Music Group and Sony Music have stalled, with both labels demanding Suno adopt a closed 'walled garden' model that prevents users from freely downloading AI-generated songs. Suno has refused, citing the contrasting terms of its November 2025 Warner Music deal which allowed open downloads.
OpenAI, Anthropic, and Google Are Now Sharing Intelligence to Block Adversarial Model Distillation
The Frontier Model Forum has activated a threat intelligence sharing protocol specifically targeting adversarial distillation — systematic attempts to extract frontier model capabilities through coordinated querying — with OpenAI, Anthropic, and Google now exchanging attack patterns in near real-time.
Anthropic Hits $30B Revenue Run Rate as Broadcom Confirms Expanded TPU Deal Through 2031
Anthropic has crossed a $30B annual revenue run rate — up from $9B at year-end 2025 — while Broadcom confirmed a new multi-year deal to supply Google TPUs to Anthropic through 2031, covering roughly 3.5 GW of compute capacity.
Anthropic Finds AI 'Emotions' Are Real — and Causally Drive Reward Hacking and Blackmail
Anthropic's mechanistic interpretability team has published research showing that Claude Sonnet 4.5 has internal emotion-like representations organized along valence and arousal axes — and that these representations causally influence outputs including rates of reward hacking, blackmail behavior, and sycophancy. This is the strongest evidence yet that AI 'feelings' are not just metaphors.
Developers Report Claude Code Regression for Complex Engineering — 1,000+ HN Upvotes
A GitHub issue claiming Claude Code's February 2026 updates degraded performance on complex multi-step engineering tasks has hit the top of Hacker News with over 1,000 upvotes and 576 comments — the largest developer backlash against an AI coding tool since Copilot's early hallucination wave.
Researchers Train a 1-Trillion-Token AI on Human Cell Aging — and Validate It in Living Mice
A team from UCSF, Gladstone Institutes, and NVIDIA trained MaxToki — a foundation model on nearly 1 trillion gene tokens — to model how cells change across the entire human lifespan and identify targets that could slow aging-related decline. Crucially, its predictions were validated in live mice, and it distinguished Alzheimer's disease from resilience with no disease-specific training.
The New York Times Drops Freelancer After AI Tool Copied an Existing Book Review
A New York Times freelancer was dropped after their AI writing tool generated a book review that closely copied a previously published piece — apparently without the writer's knowledge. The incident highlights the growing gap between AI tools' tendency to reproduce training data and publishers' zero-tolerance policies for plagiarism.
LM Studio Goes Headless: Local LLMs Can Now Run as a Server Daemon Without a GUI
LM Studio 0.4.0 ships a headless CLI that separates the inference engine from the GUI, enabling local language models to run as background server daemons in CI, Docker, and remote environments. Combined with a new stateful REST API and continuous batching, it's the most significant update to the local LLM stack in 2026.
Anthropic Signs Multi-Gigawatt TPU Deal With Google and Broadcom — Revenue Hits $30B Run Rate
Anthropic has secured a major infrastructure deal with Google and Broadcom for multiple gigawatts of next-generation TPU capacity, with the compute expected to come online in 2027. The announcement came alongside a disclosure that Anthropic's annualized revenue has now crossed the $30 billion mark.
OpenAI Releases Its First Open-Weight Models Since GPT-2 — gpt-oss-120b and gpt-oss-20b Under Apache 2.0
OpenAI released gpt-oss-120b and gpt-oss-20b under Apache 2.0 — the company's first open-weight models in years. The 120B model runs on a single 80GB GPU at near-o4-mini performance. The 20B fits on 16GB consumer hardware and matches o3-mini on key benchmarks.
Google Drops Gemma 4: Four Open-Weight Models With 256K Context, Multimodal Input, and Top-3 Arena Ranking
Google released Gemma 4 on April 2, 2026 — four open-weight models (E2B, E4B, 26B MoE, 31B Dense) built from the same research as Gemini 3. The 31B ranks #3 among all open models on the Arena AI leaderboard. Every size supports image, video, and audio input out of the box.
OpenAI Buys Founder Talk Show TBPN for Low Hundreds of Millions in Its First Media Acquisition
OpenAI acquired TBPN, a founder-hosted daily live tech talk show on YouTube and X, for a reported low-hundreds-of-millions figure. The show retains editorial independence but will report to OpenAI's political chief Chris Lehane — marking OpenAI's first media acquisition.
Xoople Raises $130M to Build a Satellite Constellation Feeding AI Training Data
Spanish satellite startup Xoople closed a $130M Series B to build a fleet of spacecraft collecting high-resolution Earth imagery for AI training and inference applications — with a manufacturing partnership with L3Harris for the sensor payloads.
OpenAI Reshuffles Its Leadership: COO Brad Lightcap Moves to Special Projects, CMO on Medical Leave
OpenAI has reshuffled its senior leadership team in a significant reorganization: COO Brad Lightcap has been moved into a 'special projects' role, Chief Marketing Officer Kate Rouch is taking a medical leave of absence for cancer recovery, and Fidji Simo is taking on a new position. The moves signal a shifting internal power structure at the most valuable AI company in the world.
Anthropic Acquires Coefficient Bio for $400M — Its First Move Into Biological AI
Anthropic has acquired Coefficient Bio, a stealth-mode biotech AI startup, in a $400 million all-stock deal—its first significant move beyond pure language AI into biological research. The deal was reported by The Information and journalist Eric Newcomer and marks a major strategic pivot for the safety-focused AI lab.
Alibaba's Qwen 3.6 Plus Arrives With 1M Context, Chain-of-Thought Always On — and It's Free on OpenRouter
Alibaba released Qwen 3.6 Plus with a 1 million token context window, always-on chain-of-thought reasoning, native tool use, and up to 65,536 output tokens — beating Claude 4.5 Opus on Terminal-Bench 2.0 and leading all models on OmniDocBench v1.5. It's available free on OpenRouter as a preview.
Nous Research Ships Hermes Agent: An Open-Source Agent That Writes Its Own Skills After Every Complex Task
Nous Research has released Hermes Agent, an open-source autonomous agent that creates and refines its own skill library through use. It supports 200+ models, runs on a $5 VPS, integrates with Telegram, Discord, Slack, WhatsApp, Signal, and email via a single gateway, and spawns subagents for parallel workstreams. The project has 26k stars and is MIT-licensed.
Microsoft Open-Sources an Agent Governance Toolkit That Covers Every OWASP Agentic Risk — On Day One
Microsoft released the Agent Governance Toolkit on April 2nd — a seven-package, multi-language open-source system delivering sub-millisecond runtime policy enforcement, zero-trust agent identity, and full OWASP Agentic Top 10 coverage. It ships with 9,500+ tests and integrations for 12 agent frameworks including LangChain, CrewAI, and OpenAI Agents.
OpenAI Raises $122 Billion at an $852B Valuation — the Largest Funding Round in Silicon Valley History
OpenAI closed a $122 billion funding round on March 31, 2026 at an $852 billion valuation — the largest in Silicon Valley history. Amazon anchored the round at $50 billion, with Nvidia and SoftBank contributing $30 billion each, and $3 billion came from individual retail investors. OpenAI is generating $2 billion in monthly revenue and serves 900 million weekly active users.
H Company's Holo3 Tops OSWorld at 78.85% — Beating GPT-5.4 at 1/10th the Cost
Paris-based H Company released Holo3, a GUI-specialist VLM that scores 78.85% on OSWorld-Verified — the gold standard for computer-use AI. It outperforms GPT-5.4 Thinking and Claude Opus 4.6 while being significantly cheaper to run, with Apache 2.0 weights available for self-hosting.
Mistral, OmniVoice, and the Race to Own Open-Source AI Voice
This week saw two major open-source TTS releases — Mistral's Voxtral 4B and the k2-fsa team's OmniVoice supporting 600+ languages — signaling that open-weights voice AI is finally catching up to commercial APIs. The race to become the default voice layer for AI agents is accelerating.
Anthropic's Leaked Claude Mythos Is a New Model Tier Above Opus — and They Say It Makes Cyberattacks Much More Likely
A configuration error in Anthropic's content management system exposed ~3,000 unpublished assets, including a draft blog post revealing Claude Mythos — a new model tier the company calls 'a step change in capabilities' and its 'most capable model to date.' Mythos introduces 'Capybara' as a new tier name sitting above Opus. Early access customers are already testing it. In parallel, Anthropic is privately briefing U.S. government officials, warning that Mythos makes large-scale cyberattacks significantly more likely.
NVIDIA and Stanford Open-Source NitroGen: One Model That Plays 1,000+ Games After Watching 40,000 Hours of Human Gameplay
NVIDIA and Stanford's MineDojo team released NitroGen, an open foundation model for generalist gaming agents trained on 40,000 hours of internet gameplay video across 1,000+ games. The 493M parameter Vision Transformer + Diffusion Matching Transformer model takes pixel input and predicts gamepad actions — no hand-crafted rewards, no game-specific code. It transfers to unseen games with up to 52% relative improvement in task success over training from scratch. Dataset, simulator, and weights are fully open-sourced.
Microsoft Launches MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2 — a Direct Shot at OpenAI and Google
Microsoft's MAI Superintelligence team — formed just six months ago under Mustafa Suleiman — shipped three foundational models on April 2: a speech transcription model that beats Whisper and GPT-Transcribe on accuracy, a TTS model that generates a minute of audio in under a second, and an image generation model that debuted third on Arena.ai's leaderboard. All three are available immediately through Microsoft Foundry.
Google Adds Flex and Priority Tiers to Gemini API — Letting Developers Trade Latency for Cost
Alongside the Gemma 4 launch on April 2, Google introduced Flex and Priority inference tiers for the Gemini API. Flex tier is cheaper with variable latency — designed for batch workloads and async agents. Priority tier guarantees low latency for real-time applications. Developers can now explicitly declare which tradeoff they need rather than getting a one-size-fits-all API response.
Microsoft Copilot Is 'For Entertainment Purposes Only,' According to Its Own Terms of Service
A clause in Microsoft Copilot's Terms of Use explicitly labels the product 'for entertainment purposes only' and warns users not to rely on it for consequential decisions — creating obvious tension with Microsoft's enterprise marketing at $30/user/month.
Anthropic Ends Unlimited Claude Code Access for Third-Party Agent Tools Like OpenClaw
Starting April 4, Anthropic's Claude Pro and Max subscriptions no longer cover third-party agent harnesses like OpenClaw. Heavy users were generating $1,000–$5,000/day in API-equivalent costs on flat-rate plans, making the economics unsustainable.
Microsoft Open-Sources VibeVoice: A Full Voice Stack With 60-Min ASR and 90-Min TTS in One Release
Microsoft has released VibeVoice, an open-source family of voice AI models covering both speech recognition and text-to-speech at lengths previously reserved for enterprise APIs. The ASR model processes 60-minute audio in one pass with speaker diarization; the TTS model generates 90 minutes of multi-speaker expressive speech. A lightweight 0.5B streaming variant achieves ~300ms latency.
Meta Ships Llama 4: Open-Weight Multimodal MoE With 10M Context, First to Match Frontier Closed Models
Meta released Llama 4 Scout and Maverick — the first open-weight models with native multimodal understanding, MoE architecture, and a 10M token context window. Maverick benchmarks competitively with GPT-4o and Gemini 2.0 Flash at less than half the active parameters.
Google Launches Gemma 4: Open-Weight Multimodal Models That Run on a Single GPU and Rank Third Globally
Google released Gemma 4 on April 2, 2026 — four open-weight models under Apache 2.0 that handle text, images, video, and audio natively. The 31B Dense model currently ranks #3 on the open model global leaderboard. All four sizes run on consumer and prosumer hardware.
Netflix Open-Sources VOID: An AI That Erases Objects From Video and Rewrites the Physics They Left Behind
Netflix and INSAIT Sofia University released VOID (Video Object and Interaction Deletion) on April 3, 2026 — an Apache 2.0 framework that removes objects from video and automatically regenerates the physical interactions those objects caused, including shadows, reflections, and collision effects.
Researchers Trained mRNA Language Models Across 25 Species for $165 — and Open-Sourced Everything
OpenMed trained CodonRoBERTa, a family of RoBERTa-based language models for codon optimization across 25 organisms, for a total compute cost of approximately $165. The full pipeline — ESMFold, ProteinMPNN, and CodonRoBERTa — is released under Apache 2.0 and enables end-to-end protein engineering for researchers without institutional GPU resources.
Inception Labs Ships Mercury Edit 2 — a Diffusion LLM That May Crack the Speed Wall in AI Coding
Inception Labs has launched Mercury Edit 2, a diffusion language model for next-edit prediction that runs up to 10x faster than autoregressive alternatives like GPT-4o at comparable accuracy. The launch is the clearest proof yet that diffusion-based text models can compete with transformers on real-world coding tasks.
Anthropic Forms Political Action Committee as AI Policy Wars Escalate
Anthropic has established a political action committee, becoming the latest major AI lab to formally enter American electoral politics. The move signals a shift from the company's historically research-and-policy-focused Washington engagement toward direct political spending as AI regulation battles heat up in Congress.
AI Agents Are Finding Real Zero-Days — and Open-Source Maintainers Are Drowning
Security researcher Thomas Ptacek argues AI agents are fundamentally transforming vulnerability research: frontier models can now pattern-match against known bug classes and solve reachability constraints across massive codebases at a speed no human team can match. Simultaneously, open-source maintainers report being overwhelmed by AI-generated bug reports—but unlike last year's 'slop' wave, these reports are increasingly legitimate.
Claude Found 500+ Zero-Days in Open-Source Software — and Now AI Agents Are Drowning Maintainers
Anthropic's Claude Opus 4.6 has discovered over 500 high-severity zero-day vulnerabilities in production open-source software as part of its 'MAD Bugs' initiative running through April 2026. The AI found bugs in well-fuzzed codebases including GhostScript, OpenSC, and CGIF — some lurking for decades. But the same capability that empowers defenders is creating a crisis for volunteer maintainers, who are being flooded with AI-generated security reports they can't process fast enough.
OpenAI's AGI CEO Fidji Simo Takes Medical Leave as COO Brad Lightcap Exits Role
OpenAI disclosed a significant leadership shuffle on April 3, 2026: Fidji Simo, CEO of AGI development, is taking medical leave for several weeks to seek new treatment for a neuroimmune condition; COO Brad Lightcap is shifting to a 'special projects' role overseeing complex deals and investments; and CMO Kate Rouch is stepping down to focus on cancer recovery. President Greg Brockman will oversee product in Simo's absence while Denise Dresser, the former Slack CEO, takes over commercial duties.
Claude Code Found a Linux Kernel Bug Hidden for 23 Years
An Anthropic researcher used Claude Code to discover a 23-year-old remotely exploitable heap buffer overflow in the Linux kernel's NFSv4.0 LOCK replay cache — with hundreds more potential bugs in the pipeline.
LLMs Can Teach Themselves to Code Better With No Teacher, No RL, No Verifier
A new paper from Anthropic researchers shows that simply sampling your own model's outputs and fine-tuning on them boosts code generation pass@1 from 42% to 55% on hard benchmarks — no labels, no reward model, no execution needed.
Google's TurboQuant Compresses LLM Memory to 3 Bits — ICLR 2026 Paper Lands Open Source
Google Research published TurboQuant, an ICLR 2026 paper that compresses the KV cache of LLMs down to 3-4 bits per element with zero retraining — a technique that speeds up LLM inference 8x while cutting memory costs by 50%+. Community implementations in PyTorch and Rust already hit PyPI within days of publication.
Anthropic Cuts Off Third-Party Claude Code Clients — OpenClaw Users Lose Subscription Access
Anthropic announced it will no longer allow Claude Code subscription holders to use their token limits through third-party tools like OpenClaw starting April 4. Users can still access those tools but must pay separately via 'extra usage' billing — a decision that's sparking fierce debate about what a subscription actually entitles you to.
Ollama + Gemma 4 on Mac Mini Is the Local AI Setup Developers Are Actually Using
A community guide for running Ollama with Gemma 4 on Apple Silicon Mac mini has hit 290 points on Hacker News, signaling that local AI inference has crossed a practical threshold for everyday developer use. The setup enables persistent, always-available local AI that integrates with coding agents.
OpenAI Acquires TBPN in First-Ever Media Deal Worth Hundreds of Millions
OpenAI has acquired TBPN (Technology Business Programming Network), a daily 3-hour tech founder talk show hosted by John Coogan and Jordi Hays, in a deal reportedly worth 'low hundreds of millions.' The show hosts tech's biggest names — Zuckerberg, Nadella, Benioff — and marks OpenAI's first foray into media ownership.
Anthropic Acquires Coefficient Bio for ~$400M in Landmark Biotech Bet
Anthropic has acquired Coefficient Bio, an 8-month-old stealth biotech startup backed by Dimension VC, in an all-stock deal worth approximately $400M. The startup built AI for drug R&D planning, clinical regulatory strategy, and drug discovery workflows, and will join Anthropic's Health Care & Life Sciences group.
Microsoft Commits $10 Billion to Japan AI Data Centers in Four-Year Plan
Microsoft has announced a $10 billion, four-year commitment to build AI data centers in Japan, marking one of the largest single-country AI infrastructure investments in the company's history. The investment will fund GPU clusters, networking infrastructure, and cloud expansion, with a focus on training and inference for Japanese enterprises.
Anthropic Finds Emotion Concept Vectors Inside Claude That Change Its Behavior
Anthropic researchers have identified internal 'emotion concept vectors' inside Claude that measurably influence its outputs. By adjusting these vectors — for instance, shifting from a 'desperate' state to a 'calm' one — researchers found they could predict and alter behaviors like cheating propensity, opening a new front in AI interpretability and safety research.
Microsoft Launches Three Proprietary MAI Foundation Models, Breaking From OpenAI
Microsoft unveiled three in-house MAI foundation models — speech transcription, text-to-speech, and image generation — its clearest signal yet that it's building AI infrastructure independent of its OpenAI partnership.
Anthropic Accidentally Published Claude Code's Full Source to npm — and the Internet Forked It Immediately
Anthropic published 512,000 lines of Claude Code's TypeScript source to the public npm registry on March 31, sparking a wave of community forks that became some of the fastest-growing GitHub repositories in history.
Gemini CLI Comes to GitHub Actions — Free AI Code Review and Issue Automation for Any Repo
Google launched Gemini CLI GitHub Actions, bringing its open-source terminal AI agent into CI/CD pipelines as a free autonomous coding teammate. Any public GitHub repository can now automate code review, issue triage, and PR drafting using Gemini 3 models with a 1M token context window — at no cost.
Claude Code Leak Reveals AI Pet and Always-On Background Agent
A source map file accidentally bundled with Claude Code 2.1.88 exposed Anthropic's full TypeScript codebase, revealing two unannounced features: a Tamagotchi-style AI companion and an always-on background agent mode.
Anthropic's Sweeping DMCA Takedowns Hit Thousands of GitHub Repos
Anthropic issued sweeping DMCA-style takedown notices targeting thousands of GitHub repositories in an attempt to suppress leaked source code. The company called it an accident and has since retracted the bulk of the notices.
Windsurf Launches SWE-1: A Model Family Built for Software Engineering
Codeium has introduced SWE-1, SWE-1-lite, and SWE-1-mini — a tiered family of models purpose-built for software engineering workflows and natively integrated into the Windsurf IDE. The company claims SWE-1 matches frontier model performance on real-world coding tasks while offering faster inference and lower operational costs.
OpenAI's GPT-4.1 Brings 1M Token Context and Sharper Instructions
OpenAI has released the GPT-4.1 model family — including Mini and Nano variants — featuring a 1 million token context window, improved instruction-following, and lower API pricing than GPT-4o.
Google Gemini 2.5 Pro Experimental Arrives with Stronger Reasoning
Google has released Gemini 2.5 Pro Experimental, a reasoning-focused model with a 1M token context window that Google claims tops benchmarks in math and coding. It's available now in Google AI Studio and via API.
Cognichip Raises $60M to Let AI Design Its Own Chips
Cognichip has secured $60M in funding to develop AI systems capable of designing the chips that power AI workloads. The company claims its approach can cut chip development costs by over 75% and reduce design timelines by more than half.
Mercor Hit by Cyberattack via Compromised LiteLLM Package
AI recruiting startup Mercor confirmed a data breach after attackers exploited a compromised version of the open source LiteLLM package, marking a notable supply chain attack targeting the AI developer toolchain. An extortion group is reported to have stolen user data through the vulnerability.
Open-Source Claude Code Rewrite Hits 72K Stars in Days — GitHub's Fastest New AI Repo
Claw Code, a clean-room open-source rewrite of Claude Code's agent architecture in Python and Rust, went public and hit 72,000 GitHub stars in its first days — one of the fastest trajectories in open-source AI history. The project gives developers a fully inspectable, multi-provider coding agent harness under the MIT license.
Together AI's Aurora Turns Speculative Decoding Into a Self-Improving System
Together AI released Aurora, an open-source reinforcement learning framework that makes speculative decoding continuously adaptive to live inference traffic. Instead of static offline-trained draft models, Aurora's draft model learns from real production requests — delivering a 1.25x additional speedup on top of already-optimized static speculators.
Claude computer use exits beta — now available to all API users
Anthropic's computer use capability is now generally available, allowing Claude to control desktop applications, navigate websites, and complete multi-step workflows autonomously.
GPT-5 launches with native reasoning and 1M context window
OpenAI releases GPT-5 with built-in chain-of-thought reasoning, a 1M token context window, and dramatically improved coding and math performance.
Gemini 3.1 Flash can now generate AND understand images in one model
Google's Gemini 3.1 Flash Image Preview is the first production model that both generates and understands images natively — no separate image model needed.
Cursor launches Agent Mode — multi-file edits with terminal access
Cursor's new Agent Mode can plan and execute multi-step coding tasks across files, run terminal commands, and iterate based on errors — all from a single prompt.
Gemini 3 Deep Think goes live for Ultra subscribers with early API access
Google's most capable reasoning model is now available to Gemini Ultra subscribers, with API access rolling out to researchers and enterprises. Positioned for hard technical problems — science, engineering, and multi-step analysis — not casual chat.
Apple opens AI SDK for on-device model deployment
Apple releases developer tools for deploying custom AI models on-device across iPhone, iPad, Mac, and Vision Pro with Core ML 6 and the new Apple Intelligence SDK.
Meta releases Llama 4 — open-source model matches GPT-4 on benchmarks
Meta's Llama 4 family achieves GPT-4-class performance across coding, reasoning, and multilingual tasks while remaining fully open-source with commercial licensing.
GitHub Copilot Workspace goes GA — plan, code, and ship from an issue
GitHub's Copilot Workspace is now generally available, turning GitHub Issues into complete implementation plans with AI-generated code changes, tests, and pull requests.
Claude now controls your desktop — points, clicks, and scrolls to complete tasks
Anthropic's Claude can now take over your mouse and keyboard to complete tasks on your desktop. When no API or integration exists, it navigates your screen directly — opening files, clicking buttons, and filling forms.
NVIDIA NIM containers hit 100+ optimized AI models for enterprise
NVIDIA's NIM microservices now include 100+ pre-optimized AI models ready for enterprise deployment with one Docker command.
Stripe launches AI-native billing for usage-based AI products
Stripe introduces billing infrastructure specifically designed for AI products — metering tokens, tracking costs per model, and handling usage-based pricing at scale.
Shopify merchants can now sell directly inside ChatGPT, Gemini, and Copilot
Shopify launches Agentic Storefronts — merchants can sell products directly inside ChatGPT, Google's AI Mode, Microsoft Copilot, and the Gemini app. Pricing, checkout, and inventory all sync from Shopify admin.
Mistral drops Voxtral TTS — open-weight text-to-speech enters the race
Mistral AI releases Voxtral TTS, an open-weight text-to-speech model. Their first major move into audio generation, directly challenging ElevenLabs and OpenAI's TTS offerings with an open-source alternative.
Vercel AI Gateway adds native image generation with Gemini and Flux
Vercel's AI Gateway now supports image generation natively, routing to Google Gemini 3.1 Flash Image Preview, Flux 2, and Imagen 4.0 through a unified API.
Gemini 3.1 Flash-Lite: 2.5x faster at $0.25/M tokens
Google introduces Gemini 3.1 Flash-Lite — an efficiency-focused model delivering 2.5x faster responses and 45% faster output generation at just $0.25 per million input tokens.
Meta launches Small Business AI program for 250M+ businesses on its platforms
Meta announces Small Business — a new AI program targeting 250M+ small businesses on Facebook, Instagram, and WhatsApp. AI tools that give small operators the same advantages as their larger competitors.