Briefs

Daily roundup

Models, tools, enterprise AI, research, policy, product. Distilled to what changes positioning.

Friday, Jun 12, 2026

Jun 12 Anthropic pricing and model card anthropicclaude

RAG is not dead. It just got a smaller job.

A month after Anthropic shipped Claude Opus 4.7 with a 1M-token context window, dev teams are quietly retiring their bespoke retrieval pipelines for the under-800-page case. The interesting question is what RAG keeps now that 'just read the thing' is a viable answer.

Thursday, Jun 11, 2026

Jun 11 Mercedes-Benz apptronikapollo

Mercedes just put Apptronik's Apollo on the Tuscaloosa SUV line

Mercedes-Benz and Apptronik announced Thursday that the Apollo humanoid will deploy to the Tuscaloosa, Alabama plant in early 2027 for parts-kitting and pre-assembly work. Mercedes is already an Apptronik Series A and Series B investor, which is why the deal skipped a competitive bake-off. With Figure at BMW Spartanburg, Atlas at Hyundai Georgia, and now Apollo at Mercedes Tuscaloosa, the German-and-Korean OEMs have locked in three of the four humanoid production lines that exist.

Jun 11 Snowflake snowflakesnowflake-summit

Snowflake's pitch to the enterprise: keep the data, rent the brain

Snowflake announced general availability of Cortex Agents on the final day of Snowflake Summit, letting customers orchestrate Claude Fable 5, Gemini 3 Pro, GPT-5.5, and Snowflake's own Arctic models against data already sitting inside the warehouse. No egress, no copy, governance handled by existing Snowflake row-level security. The shot is aimed squarely at Databricks AI/BI and reframes the model-routing wars as a question of where the data lives.

Jun 11 Reuters samsungsk-hynix

Samsung finally got its HBM4 past Nvidia's qualification cycle

Samsung Electronics confirmed Friday that its 12-high HBM4 modules cleared Nvidia's qualification audit for Vera Rubin Ultra accelerators, ending roughly eighteen months of failed cycles. Samsung will supply Vera Rubin Ultra production volumes starting Q4 alongside SK hynix and Micron. SK hynix shares dropped 4.2% in Seoul trading. Samsung jumped 6.8%. The Korean memory duopoly is back to being a duopoly.

Wednesday, Jun 10, 2026

Jun 10 Apple Developer applefoundation-models

Apple's Foundation Models framework treats Claude and Gemini as plugins

Apple's WWDC 2026 Foundation Models update introduced a public Swift LanguageModel protocol that third-party providers implement to slot into iOS apps. Anthropic's Claude and Google's Gemini conform on day one. Apple's own on-device model conforms too. The developer writes session logic once, and swaps providers by changing a single import line. Apple just made the model the part of the stack it does not need to win.

Jun 10 European Commission eu-ai-actarticle-50

Brussels finalized its deepfake labeling rules. The watermarks that actually work were not included.

The European Commission published the final Code of Practice for AI-generated content on Wednesday, ahead of the Article 50 transparency obligations under the AI Act taking effect August 2. Deepfakes (defined broadly) and AI-generated public-interest text must be visibly marked. Systems on the EU market before August 2 get a transitional window through December 2. New launches after August 2 must ship compliant from day one.

Jun 10 OpenAI openaioracle

OpenAI moved into the Oracle procurement portal

OpenAI and Oracle announced Wednesday that Oracle Cloud Infrastructure customers can apply existing Oracle Universal Credits toward OpenAI models and Codex. Rollout begins in the coming weeks. It is the third hyperscaler distribution channel for OpenAI after Azure and AWS, and the cleanest path yet for an enterprise whose finance team already pre-committed billions to a non-Microsoft cloud.

Tuesday, Jun 9, 2026

Jun 9 Anthropic anthropicclaude

Anthropic put its Mythos model on a leash and shipped it to everybody

Claude Fable 5 launched Tuesday as the first publicly available Mythos-class model. Same underlying model, hard classifiers that bounce cybersecurity, biology, chemistry, and distillation prompts back to Opus 4.8. Priced at $10 in / $50 out per million tokens, twice Opus 4.8. Free inside Pro and Max plans through June 22, then credits-only. Available on the Anthropic API and AWS Bedrock day one.

Jun 9 Yahoo Finance / Reuters intelintel-foundry

Google ordered three million TPUs from Intel, which is now a sentence that makes sense

Intel closed up 11.2% Tuesday after reports that Alphabet placed an Intel Foundry order for more than three million Tensor Processing Units, scheduled for 2028 delivery. AMD added 5.1%, Broadcom 2.8%, and the XLK tech ETF closed up 2.15%. The read on the order: the largest non-NVIDIA AI silicon customer in the world is willing to sole-source a TPU generation through an American foundry, and that foundry is no longer TSMC.

Jun 9 Boston Dynamics boston-dynamicsatlas

Boston Dynamics has already sold every Atlas it will build this year

Every unit of the electric Atlas slated for 2026 production is committed, with the entire run going to Hyundai's Robotics Metaplant Application Center and Google DeepMind. Hyundai is targeting 30,000 units a year by 2028 across Kia and Hyundai assembly lines. DeepMind is integrating Gemini Robotics as Atlas's reasoning layer. The bottleneck for the rest of the humanoid market is now Boston Dynamics's own manufacturing line in Massachusetts.

Monday, Jun 8, 2026

Jun 8 Korea JoongAng Daily nvidiajensen-huang

Jensen Huang spent five days in Korea and came back with six signed partnerships

NVIDIA's CEO swung through Seoul June 4-8 after Computex, met Hyundai, LG, SK, Samsung, and Naver, and left with six announced or signed deals across memory, gigawatt AI cloud, automotive, robotics, and a national R&D center. Korea is no longer just a memory supplier to NVIDIA. It is now a sovereign-AI customer, an OEM partner, and the location of a Seoul-based NVIDIA R&D facility.

Jun 8 Apple Newsroom applewwdc

Apple shipped Siri AI to a slide deck. The actual Siri AI ships 'later this year.'

WWDC 2026 announced Siri AI, branded it, gave it an app, ran the sizzle reel, then put it in beta 'later this year' on supported devices in English only. The rest of Apple Intelligence is in the developer beta as of Monday. The EU and China are excluded. It is Tim Cook's last WWDC, with John Ternus taking over September 1.

Jun 8 Bloomberg uberwayve

Uber and Wayve quietly opened London's robotaxi waitlist, and Waymo is right behind them

Uber added a Wayve interest list to the London app on Monday. Launch is 'in the next couple of months,' contingent on UK regulators finalizing the framework. Fleet is Ford Mustang Mach-E EVs with a human safety driver in the early phase. Waymo is also entering London, setting up Europe's first head-to-head robotaxi market between two materially different stacks.

Sunday, Jun 7, 2026

Jun 7 Federal News Network anthropicpentagon

The Anthropic-Pentagon court fight is now genuinely about whether labs are allowed to say no

A DC Circuit panel heard oral arguments May 19 on whether the Pentagon can blacklist Anthropic for refusing to drop its red lines on autonomous weapons and domestic surveillance. The panel was visibly split. A California judge already sided with Anthropic. The DC appeals court denied a stay.

Jun 7 Microsoft Foundry Blog microsoftfoundry

Microsoft wants 'hosted agents' to be the new containers, which is the kind of thing Microsoft says

Microsoft Build introduced Hosted Agents in Foundry Agent Service: per-session sandboxes with isolated compute, memory, and filesystem, framework-agnostic, general availability targeted for early July. The pitch is that agents need their own runtime primitive the way cloud-native workloads needed containers. The pitch is also kind of true.

Jun 7 Figure AI figurefigure-02

The first humanoid-robot deployment numbers anyone can verify are in, and they are pretty good

Figure's robots ran ten-hour shifts at BMW's Spartanburg plant for eleven months, contributed to over 30,000 X3 vehicles, loaded 90,000 parts, and held accuracy above 99 percent. BMW is now expanding the deployment to Figure 03 and to a second plant. This is the first humanoid-robot story with actual operating data attached.

Friday, Jun 5, 2026

Jun 5 AppleInsider: iOS 27, macOS 27, Siri preview applewwdc

WWDC opens Monday, and the Siri 2.0 test is whether it ships in the developer beta the same afternoon

The WWDC 2026 keynote is June 8 at 10am Pacific. The credibility test for Apple Intelligence is whether the rebuilt Siri lands in the iOS 27 developer beta that day, not 'rolling out throughout the year.' The leaks say a custom 1.2 trillion parameter Gemini model is doing the cloud reasoning under the hood for roughly $1 billion a year.

Thursday, Jun 4, 2026

Jun 4 Global Banking & Finance intelfoxconn

Intel found a friend at Computex, and his name is Foxconn

Foxconn and Intel announced a strategic collaboration on next-generation rack-scale AI infrastructure at Computex 2026, combining Xeon CPUs and Intel's AI accelerator line with Foxconn's manufacturing and systems integration. No timeline, no customers, no dollar figure.

Jun 4 Cybersecurity Dive trumpexecutive-order

Trump signed the AI security order he pulled three weeks ago, with the review window cut from 90 days to 30

The 'Promoting Advanced Artificial Intelligence Innovation and Security' executive order asks AI labs to voluntarily hand over frontier models for up to 30 days of government review before release. DHS, Treasury, NIST, and the National Cyber Director's office have 60 days to define which models qualify.

Jun 4 American Bazaar tsmcc-c-wei

TSMC's CEO would like Nvidia and AMD to know that he is already working very hard, thank you

C.C. Wei said on Wednesday that AI chip demand from Nvidia, AMD, and Apple continues to outrun TSMC's capacity, and the company is expanding fabs in Taiwan, the US, and Japan to catch up. Leading-edge capacity remains the bottleneck on AI compute.

Wednesday, Jun 3, 2026

Jun 3 Council of the European Union eu-ai-actdigital-omnibus

The EU AI Act got a 16-month homework extension, except for the part that bans nudifier apps

Brussels' Digital Omnibus on AI delays the high-risk compliance regime to December 2027 while keeping a new prohibition on AI-generated non-consensual intimate imagery and CSAM on a December 2026 enforcement date.

Jun 3 Nvidia Newsroom nvidiaunitree

Nvidia shipped a humanoid reference robot, and it has more finger joints than the engineer reviewing this spec

GTC Taipei produced the Isaac GR00T Reference Humanoid Robot, a Unitree H2 Plus chassis with Sharpa five-finger hands, Jetson Thor on board, and 75 total degrees of freedom. Available from Unitree in late 2026.

Jun 3 Anthropic anthropicclaude-partner-network

Anthropic gave its consultant army formal rank insignia, and the top tier wants 1,000 certified practitioners

The Claude Partner Network's new Services Track formalizes consulting partners into Select, Preferred, and Global Premier tiers with hard numerical floors on staff, deployments, and public stories. Accenture, Cognizant, Deloitte, and KPMG are all in scope.

Tuesday, Jun 2, 2026

Jun 2 Microsoft AI microsoftmai-thinking-1

Microsoft built its own reasoning model from scratch, no OpenAI in the training set

MAI-Thinking-1 landed at Build 2026 as Microsoft's first in-house reasoning model, a 35B-active-parameter sparse MoE trained on commercially licensed data with zero distillation from anyone else's outputs.

Jun 2 The GitHub Blog githubcopilot

GitHub Copilot grew a desktop app, and it wants to merge your pull requests for you

GitHub used Build 2026 to graduate the Copilot desktop app from technical preview, with parallel agent sessions in auto-managed git worktrees and an Agent Merge mode that drives CI, reviewers, and final merge.

Jun 2 Nvidia Newsroom nvidiatsmc

Nvidia is now helping TSMC make Nvidia chips, which is either elegant or vaguely ouroboros-shaped

TSMC committed to using Nvidia's accelerated computing across lithography, materials simulation, defect detection, and an Omniverse-built virtual fab called FabTwin. The loop where Nvidia silicon helps fabricate Nvidia silicon is now official.

Monday, Jun 1, 2026

Jun 1 AWS News Blog openaiaws

GPT-5.5 and Codex hit GA on AWS Bedrock, four days after Anthropic's Mythos got the same shelf space

AWS announced general availability of OpenAI GPT-5.5, GPT-5.4, and Codex on Amazon Bedrock on Monday, at the same per-token rates OpenAI charges direct. Bedrock is now the neutral procurement front door for both frontier labs.

Jun 1 CNBC anthropicipo

Anthropic dropped a confidential S-1 on the SEC, beating OpenAI to the IPO desk by a few weeks

Anthropic confidentially submitted a draft Form S-1 to the SEC on Monday, putting it on the on-ramp for a 2026 or 2027 IPO. OpenAI's matching filing is reportedly in the next couple of weeks.

Jun 1 Tom's Hardware nvidiacomputex

Nvidia put a 6,144-core Blackwell GPU and a 20-core Arm CPU on one die, and it ships in Dell laptops this fall

Jensen Huang used the Computex keynote to unveil the RTX Spark Superchip, a single-package Arm CPU plus Blackwell GPU with 128GB of unified memory. Ships in fall 2026 Windows laptops from Dell, HP, Lenovo, Asus, MSI, and a Microsoft Surface Ultra.

Sunday, May 31, 2026

May 31 The Robot Report 1xneo

1X opened preorders for a $20,000 home humanoid, and the deposit to lock yours in is $200

The Norwegian humanoid company 1X started taking consumer preorders for NEO, priced at $20,000 outright or $499 per month on subscription. First deliveries land in 2026, U.S. only, and yes, there are configurable no-go zones.

May 31 TrendForce tsmc3nm

TSMC quietly told 3nm customers to expect a 15% price hike in H2, and the customers booked more capacity anyway

Trendforce reports TSMC has signaled price increases of up to 15% on 3nm in the second half of 2026, with another 5 to 10% projected for 2027. Apple, Nvidia, AMD, and Broadcom all eat the increase, and capacity is still fully booked through year-end.

May 31 Fortune anthropicfunding

Anthropic raised $65 billion at a $965 billion valuation, and three of the new investors make memory chips

The Series H closed Thursday with Samsung, SK Hynix, and Micron sitting on the cap table next to the usual hedge funds and hyperscalers. Anthropic also shipped Claude Opus 4.8 the same day. The valuation now leapfrogs OpenAI's $852 billion mark from March.

Friday, May 29, 2026

May 29 AWS Bedrock Documentation anthropicclaude-mythos

AWS quietly listed Claude Mythos Preview on Bedrock 18 hours after the Glasswing report, which is a launch in everything but the press release

Mythos showed up in the Bedrock model catalog Friday morning as a preview tier, available in us-east-1 and us-west-2, with throughput capped and an enterprise-only allowlist. There was no Anthropic blog post, no AWS keynote, and no analyst briefing. Just a console update and a model card.

Thursday, May 28, 2026

May 28 Anthropic anthropicclaude-mythos

Anthropic let a not-yet-released Claude model loose on open-source code and it found 10,000 vulnerabilities including a 16-year-old FFmpeg bug

Project Glasswing partners ran Claude Mythos Preview against critical infrastructure and surfaced 23,019 issues, 6,202 of them high or critical severity, with a 90 percent true-positive rate when independent security firms went back and audited. The white-hat AI bug hunter just stopped being theoretical.

May 28 OpenAI Help Center openaixai

OpenAI, xAI, and Anthropic all now call their customizable-AI primitive 'Skills,' which is either the most boring or the most important standardization of the year

xAI shipped Custom Skills for Grok on May 26. OpenAI shipped governance controls for Skills in ChatGPT Enterprise on May 27. Anthropic has had Skills in Managed Agents since March. Three frontier labs picking the same noun for the same primitive is rarely an accident.

May 28 TechCrunch cognitiondevin

Cognition raised a billion at $26B, doubled its valuation in eight months, and the only independent coding agent left standing is now priced like one

Devin's maker closed a $1B round led by Lux Capital, General Catalyst, and 8VC at a $26 billion post-money valuation, more than double its September mark. After Cursor merged with Windsurf and Karpathy migrated to Anthropic, Cognition is the last large independent in the coding-agent category, and the round is priced accordingly.

Wednesday, May 27, 2026

May 27 DigiTimes Asia agibothumanoids

China's Agibot says its humanoid hit a 100 percent factory success rate, and the open-source pitch is the real news

Agibot livestreamed a factory deployment last week claiming 100 percent task success and pushed a coordinated open-source release: Link OS, Genie Studio, and a robotics dataset large enough to actually move the needle. The US humanoid plays are vertically integrated. The Chinese pitch is starting to look like Android.

May 27 Hugging Face mistralopen-weights

Mistral shipped a 128B open-weight model that opens its own pull requests, and the SWE-Bench number is two points off Claude

Mistral Medium 3.5 landed under a modified MIT license: 128B dense parameters, 256k context, multimodal, 77.6 percent on SWE-Bench Verified. A coding agent that drafts and submits PRs ships in the same box. The French open-weights pitch finally has a coding number worth bragging about.

May 27 ERP Today servicenownvidia

ServiceNow and Nvidia announced an agent that runs on your desktop in a sandbox you can audit, which is a less fun pitch than 'agentic AI for everyone'

Project Arc, unveiled at Knowledge 2026, is a desktop agent built on ServiceNow Action Fabric, secured by Nvidia OpenShell, and governed through ServiceNow's AI Control Tower. The interesting part is not the agent. It is that the company finally chose to lead with the governance plumbing instead of the demo.

Tuesday, May 26, 2026

May 26 The Robot Report roboticshumanoids

The Robotics Summit opened in Boston this morning, and humanoids have quietly stopped being a demo

The 2026 Robotics Summit & Expo kicked off May 27 with a 'State of Humanoids' keynote panel and a set of milestones that are no longer projections. Agility's Digit just crossed 100,000 totes in live commercial work, Boston Dynamics has sold out its entire 2026 Atlas production run, and the room is starting to talk like a manufacturing industry instead of a research field.

May 26 South China Morning Post chinahuawei

China certified nine domestic AI chips for government procurement, and Nvidia is not on the list

Beijing's security evaluation centers added a new 'AI training and inference chips' category to the national procurement allowlist this week and stamped nine homegrown processors, including Huawei Ascend, with Level I clearance. The list defines which silicon government agencies and state-owned enterprises can actually buy.

May 26 Tom's Hardware nvidiaasic

Custom AI ASICs are growing faster than Nvidia GPUs for the first time, and the hyperscalers built the wedge themselves

TrendForce projects 44.6 percent shipment growth for custom AI ASICs in 2026 versus 16.1 percent for merchant GPUs. Counterpoint says global ASIC shipments will triple from 2024 to 2027. Google TPU, AWS Trainium, Microsoft Maia, and Meta MTIA are all hitting full production at the same time, and their procurement budgets are quietly diverting away from the company that sold them last cycle's gear.

Monday, May 25, 2026

May 25 Anthropic anthropicmcp

Anthropic shipped the agent features that let enterprise security teams stop saying no

Two new Claude Managed Agents features (self-hosted sandboxes in public beta, MCP tunnels in research preview) let enterprise customers keep their data and tool execution inside their own perimeter. Translation: Anthropic just removed the two objections every CISO has been using to block agent rollouts.

May 25 Fortune nexteradominion

NextEra is buying Dominion for $67 billion, and the strategy slide is essentially 'we power the data centers now'

NextEra Energy agreed to a $67 billion acquisition of Dominion Energy on May 18, creating the largest utility in the world. The combined company controls the grid serving Northern Virginia's data center alley, which is where most of America's AI compute already plugs in.

May 25 Time vaticanpope-leo-xiv

Anthropic's co-founder stood next to the Pope at the Vatican yesterday, which is a real sentence we can now write

Pope Leo XIV released his first encyclical, Magnifica Humanitas, on safeguarding the human person in the age of AI. He chose to present it alongside Anthropic co-founder Chris Olah, which is the Vatican saying out loud which lab it thinks is the serious one.

Sunday, May 24, 2026

May 24 Bloomberg anthropicfinance

Anthropic told investors it will post its first operating profit this quarter, with several large asterisks

Anthropic projected $10.9 billion in Q2 revenue and a $559 million operating profit, its first ever, in materials shared with investors. The number is real, the framing is convenient, and the compute bill is right behind it.

May 24 Tech Times nvidiatsmc

Jensen flew to Taiwan over the holiday weekend because the bottleneck on the next Nvidia chip is a glue problem

Jensen Huang spent Memorial Day weekend meeting with TSMC's chairman about advanced packaging capacity for Vera Rubin, the platform Huang has publicly called the largest product launch in the history of Taiwan.

May 24 OpenAI openaimathematics

OpenAI's model disproved an 80-year-old geometry conjecture, and the math department checked the homework

An OpenAI reasoning model autonomously disproved Erdős's unit distance conjecture from 1946, and nine mathematicians including Fields medalist Tim Gowers wrote a companion paper confirming the proof holds up.

Friday, May 22, 2026

May 22 Fortune openaiipo

OpenAI filed for an IPO, which means the most secretive AI company has to put its real numbers on a PDF

OpenAI confidentially filed IPO paperwork that could pave the way for a public listing as soon as September, at a valuation up to $1 trillion. The S-1 is going to be one of the most consequential disclosures the AI industry has ever produced.

Thursday, May 21, 2026

May 21 Anthropic anthropicstainless

Anthropic bought the SDK shop that every AI lab uses, then started turning the lights off

Anthropic acquired developer tools startup Stainless for a reported $300 million plus and announced it is winding down the hosted SDK generator that OpenAI, Google, and Cloudflare also depend on.

May 21 Apple Newsroom applewwdc

Apple set WWDC for June 8, and the headline rumor is a Siri quietly powered by Gemini

Apple confirmed its Worldwide Developers Conference for the week of June 8, with the long-promised Siri overhaul expected as the keynote's centerpiece and reports pointing to a Google Gemini partnership running under the hood.

May 21 CNBC trumpexecutive-order

Trump pulled his AI cybersecurity order at the last minute because he didn't want to slow down the team

The White House postponed Thursday's signing of an AI cybersecurity executive order that would have given federal agencies a 90-day early look at frontier models, with the president saying he didn't want anything getting in the way of America's lead.

Wednesday, May 20, 2026

May 20 Boston Dynamics boston-dynamicsatlas

Boston Dynamics' Atlas can lift a 100-pound load it never trained on, and Hyundai wants 30,000 a year

A new technical writeup shows Atlas executing industrial lifts with a single whole-body control policy trained almost entirely in simulation, with zero-shot transfer to weights beyond the training distribution.

May 20 TechCrunch anthropicopenai

Karpathy is at Anthropic now, in case anyone needed another sign of where the talent gravity is pointing

Andrej Karpathy started this week on Anthropic's pre-training team, his second high-profile AI lab move since leaving Tesla, with a mandate to use Claude to accelerate Claude.

May 20 OpenAI openaichatgpt

ChatGPT is now an ad platform you can buy into with a credit card, on the road to a targeted $100B a year

OpenAI's self-serve Ads Manager has opened to all US businesses with CPC bidding and no minimum spend, the operational scaffolding behind a $2.5B-this-year and $100B-by-2030 revenue target.

Tuesday, May 19, 2026

May 19 AMD amdmi450

AMD's MI450 hits customer sampling, with twelve gigawatts of OpenAI and Meta capacity waiting

AMD has begun sampling its MI450 AI accelerator with twelve gigawatts of OpenAI and Meta deployment slated for the back half of 2026, which means Nvidia's first credible second source may actually ship this year.

May 19 Figure AI figurehumanoid

A humanoid robot helped build 30,000 BMWs, then quietly retired

Figure AI's Figure 02 wrapped its 11-month BMW Spartanburg pilot with 1,250 operational hours and a 99 percent sheet-fitting success rate, then got put out to pasture.

May 19 CNBC googlegemini

Google ships an agent called Spark, a world model called Omni, and a cheaper Flash for the back row

Google I/O 2026 lands with a general-purpose agent, a world model demo, and a frontier-class Flash tier at roughly a third of competitor pricing.

Sunday, May 17, 2026

May 17 Anthropic anthropicclaude

Claude Opus 4.7 grows a 1M-token brain, mostly to read your repo

Anthropic catches Gemini on the context-window leaderboard. The interesting part is what happens to long-document pipelines when the price of remembering everything stops being absurd.

Saturday, May 16, 2026

May 16 OpenAI openaicodex

OpenAI bills Microsoft and Stripe per merged PR, sees how that goes

Codex moves from limited preview to named enterprise rollouts. The pricing model, billed per successful task, is either the future of AI procurement or a slow-motion CFO revolt.

Friday, May 15, 2026

May 15 European Commission eu-ai-actregulation

EU AI Act final rules drop, frontier labs discover they have to show their homework

Final implementing acts published, 12-month compliance window opens. The training-data summary requirement is the part the labs were hoping nobody would actually enforce.

Thursday, May 14, 2026

May 14 TechCrunch cursorwindsurf

Cursor and Windsurf give up pretending it was a two-horse race, merge

$2.1B all-stock deal combines the two leading AI-native IDEs. The actual story is that VS Code plus Copilot apparently leaves room for exactly one independent.

Tuesday, May 12, 2026

May 12 Meta AI metallama

Llama 4.5 lands, Meta keeps quietly funding the open-weight resistance

600B sparse MoE, 17B active parameters, frontier-adjacent evals. The architecture choice means a single 8x H200 node can serve it, which is the actual headline for everyone outside Twitter.

Monday, May 11, 2026

May 11 White House white-houseexecutive-order

Two White Houses, two AI executive orders, one NIST that just keeps going

Biden's 2023 order got partially undone, partially renamed, and partially absorbed. The durable institutional changes are quietly more important than the press cycle around either order.

Sunday, May 10, 2026

May 10 Salesforce salesforceagentforce

Salesforce charges $2 per resolved conversation, dares CFOs to do the math

Agentforce 2.0 hits GA with ServiceNow and HSBC as named customers. Per-resolved-conversation pricing is either the cleanest unit economics in enterprise AI or the boldest negotiating opener of the year.

Saturday, May 9, 2026

May 9 AI Brief github-copilotfine-tuning

GitHub Copilot vs. internal fine-tuned models in large engineering orgs

Large engineering organizations are increasingly running side-by-side evaluations of GitHub Copilot against internally fine-tuned code models. The results depend heavily on codebase characteristics.

Friday, May 8, 2026

May 8 OpenAI openaigovernance

OpenAI's safety board restructuring and what it signals about frontier lab governance

OpenAI's board has been restructured twice in 18 months. The pattern reveals something about how frontier AI labs are resolving the tension between governance and commercial velocity.

May 8 Vercel vercelai-sdk

Vercel ships an agent loop, retires roughly 8,000 lines of glue code per repo

AI SDK 5.0 adds a first-class agent runtime with resumability. The framework that already owned the React-side LLM integration now wants the server-side too.

Wednesday, May 6, 2026

May 6 AI Brief ragenterprise

Why 70% of enterprise RAG pilots don't make it to production

RAG system failure in enterprise pilots follows predictable patterns. The problems are rarely the retrieval architecture -- they're organizational and operational.

Tuesday, May 5, 2026

May 5 NIST nistai-rmf

NIST AI Risk Management Framework 1.1 update and enterprise compliance implications

NIST's AI RMF 1.1 release adds specificity on generative AI risk categories, giving enterprise compliance teams their most concrete US-government guidance to date.

May 5 California Legislature californiaab-2013

California signs AB-2013 training-data disclosure into law

First US state to mandate training-data summaries for generative AI products sold to California customers. Effective January 2027; the disclosure template is the binding detail.

Sunday, May 3, 2026

May 3 arXiv deepmindevals

DeepMind benchmarks long-horizon agents, frontier models clear 30%

New benchmark stresses multi-day tasks with realistic tools. Top closed models score under 30%. The gap between demo-day agents and production agents finally has a number on it.

Saturday, May 2, 2026

May 2 AI Brief goldman-sachsenterprise

Goldman Sachs' internal LLM deployment and what it tells us about enterprise AI ops

Details of Goldman's internal AI stack have surfaced through job postings, conference talks, and industry reporting. The architecture choices reflect constraints specific to regulated financial services.

Friday, May 1, 2026

May 1 European Commission eu-ai-actregulation

The EU AI Act's high-risk classification: what it means for US companies selling into Europe

The AI Act's Annex III high-risk categories have specific compliance obligations that many US AI companies are not operationally ready to meet. The 2-year compliance window is shorter than it sounds.

Monday, Apr 27, 2026

Apr 27 Klarna klarnacustomer-service

Klarna's '700 agents replaced by AI' headline, audited 18 months later

The most-cited datapoint in enterprise AI deployment turns out to be technically true and rhetorically misleading. The actual deployment is more interesting than the press release.

Sunday, Apr 19, 2026

Apr 19 AI Brief portkeyhelicone

The rise of AI gateway middleware: what Portkey, Helicone, and Braintrust are solving

A new middleware category is crystallizing around the need to manage, observe, and optimize traffic across multiple LLM providers. These products solve real production problems that individual API integrations cannot.

Saturday, Apr 11, 2026

Apr 11 Hugging Face hugging-faceinference

Hugging Face's Inference API pricing changes and the open-source model hosting market

Hugging Face restructured its Inference API pricing to reflect actual GPU costs, ending the free-tier economics that subsidized most experimental deployments. The change signals a maturation of the open-source model hosting market.