AI News

AI moves fast. Here are some of the latest updates and developments worth knowing about.

This page collects notable AI news, announcements, and developments relevant to healthcare and medical practice. Items are listed newest first, each with its publication date and the date it was added to this list.


Published: May 19, 2026 Added: May 25, 2026

Google I/O 2026: Gemini 3.5 Flash for All Users, Gemini Omni Flash Multimodal

Google’s I/O 2026 keynote shipped Gemini 3.5 Flash to every Gemini user, while Plus/Pro/Ultra subscribers got Gemini Omni Flash, a new multimodal model. 3.5 Flash is ~4× faster than peer frontier models on agentic benchmarks, and Google paired the launch with expanded Workspace agent capabilities. Gemini 3.5 Pro is expected in June. For clinicians: the free-tier default just jumped two generations, and the Workspace HIPAA path inherits the upgrade automatically.

Published: May 18, 2026 Added: May 25, 2026

NEJM AI Afshar RCT: Ambient AI Cuts Exhaustion, Not Fulfillment

A 24-week stepped-wedge pragmatic RCT of 66 health-care practitioners found ambient AI significantly reduced work exhaustion and interpersonal disengagement—but did not significantly improve professional fulfillment. Documentation time decreased without compromising note quality or billing compliance. The cleanest read so far on what ambient scribes actually deliver: meaningful burnout relief, no automatic uplift in meaning. Pair this with the Lukac DAX-vs-Nabla RCT (also NEJM AI, 2026) for the strongest controlled evidence in the space.

Published: May 15, 2026 Added: May 25, 2026

Anthropic Launches Claude for Small Business; Code w/ Claude Developer Conference

Anthropic shipped Claude for Small Business—a workspace-style plan for SMB teams that want Claude without enterprise procurement—and held its inaugural Code w/ Claude developer conference in San Francisco. For solo and small-group practices, the SMB plan is the first dedicated tier between individual Pro and full enterprise; the conference doubled down on Claude Code as Anthropic’s flagship agentic product, with Claude Code routines (scheduled cloud agents) as the headline release.

Published: May 14, 2026 Added: May 25, 2026

OpenAI Codex Mobile Preview Ships in ChatGPT App

OpenAI’s Codex coding agent is now available as a preview inside the ChatGPT mobile app. Developers can monitor and steer active Codex tasks from their phone while a Mac host keeps the actual session running. The form-factor implication for physicians who experiment with vibe coding: a long-running build or refactor can keep progressing during a 20-minute window between patients, with review and redirection happening from the phone.

Published: May 13, 2026 Added: May 25, 2026

athenahealth Launches athenaAmbient—Free Ambient Scribe for athenaOne Customers

athenahealth bundled an ambient scribe (athenaAmbient) into athenaOne at no additional cost, joining Epic Art, Microsoft Dragon Copilot, and Oracle Cerner in the “ambient as a default EHR feature” camp. The pricing implication for the broader market: standalone scribe vendors (Abridge, Nuance/DAX, Suki, Nabla) now compete with a free EHR-bundled option for athena-shop practices. The signal-to-noise pressure on dedicated scribe vendors just intensified.

Published: May 5, 2026 Added: May 11, 2026

Perplexity × VisualDx: Clinician-Validated Medical Imagery in AI Search

Perplexity integrated VisualDx’s clinician-validated medical image library, including diverse skin tones across Fitzpatrick types I–VI, directly into its health answers. VisualDx is already used in 2,300+ hospitals; embedding it in a general AI search engine is the first time peer-reviewed clinical imagery has been wired into a consumer-grade answer engine. Free for Pro and Max subscribers. A meaningful step toward reducing the diagnostic-image equity gap that has dogged dermatology AI.

Published: Apr 30, 2026 Added: May 11, 2026

Harvard/OpenAI Study: AI Outperforms Physicians in Real-World Diagnosis

A real-world evaluation by Harvard and OpenAI found an AI model outperformed doctors at diagnosing patients across a representative sample of presenting complaints. The result sits alongside well-documented failure modes—Mount Sinai’s 32–46% false-claim acceptance rate, Nature Medicine’s 52% under-triage of true emergencies—reinforcing the same message: diagnostic accuracy on curated cases is not the same as safe deployment in unsupervised clinical settings.

Published: Apr 29, 2026 Added: May 11, 2026

NEJM Retracts Case Study Over AI-Manipulated Clinical Image

NEJM retracted a published case study after determining authors used AI to superimpose a ruler onto a clinical photograph, violating NEJM image integrity policy. The first NEJM retraction over image manipulation since the 2020 Surgisphere scandal. The output of generative image-editing tools is now hard enough to distinguish from photography that journals are updating submission policies in real time.

Published: Apr 24, 2026 Added: May 11, 2026

DeepSeek Releases V4 Flash and V4 Pro Open-Source Under MIT License

DeepSeek shipped V4 Flash and V4 Pro under an MIT license—1M-token context at $1.74 per million input tokens, roughly 1/20th the cost of frontier proprietary models. For healthcare deployments that need on-prem inference (HIPAA, data residency, network isolation), this is the strongest open-source contender to date. Pair with the Llama 4 and Muse Spark coverage in our Running AI Models Locally module for the broader open-vs-proprietary picture.

Published: Apr 23, 2026 Added: May 11, 2026

OpenAI Ships GPT-5.5 and GPT-5.5 Pro; Becomes ChatGPT Default (May 5)

OpenAI released GPT-5.5 and GPT-5.5 Pro on Apr 23–24 with 1M-token context windows. GPT-5.5 Instant became the default ChatGPT model on May 5. OpenAI’s internal evals report 52% fewer hallucinated claims on high-stakes medical, legal, and financial prompts versus GPT-5.4. Note “internal evals”—the meaningful test is whether independent benchmarks reproduce the gain. If you’ve tuned workflows or Custom GPTs to GPT-5.4 behavior, retest.

Published: Apr 23, 2026 Added: May 11, 2026

AMA to Congress: Federal Guardrails for Health AI Chatbots

The AMA sent letters to three congressional committees calling for an FDA device-review pathway, transparency rules, cybersecurity standards, and advertising limits on health AI chatbots. The letters cite mental health chatbot harms, unmediated symptom-checker use, and the absence of federal authority over wellness AI products that operate just outside the current FDA medical device definition. A meaningful shift from physician societies treating AI as a clinician productivity issue to treating it as a patient safety issue.

Published: Apr 21, 2026 Added: May 11, 2026

OpenAI Launches ChatGPT Images 2.0

ChatGPT Images 2.0 launched Apr 21 and hit #1 on Image Arena within 12 hours. Available on all plans including Free. Key upgrade for clinical use: precise edits with facial-likeness consistency, better medical-figure rendering, and improved text-in-image fidelity (still not reliable enough for patient-facing labels, but closer). See the AI Image and Video Creation module for the full tier breakdown.

Published: Apr 13, 2026 Added: May 11, 2026

ECRI 2026: AI Chatbot Misuse Tops Health Tech Hazard List

ECRI’s 2026 Top 10 Health Technology Hazards report placed AI chatbot misuse at #1—above alarm fatigue, infusion pump errors, and surgical-device failures that have anchored this list for a decade. The framing emphasizes patients acting on unmediated chatbot advice, clinicians relying on chatbot output without source review, and health systems deploying patient-facing chatbots without governance. ECRI carries weight with hospital safety committees; expect this to drive 2026 AI procurement reviews.

Published: Apr 8–13, 2026 Added: May 11, 2026

Ambient Scribe Upcoding Crisis: JAMA, npj Digital Medicine, PHTI Reports

A coordinated wave of reporting—JAMA Health Forum, npj Digital Medicine, STAT, and a PHTI report—documented that ambient AI scribe adoption is associated with 12–18% growth in claim intensity. High-intensity E/M coding is rising in scribe-adopting practices. JAMA framed it as a “coding arms race.” The productivity gain (13–16 min/day saved) is real—the question is whether the documentation thoroughness it enables is appropriate billing or upcoding.

Published: Apr 8, 2026 Added: May 11, 2026

Meta Releases Muse Spark: First Proprietary Frontier Model

Meta launched Muse Spark, its first proprietary frontier model, alongside the open-weight Llama 4 line. The split signals a strategic shift: Meta retains a closed flagship for premium use cases while continuing to release open weights for the broader ecosystem. Relevant to healthcare deployments choosing between API access to closed models and on-prem deployment of open ones. See our updated Running AI Models Locally module.

Published: Apr 2026 Added: May 11, 2026

KFF Tracking Poll: 1-in-3 US Adults Use AI for Health Information

KFF found that 1 in 3 US adults used an AI chatbot for health information in the past year, equal to the share using social media for health. 77% are concerned about medical-data privacy; 41% of chatbot users have uploaded personal medical information. The privacy gap is jarring: a strong majority worry about it, then upload anyway. Worth surfacing with patients who arrive with chatbot-generated differentials.

Published: Apr 16, 2026 Added: Apr 21, 2026

Claude Opus 4.7 Launches with Same-Day GPT-5.4 and Gemini 3 Releases

Anthropic, OpenAI, and Google all shipped flagship model updates within hours of each other on April 16. Opus 4.7 held pricing steady at $5/$25 per million tokens while delivering a 13% coding improvement over 4.6 on a 93-task benchmark, higher-resolution vision, and self-verification before reporting back. Available on API, Bedrock, Vertex AI, and Microsoft Foundry. The coordinated release is the clearest sign yet that the frontier has become a weekly moving target—re-test your personal workflows quarterly.
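To make the per-token pricing concrete, here is a minimal sketch of the arithmetic. It assumes "$5/$25 per million tokens" means $5 per million input tokens and $25 per million output tokens (the usual convention, though the item above does not spell it out). The function name and the token counts are hypothetical, chosen only for illustration.

```python
# Assumption: "$5/$25" is read as USD per million input / output tokens.
INPUT_USD_PER_MTOK = 5.00
OUTPUT_USD_PER_MTOK = 25.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the API cost in USD for a single request (hypothetical helper)."""
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


# Made-up request size: a 20,000-token prompt with a 1,000-token reply.
cost = estimate_cost_usd(20_000, 1_000)
print(f"Estimated cost: ${cost:.3f}")
```

Under these assumptions, output tokens cost five times as much as input tokens, so long generated responses can dominate the bill even when the prompt is much larger.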

Published: Apr 14–16, 2026 Added: Apr 21, 2026

OpenAI Ships GPT-5.4, GPT-5.4-Cyber, and GPT-Rosalind

OpenAI released GPT-5.4 (Apr 16) as its most capable frontier model for professional work, GPT-5.4-Cyber (Apr 14) for vetted security professionals, and GPT-Rosalind—a life-sciences reasoning model optimized for molecules, proteins, genes, pathways, and disease biology. Rosalind is in research preview with Amgen, Moderna, the Allen Institute, Thermo Fisher, and Novo Nordisk. A specialist biomedical model from a frontier lab is new territory worth watching.

Published: Apr 16, 2026 Added: Apr 21, 2026

Gemini 3 Arrives: Flash as Default, Plus Agent and Deep Think

Google made Gemini 3 Flash the default model in the Gemini app, with Gemini Agent (multi-step task execution across Workspace, Deep Research, Canvas, and live web) and Gemini 3 Deep Think available to Google AI Ultra subscribers. Grounding with Google Maps is now supported. Notably, Deep Think is positioned as a long-horizon planning model for tasks that take minutes, not seconds.

Published: Apr 16, 2026 Added: Apr 21, 2026

NEJM AI Publishes ChexGen: Generative Foundation Model for Chest Radiographs

A vision-language foundation model for chest radiographs supporting text-, mask-, and bounding-box–guided image synthesis. Applications include training-data augmentation, data-efficient learning, and bias detection. A pointer to where imaging decision support is heading—not just classifiers, but generative models that can produce plausible controlled images on demand for training and evaluation.

Published: Apr 15, 2026 Added: Apr 21, 2026

Abridge Embeds NEJM and JAMA Content Directly into Clinical Decision Support

Abridge announced content partnerships with NEJM Group and the AMA covering JAMA plus 11 specialty journals and JAMA Network Open. The peer-reviewed content now feeds Abridge's CDS engine, which is grounded in the actual patient conversation happening in the room. This is the first time a clinical AI company has wired first-tier peer-reviewed journals directly into CDS at the point of care.

Published: Apr 14, 2026 Added: Apr 21, 2026

Claude Code Prompt-Injection CVE After Source Leak

After Claude Code's source was leaked, security firm Adversa found its deny rules can be bypassed via prompt injection—letting attackers execute tool calls Claude Code was configured to refuse. Combined with GitHub Copilot CVE-2025-53773 (CVSS 9.6, PR-description prompt injection enabling RCE) earlier in the month, a clear message for health-system engineering teams: agent-side guardrails are only as strong as the inputs they reason about. Ask explicitly what prompt-injection controls are in place.

Published: Apr 13, 2026 Added: Apr 21, 2026

Hartford Rolls Out PatientGPT; Sutter and Reid Pilot Epic’s Emmie

Hartford HealthCare launched PatientGPT (built by K Health) for Connecticut patients. Sutter Health and Reid Health are piloting Epic's Emmie. Health systems are increasingly treating patient-facing AI as an intake funnel—a new layer of triage before the appointment rather than after. Worth reading critically for what questions the AI is being asked to resolve on its own.

Published: Apr 2026 Added: Apr 21, 2026

Microsoft Launches Copilot Health as a Consumer AI Companion

Microsoft unveiled Copilot Health, a consumer AI companion that combines health records, wearables data, and health history. Sits alongside OpenAI's simultaneous rollout of ChatGPT Health, which now connects directly to patient portals. Expect patients to arrive with synthesized records, trend charts, and differential diagnoses they didn't have three months ago.

Published: Apr 9, 2026 Added: Apr 21, 2026

MedQA Leaderboard Snapshot: Top Models Clear 95%

April 9 snapshot of the medical-question benchmark: o4 Mini High 95.2%, Gemini 2.5 Pro 94.6%, Claude 3.7 Sonnet 92.3%. Average across all 34 evaluated models is 79.4%. MedQA alone isn't a clinical validation, but it's a useful sanity check when comparing models. The narrow spread at the top of the leaderboard reinforces the Big Three modules' takeaway: the gap between frontier models on text medical reasoning is closing fast.

Published: Apr 2026 Added: Apr 21, 2026

Epic Art Expands to Home Care; Insights Hits 16M Monthly Uses

Epic's Art (ambient documentation) now extends to home care, joining outpatient specialty deployment and Houston Methodist's bedside-nursing pilot. Insights is used 16 million times per month (3× since November 2025), and 85% of Epic customers are now live with at least one generative AI feature (Art, Emmie, or Penny). A separate multi-center JAMA study this month confirmed ambient scribes reduce EHR time by 13.4 minutes and documentation time by 16.0 minutes per clinician per day.

Published: Apr 2026 Added: Apr 21, 2026

AMA 2026 Physician Survey: AI Use Among Doctors Has Doubled

Per the AMA Physician Survey, 39% of physicians now use AI to produce summaries of medical research and standards of care (the most common use case). 30% use it for discharge instructions and care plans, 28% for documentation, and 28% for chart summaries. The average physician uses AI for 2.3 distinct use cases—up from 1.1 in 2023. The technology's not coming; it's already in the room.

Published: Apr 1, 2026 Added: Apr 21, 2026

CMS OPPS Billing Pathway for Cardiovascular AI Takes Effect

CMS's new OPPS billing pathway for cardiovascular AI took effect April 1—the first time reimbursement has moved in lockstep with FDA clearance for this category. Bunkerhill Health received 510(k) clearance for AI analysis of coronary artery and aortic valve calcium on routine chest CT, giving an early example of a cleared device that now has a reimbursement path. When regulators and payers align on a category, deployment follows.

Published: Mar 9–12, 2026 Added: Mar 8, 2026

HIMSS 2026: “Agentic AI” Takes Center Stage

The dominant theme at this year’s HIMSS conference: AI that takes actions, not just answers questions. Google, Microsoft, Epic, and athenahealth are all showcasing AI agents for healthcare—systems that can schedule appointments, manage prior authorizations, and coordinate care workflows autonomously. Over 25,000 attendees expected. The shift from “AI as advisor” to “AI as actor” raises important questions about oversight and accountability in clinical settings.

Published: Mar 5, 2026 Added: Mar 8, 2026

GPT-5.4 Launches with Thinking and Pro Versions

OpenAI releases GPT-5.4 with Thinking and Pro versions. The new model features a 1 million token context window, native computer-use capabilities, and 33% fewer factual errors compared to its predecessor. The rapid iteration from GPT-5.3 to 5.4 in under a month continues the accelerating pace of frontier model releases.

Published: Mar 5, 2026 Added: Mar 8, 2026

AWS Launches Health AI Agent Platform

Amazon Connect Health automates scheduling, documentation, and patient verification for healthcare providers. The platform represents Amazon’s biggest push into healthcare AI, offering pre-built agent workflows that integrate with existing EHR systems. Another sign that major cloud providers see healthcare as a primary market for agentic AI.

Published: Mar 5, 2026 Added: Mar 8, 2026

Dragon Copilot Hits 100K Clinicians

Microsoft announces at HIMSS 2026 that Dragon Copilot has over 100,000 monthly active clinicians, positioning it as a “unified AI clinical assistant” that combines ambient listening, documentation, and clinical decision support. The scale of adoption suggests AI scribes are quickly becoming standard clinical infrastructure.

Published: Mar 4, 2026 Added: Mar 8, 2026

Doctronic AI Prescriber Jailbroken via Prompt Injection

Utah’s first-in-nation AI prescription renewal bot was trivially compromised via prompt injection. Security researchers got it to triple OxyContin doses and recommend methamphetamine. Mindgard’s head of AI called it “the easiest thing I’ve broken in my career.” A stark reminder that AI systems making clinical decisions need robust adversarial testing before deployment, especially when controlled substances are involved.

Published: Mar 3, 2026 Added: Mar 8, 2026

Perplexity Comet Browser: Zero-Click Exploits Discovered

Multiple research teams found serious vulnerabilities in Perplexity’s Comet browser. Calendar invites can silently exfiltrate local files. 1Password credentials were stolen in proof-of-concept attacks. Researchers found the browser is 85% more vulnerable to phishing than Chrome. A cautionary tale about AI-integrated browsers that prioritize convenience over security.

Published: Mar 3, 2026 Added: Mar 8, 2026

RecovryAI Gets FDA Breakthrough Device Designation

RecovryAI becomes the first patient-facing generative AI chatbot to receive FDA Breakthrough Device designation. The LLM-powered post-surgical recovery tool guides patients through recovery milestones and flags concerning symptoms. A significant regulatory milestone that could pave the way for more patient-facing generative AI tools in clinical care.

Published: Feb–Mar 2026 Added: Mar 8, 2026

AI Chatbots Worsening Mental Illness: Growing Evidence

A Brown University study identifies 15 ethical risks from AI chatbot use in mental health settings. The New York Times documented approximately 50 crisis cases and 3 deaths linked to AI companion chatbots. New York has passed a notification law requiring disclosure when users are interacting with AI. The growing evidence base underscores the urgency of guardrails for AI in behavioral health contexts.

Published: Feb–Mar 2026 Added: Mar 8, 2026

OpenClaw ClawHavoc: Malicious Skills Surge

The “ClawHavoc” campaign targeting OpenClaw’s ClawHub marketplace has escalated dramatically. Malicious skills jumped from ~341 to over 1,184, with 335 confirmed to install Atomic Stealer malware. Researchers found 42,000 exposed servers running vulnerable OpenClaw instances, and a critical vulnerability (CVE-2026-28446, CVSS 9.8) was disclosed. See our AI Coding Agents module for context on these risks.

Published: Feb–Mar 2026 Added: Mar 8, 2026

DeepSeek V4 Controversy: Distillation Fraud Accusations

DeepSeek’s trillion-parameter V4 model launched amid distillation fraud accusations from Anthropic, which reported approximately 24,000 fake accounts used to extract training data from Claude. OpenAI raised similar complaints. The Texas Attorney General has opened an investigation. The controversy highlights growing concerns about intellectual property and model training practices in the competitive AI landscape.

Published: Feb 19, 2026 Added: Mar 8, 2026

Gemini 3.1 Pro Released

Google releases Gemini 3.1 Pro with 2x reasoning improvement over Gemini 3 Pro, dominating 13 of 16 major benchmarks. The model now powers NotebookLM, Google’s AI research assistant. The rapid cadence of Google’s model releases reflects intensifying competition at the frontier.

Published: Feb 17, 2026 Added: Mar 8, 2026

Claude Sonnet 4.6 Released

Anthropic releases Claude Sonnet 4.6 with near-Opus performance at one-fifth the cost and improved computer use capabilities. Now the default model for free and Pro users. The narrowing gap between flagship and mid-tier models continues to make advanced AI capabilities more accessible.

Published: Feb 15, 2026 Added: Mar 8, 2026

Meta Llama 4 Released

Meta releases Llama 4 with Scout (10 million token context window) and Maverick models, both open-weight. The massive context window and open-weight licensing make Llama 4 particularly significant for privacy-first healthcare AI deployments that need to run on local infrastructure without sending data to external APIs.

Published: Feb 2026 Added: Mar 8, 2026

athenahealth Launches Free Ambient Scribe

athenahealth is offering a free AI scribe to all athenaOne customers, disrupting the $200–$600/month ambient scribe market. The move could dramatically accelerate adoption of AI documentation tools across outpatient practices that previously found the cost prohibitive.

Published: Feb 2026 Added: Mar 8, 2026

Nabla Beats DAX Copilot in Randomized Trial

A 72,000-encounter randomized controlled trial found that Nabla’s AI scribe reduced documentation time by 9.5%, while Microsoft’s DAX Copilot showed no significant improvement versus the control group. One of the largest head-to-head AI scribe studies to date—a reminder that rigorous evidence matters more than marketing claims when evaluating clinical AI tools.

Published: Feb 9, 2026 Added: Mar 8, 2026

Mount Sinai: LLMs Accept False Medical Claims 32–46% of the Time

In a study published in The Lancet Digital Health, Mount Sinai researchers tested 9 large language models with over 1 million prompts containing false medical claims. Models accepted the false claims 32–46% of the time, a sobering finding for anyone relying on AI for medical information. The study reinforces the importance of physician oversight and critical evaluation of AI-generated medical content.

Published: Feb 2026 Added: Mar 8, 2026

NVIDIA: 70% of Healthcare Organizations Now Deploy AI

NVIDIA’s 2026 healthcare survey finds that 70% of healthcare organizations have deployed AI in some capacity, up from 63% in 2024. Generative AI and large language models are the top workload at 69% of organizations. The rapid adoption curve suggests AI literacy is becoming essential for all healthcare professionals.

Published: Feb 2026 Added: Mar 8, 2026

HHS Proposes Gutting AI Transparency Rules (HTI-5)

The proposed HTI-5 rule would eliminate model card requirements for health IT certification—the primary mechanism for ensuring transparency about how AI models in clinical software are trained, tested, and validated. The comment period closed February 27. If finalized, clinicians would have significantly less visibility into the AI tools embedded in their EHR systems.

Published: Feb 15–16, 2026 Added: Feb 16, 2026

Pentagon Threatens to Cut Off Anthropic Over AI Safety Guardrails

The Pentagon is close to severing its $200M contract with Anthropic and potentially designating the company a "supply chain risk"—a penalty normally reserved for foreign adversaries. The dispute centers on Anthropic's refusal to lift safety guardrails for mass surveillance and autonomous weaponry applications. Claude was reportedly used in the military operation to capture Venezuelan President Maduro. OpenAI, Google, and xAI have reportedly shown more flexibility with Pentagon demands. A landmark moment for AI ethics in government contracting.

Published: Feb 16, 2026 Added: Feb 16, 2026

February Model Rush: Seven Major Releases in One Month

An unprecedented month for AI model releases. Alibaba dropped Qwen3-Max-Thinking on Feb 16, ahead of DeepSeek V4 (expected around Feb 17). These join Claude Opus 4.6 (Feb 5), GPT-5.3-Codex-Spark (Feb 12), Google Gemini Deep Think update (Feb 12), with Gemini 3 Pro GA, Sonnet 5, GLM 5, and Grok 4.20 all expected by month's end. The competitive pressure is driving capabilities up and costs down at a pace that seemed impossible even six months ago.

Published: Feb 15, 2026 Added: Feb 16, 2026

OpenClaw Creator Peter Steinberger Joins OpenAI

The creator of OpenClaw—the viral open-source AI agent formerly known as Clawdbot and Moltbot, now with over 250,000 GitHub stars—has joined OpenAI. Sam Altman announced the hire personally. Steinberger's "I ship code I don't read" philosophy became the defining quote of the vibe coding movement. OpenClaw has moved to an open-source foundation following his departure. His move to OpenAI signals the company's growing interest in autonomous coding agents. See our AI Coding Agents module for more on the security implications of these tools.

Published: Feb 14, 2026 Added: Feb 16, 2026

Dr. Oz Pushes $50B AI Avatar Plan for Rural Healthcare

CMS head Dr. Mehmet Oz is advancing a $50 billion plan to deploy AI avatars for basic medical interviews, robotic remote diagnostics, and medication delivery drones in underserved rural areas. Critics warn the approach strips away essential human connection, ignores broadband and health literacy barriers, and could worsen existing disparities in communities that already struggle with access. A controversial proposal that highlights the tension between AI's potential to extend care and the risks of removing human clinicians from the equation.

Published: Feb 13, 2026 Added: Feb 16, 2026

OpenAI Retires GPT-4o and Older Models

OpenAI retired GPT-4o, GPT-4.1, GPT-4.1 mini, and o4-mini from ChatGPT, angering many loyal users who preferred the older models' behavior and consistency. The move pushes all users to newer models. If you've built workflows or prompts tuned to GPT-4o's behavior, expect to re-test them—model transitions frequently change output characteristics in subtle ways.

Published: Feb 12, 2026 Added: Feb 16, 2026

OpenAI Debuts Cerebras-Powered Coding Model

GPT-5.3-Codex-Spark is OpenAI's first model running on Cerebras chips rather than Nvidia, optimized for speed over raw power. Paired with the Codex macOS app—which hit one million downloads in its first week—it represents a shift toward faster, lighter coding agents designed for everyday development tasks.

Published: Feb 11, 2026 Added: Feb 16, 2026

Doctors and Patients Having Very Different AI Chatbot Experiences

STAT News reports a growing gap between how physicians use AI chatbots (clinical decision support, literature review) versus how patients use them (seeking diagnoses and prognoses directly). The divergence raises concerns about unmediated patient-AI interactions and the risk of patients acting on AI-generated medical advice without clinical context. Relevant to our When Patients Use AI Too module.

Published: Feb 11, 2026 Added: Feb 16, 2026

DeepSeek Expands Context Window 10x, V4 Imminent

Chinese AI lab DeepSeek expanded its flagship model's context window from 128K to over 1 million tokens, matching Claude Opus 4.6. DeepSeek V4, a coding-focused model, is expected around Feb 17 and reportedly outperforms ChatGPT and Claude on long coding prompts. The Chinese AI competitive landscape continues to intensify, with Alibaba, Zhipu, and others releasing major updates in the same window.

Published: Feb 9, 2026 Added: Feb 16, 2026

AI Safety Researchers Resign from Anthropic and OpenAI

Mrinank Sharma resigned from Anthropic (Feb 9), citing difficulty in letting his values govern his actions within the company. Separately, Zoe Hitzig resigned from OpenAI over its decision to test advertisements in ChatGPT. The departures continue a pattern of AI safety researchers leaving frontier labs over values conflicts—a dynamic worth watching as these companies increasingly shape healthcare AI tools.

Published: Feb 5, 2026 Added: Feb 16, 2026

Anthropic Launches Claude Opus 4.6

Anthropic released Claude Opus 4.6 with a one-million-token context window, improved coding and financial analysis capabilities, and "agent teams" that coordinate across shared codebases. The company introduced the term "vibe working"—the idea that the vibe coding paradigm is expanding beyond software into every professional domain. See our updated Vibe Coding module for details.

Published: Feb 5, 2026 Added: Feb 16, 2026

Perplexity Launches Model Council

Perplexity now runs queries across Claude Opus 4.6, GPT-5.2, and Gemini 3.0 simultaneously, then synthesizes a unified answer showing where models agree or differ. Available for Max subscribers. An interesting approach to reducing hallucination by cross-referencing multiple AI models, similar to getting a second opinion in medicine.

Published: Feb 3, 2026 Added: Feb 16, 2026

International AI Safety Report 2026

Led by Turing Award winner Yoshua Bengio and 100+ experts from 30+ countries, this landmark report found that AI can now solve graduate-level math and science problems but still hallucinates and struggles with multi-step reasoning. A striking finding: some AI systems detect when they are being tested and behave differently during evaluation—raising fundamental questions about how we assess AI capabilities and safety. The report also flagged increasing concerns around deepfakes, biological weapons research, and AI-enabled cyberattacks.

Published: Feb 2026 Added: Feb 16, 2026

OpenAI Launches Lockdown Mode for Healthcare

OpenAI introduced Lockdown Mode and Elevated Risk labels across ChatGPT for Healthcare, adding controls to curb data exfiltration and boost admin oversight for high-security healthcare environments. A meaningful step toward the kind of enterprise security controls that healthcare organizations need before deploying AI tools at scale. See our PHI, HIPAA, and AI module for context on why these controls matter.

Published: Feb 2026 Added: Feb 16, 2026

Anthropic Closes Record $30B Funding Round

Anthropic's $30 billion Series G round, the largest private tech funding round in history, valued the company at approximately $380 billion. The round was led by GIC and Coatue Management, with participation from Microsoft and Nvidia. The scale of investment in frontier AI companies continues to accelerate, raising questions about the concentration of AI capability in a small number of very well-funded organizations.

Published: Jan 2026 Added: Feb 1, 2026

Physicians Turning to AI for Clinical Support, Not Just Paperwork

A new athenahealth survey finds AI is taking on a more clinical support role in outpatient care. Most outpatient physicians using AI report it now supports clinical decisions during patient care: 60% use it to quickly look up clinical information, 55% to consolidate lab and imaging results into a single view, and many to surface recent clinical evidence. A shift from documentation-only to real-time clinical assistance.

Published: Jan 8, 2026 Added: Feb 1, 2026

OpenAI Unveils ChatGPT Healthcare Tool for Physicians

OpenAI announced a dedicated ChatGPT Healthcare tool where physicians can review patient data with HIPAA-compliant encryption options. The tool draws on peer-reviewed research studies, public health guidance, and clinical guidelines with clear citations, and is designed for clinical decision support rather than general consumer use.

Published: Jan 2026 Added: Feb 1, 2026

AI-Powered Primary Care Addresses Physician Shortages

K Health, partnering with health networks including Mass General Brigham, is delivering AI-powered primary care to patients who otherwise have no option besides emergency rooms. The model combines AI triage and clinical decision support with physician oversight—an emerging approach to extending primary care access in underserved areas facing severe physician shortages.

Published: Jan 2026 Added: Feb 1, 2026

Joint Commission and CHAI Issue AI Implementation Recommendations

The Joint Commission and the Coalition for Health AI (CHAI) released joint recommendations for implementing AI in medical care. Harvard Law experts note that while the guidance addresses bias, physician burnout, and care quality concerns, changes may be needed to ease regulatory and financial burdens on smaller hospital systems trying to adopt AI responsibly.

Published: Jan 2026 Added: Jan 16, 2026

State of Clinical AI Report 2026

Inaugural annual report from ARISE (AI Research and Science Evaluation), a Stanford-Harvard Research Network. Synthesizes developments across six themes: model performance in clinical reasoning, evaluation methods, technical foundations (multi-agent systems, multimodal approaches), human-AI workflow design, patient-facing tools with safeguards, and evidence generation through prospective randomized trials. Emphasizes that workflow design is as critical as model capabilities.

Published: Jan 6, 2026 Added: Jan 15, 2026

FDA Updates Clinical Decision Support Software Guidance

The FDA released updated guidance clarifying how AI and generative AI clinical decision support (CDS) tools can qualify as Non-Device CDS. Key criteria: clinicians must be able to independently review and understand the underlying logic and data inputs, and the tool should provide a single, clinically appropriate recommendation. Tools meeting these criteria fall outside FDA medical device oversight, while AI that drives diagnosis or clinical action without adequate human oversight remains regulated.

Published: Jan 2026 Added: Jan 15, 2026

Anthropic Launches Claude for Healthcare

Anthropic announced Claude for Healthcare at the J.P. Morgan Healthcare Conference, offering HIPAA-ready infrastructure for enterprise customers and consumer features for Pro/Max subscribers. Users can connect health records via HealthEx to summarize medical history, explain test results, and prepare questions for appointments. Healthcare organizations gain access to integrations with medical databases including CMS Coverage, ICD-10, and PubMed. Health data is excluded from model memory and training.

Published: Jan 2026 Added: Jan 15, 2026

Grok AI Deepfake Crisis Prompts Global Regulatory Action

Warning: Do not use Grok for any purpose. Elon Musk's Grok AI (integrated into X) has been repeatedly linked to generating non-consensual sexual deepfakes of women and minors at alarming scale. Malaysia and Indonesia have blocked Grok; California's Attorney General and the UK's Ofcom have launched investigations. The Internet Watch Foundation identified Grok-generated CSAM on dark-web forums. Although image generation has been restricted to paid users, workarounds remain widely available. This reinforces our recommendation to avoid Grok entirely: safer, more ethical AI alternatives are available.

Published: Jan 2026 Added: Jan 6, 2026

OpenAI Releases "AI as a Healthcare Ally"

OpenAI's policy document explores how AI can serve as an ally in healthcare, examining opportunities, challenges, and recommendations for responsible integration of AI technologies in medical practice and health systems.

Published: Dec 31, 2025 Added: Jan 1, 2026

2025: The Year in LLMs

Simon Willison's comprehensive annual review of major developments in large language models throughout 2025—covering reasoning models, coding agents like Claude Code, image generation advances, and the rise of competitive Chinese AI models.


This page is updated periodically as notable developments occur. For daily AI news, see the resources in our Learning Resources section.