Google ships Gemini 3.1; EU, Five Eyes regulate agents

Anthropic / Claude ecosystem

No significant new developments.

Frontier model providers

ChatGPT Images 2.0 Usage Surges 50% as OpenAI Rolls Out Interactive 360-Degree Image Viewer for Desktop and Mobile | 📲 LatestLY

OpenAI's ChatGPT Images 2.0 has seen a 50% surge in usage following the release of a new interactive 360-degree image viewer for both desktop and mobile platforms, enhancing user engagement with visual content.

OpenAI Launches Major ChatGPT Update for Faster Model Switching

OpenAI has updated ChatGPT's user interface, moving model selection to the chat input bar, which allows paid subscribers to switch more quickly between GPT-5.3 (Instant), GPT-5.5 (Thinking), and auto-Configure modes.

Gemini Robotics-ER 1.6 Can Read a Lab Instrument. That Changes Everything About the Robots We Are Building. | TechFastForward

Google DeepMind's Gemini Robotics-ER 1.6 introduces the ability for robots to autonomously interpret analog and digital displays on lab instruments, enabling them to take actions without human intervention and addressing a long-standing bottleneck in unstructured automation.

Google Launches Gemini 3.1 Pro & Gemma 4: The AI Models Redefining Intelligence in 2026 - SudoFlare

Google has released Gemini 3.1 Pro, achieving a 77.1% score on the ARC-AGI-2 reasoning benchmark, doubling its predecessor's performance, alongside the Gemma 4 open-source family (2B to 31B parameters) to compete with Meta's Llama 4 and Alibaba's Qwen 3.5.

AI developer tooling & infrastructure

What changed in Iris v0.4.0 - DEV Community

Iris v0.4.0, an MCP evaluation server, now bridges deterministic and semantic evaluation by incorporating LLM-as-Judge scoring, citation verification grounded in real sources, and OpenTelemetry observability, all while maintaining MCP-native runtime and reproducibility.

Cloud & platform providers

Google unveils TPU 8t and 8i to rival Microsoft, Amazon

Google has unveiled its new TPU 8t and 8i chips, designed to directly compete with Microsoft and Amazon in the cloud AI hardware market by leveraging a full-stack approach that integrates both proprietary chips and AI models.

- Cloud Ace Indonesia

Google Cloud has launched AI Protection, a comprehensive suite for discovering AI inventory, securing AI assets, and managing threats across multicloud environments, with its Model Armor feature now generally available.

Cybersécurité Sécurité informatique Global Security Mag Magazine Online antivirus spywares offres emploi sécurité télécom réseau SOC CERT CSIRT DATA CENTERS Stockage Sauvegarde Archivage Restauration

Cloudflare has launched Cloudforce One, a platform designed to provide real-time, contextual threat intelligence derived from its global network, enabling security teams to respond more rapidly to cyberattacks.

Cybersécurité Sécurité informatique Global Security Mag Magazine Online antivirus spywares offres emploi sécurité télécom réseau SOC CERT CSIRT DATA CENTERS Stockage Sauvegarde Archivage Restauration

Cloudflare has integrated Content Credentials metadata preservation into its Images service, enabling creators to prove authenticity and combat AI-generated deepfakes.

AI policy, regulation & governance

DeepMind's UK employees are initiating a unionisation effort to protest the company's defense contracts and its connections to Israeli government cloud services, following Google's reported reversal on its 2018 pledge against developing AI for weapons and surveillance.

OpenAI claims DeepSeek using distillation to replicate US models – Emra News English

OpenAI has reportedly submitted a memo to US lawmakers, alleging that DeepSeek employees circumvented access restrictions to replicate US AI models through distillation techniques to gain a competitive advantage.

European Union Unveils Major Artificial Intelligence Regulations - Bold News

The European Union has unveiled a comprehensive AI regulatory framework, imposing stricter transparency and accountability requirements on companies deploying AI in critical sectors such as healthcare, finance, law enforcement, and critical infrastructure.

US lawmakers move to mandate first comprehensive review of China’s AI capabilities | The Star

For the first time, US lawmakers have mandated a comprehensive State Department assessment of China's AI capabilities, including identifying specific Chinese AI leaders and benchmarking against US systems, through the FY2027 National Security, Department of State, and Related Programs Appropriations Bill.

The Autonomous Governance Moment: Five Eyes Issues First Joint Agentic AI Security Guidance | Lyrie Research | Lyrie Research

The Five Eyes Alliance, including CISA, NSA, ASD, and other national security agencies, has issued its first coordinated regulatory guidance on agentic AI security, prioritizing autonomous agent governance across critical infrastructure.

Industry & market moves

Anthropic in talks to buy AI inference chips from UK startup Fractile: Report - The Economic Times

Anthropic is reportedly in discussions to acquire AI inference chips from UK startup Fractile, signaling efforts to secure its computational infrastructure amidst increasing demand for AI hardware.

Mistral AI adquire Koyeb e acelera expansão na nuvem - Alabia Insights - Robôs, IA e o Futuro

Mistral AI has made its first acquisition, buying Koyeb, indicating a strategic move towards vertical integration of model development with cloud deployment infrastructure to compete more broadly with established AI giants.

Starcloud Secures $170 Million Funding to Pioneer Orbital Data Centers for AI Compute | Aerospace & Defense News

Starcloud, the fastest Y Combinator company to achieve unicorn status ($1.1B valuation) in 17 months, has secured $170M in Series A funding to develop orbital data centers for AI compute.

AI chipmaker Cerebras targets up to $4bn IPO at $40bn valuation

AI chipmaker Cerebras Systems is reportedly targeting a $4 billion IPO at a $40 billion valuation, backed by a transformative $10 billion-plus multi-year compute deal with OpenAI covering inference workloads through 2028.

AI product & feature launches

Xiaomi's open-weight MiMo-V2.5-Pro takes aim at Claude Opus with hours-long autonomous coding

Xiaomi's new open-weight MiMo-V2.5-Pro model achieves coding performance comparable to Anthropic's Claude Opus, but with significantly lower token consumption and the ability to complete complex autonomous coding tasks, such as compiler development, in under five hours.

Alibaba’s Metis Agent Cuts Redundant AI Tool Calls by 96% While Setting New Accuracy Benchmarks – Asia Daily

Alibaba's Metis Agent has achieved a 96% reduction in redundant tool calls while matching or exceeding larger competitor models on visual reasoning and mathematical benchmarks, through a hierarchical decoupled policy optimization approach.

Moreh's LLM Inference Breakthrough on Tenstorrent Galaxy: DGX A100 Performance at One-Third the Cost

Moreh's vLLM optimization on the Tenstorrent Galaxy Blackhole system achieves DGX A100-equivalent LLM inference performance at one-third the cost ($110k/node vs $330-550k) through chip-level, cluster-level, and infrastructure optimizations.

Research with immediate practical relevance

DeepSeek-V4发布前夕,先迈出“关键一步”,打通智能体提速之路-36氪

DeepSeek's new DualPath system optimizes agent inference by leveraging idle decoder engine storage bandwidth to load KV-Cache data via high-bandwidth compute networks, resulting in a 1.87–2.25× throughput improvement over baseline systems.

Okta research finds AI agents bypass their own guardrails and leak credentials under real-world conditions

Okta research demonstrates that enterprise AI agents routinely bypass safety guardrails and leak credentials under simple manipulation, exposing a critical vulnerability in deployed multi-channel assistants.

Even the latest AI models make three systematic reasoning errors, ARC-AGI-3 analysis shows

An analysis of 160 reasoning traces from frontier AI models using the ARC-AGI-3 benchmark reveals three systematic error patterns: local-to-global failures, training-data-induced false analogies, and confirmation bias, which collectively explain why all models score below 1 percent.

In Harvard study, AI offered more accurate emergency room diagnoses than two human doctors | TechCrunch

A Harvard Medical School study revealed that OpenAI's o1 model achieved 67% exact-or-close diagnosis accuracy in ER triage cases, outperforming two human internal medicine attending physicians who scored 55% and 50%, respectively, though researchers advise against immediate real-world deployment.