OpenAI released six Codex plugins targeting data analytics, creative production, sales, product design, equity investing, and investment banking, deeply integrating ChatGPT with Codex.
Microsoft Build 2026 Unveils Scout Assistant, MAI-Thinking-1, and Project Solara OS
Microsoft launched the OpenClaw-based AI assistant Scout, flagship reasoning model MAI-Thinking-1, and an Android-based OS for AI agent gadgets called Project Solara.
Microsoft Releases Open-Source AI Behavior Testing Framework ASSET
Developers can spin up AI evaluations using natural language descriptions without manually writing test cases.
Microsoft Launches Agent Policy Control Specification
Allows dev, compliance, and security teams to define agent behavior rules in portable policy files.
Microsoft Surface RTX Spark Dev Box Announced
A miniature Surface PC powered by Nvidia's Arm chip, optimized for local AI development.
Google Launches AI Deepfake Call Scam Detection
Phone by Google will automatically identify scam calls impersonating trusted contacts.
Anthropic Confidentially Files for Potentially Largest IPO Ever
The Claude parent company submitted S-1 paperwork to the SEC, following SpaceX's IPO announcement.
ByteDance AI Lead Gu Quanquan Departs
Former ByteDance AI leader leaves, sparking widespread speculation about next moves.
OpenAI Hires Harvard's Youngest Tenured Professor Su Weijie
USTC alumnus and Harvard's youngest full professor joins OpenAI.
Trump Signs Revised AI Executive Order with Voluntary Pre-release Reviews Only
After industry objections, Trump signs a scaled-back AI oversight executive order.
Opel Gets OpenAI Investment, Plans AI Audio Device
Opal, known for high-end webcams, receives investment from OpenAI and Samsung, pivoting to AI consumer electronics.
Embodied AI Systems Hacked in 8 Hours, Security Risks Exposed
Researchers successfully attacked embodied AI systems in short time, revealing security vulnerabilities.
ByteDance Open-Sources Unified Video Editing Framework Bernini
Adds understanding capability to DiT models, enabling understand-then-edit AI video editing.
Sets new SOTA for document parsing, now available via API on official website.
Tsinghua AIR Open-Sources UniLab Robot Training Framework, 10x Speed Boost
Completes humanoid robot training in 3 minutes, runs on Mac.
📄 Papers
Unified Neural Scaling Laws
Proposes a functional form that simultaneously models scaling behavior across model parameters, dataset size, training steps, and more.
Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding
Decouples causal dependency modeling from autoregressive overhead in draft generation, improving inference speed.
Linear Ensembles Wash Away Watermarks
Theoretically proves that averaging output probability distributions across multiple models recovers unwatermarked distribution.
Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses
Separates search agent state management from policy, improving RL efficiency.
Policy and World Modeling Co-Training for Language Agents
Simultaneously learns world models during RL training without additional simulators.
Agent Skills Should Go Beyond Text: The Case for Visual Skills
Argues that existing skill learning methods storing only text experience is a fundamental bottleneck, proposing visual skills.
DOT-MoE: Differentiable Optimal Transport for MoEfication
New method for converting dense models to sparse MoEs, replacing traditional heuristic clustering.
🔧 Open Source
nesquena/hermes-webui
Web interface for Hermes Agent, supporting mobile usage.
colbymchenry/codegraph
Pre-indexed code knowledge graph supporting Claude Code, Codex, Gemini, and other mainstream AI coding tools.
p-e-w/heretic
Fully automatic censorship removal tool for language models.
revfactory/harness
Meta-skill framework that automatically designs domain-specific agent teams and generates required skills.
heygen-com/hyperframes
Agent tool for writing HTML and rendering video.
KKKKhazix/khazix-skills
Open-source AI Skills collection by Digital Life Kazik.
chopratejas/headroom
Compresses tool outputs, logs, and files, reducing token consumption by 60-95%.
Lum1104/Understand-Anything
Converts code into interactive knowledge graphs supporting search and Q&A.
No items match this filter.
💡 Today's Take
The most significant trend today is the **full-scale explosion of the Agent ecosystem**. Microsoft Build and OpenAI releasing Agent products on the same day marks Agents moving from concept to production. Microsoft's Scout, Project Solara, and Agent policy control framework, alongside OpenAI's Codex white-collar tool suite, collectively build the infrastructure for the Agent era. Notably, Microsoft emphasizes agent controllability and compliance (ASSET testing framework, policy specifications), while OpenAI focuses on professional role replacement (6 vertical plugins)—two parallel development paths. Meanwhile, multiple papers point to agent skills evolving from text to visual and multimodal forms, and the open-source community is flooded with Agent toolchain projects. The Agent "operating system" and "skill marketplace" are taking shape.