Developer AI news

July 2026

ByteDance launches Seedream 5.0 Pro image model for advanced design work

ByteDance has rolled out Seedream 5.0 Pro, a new image model designed to "understand design" with powerful editing features, improved text rendering, and layer separation for editable designs. It aims to provide frontier-level control and creative partnership for user workflows.

ViewGen: Open-source Unreal Engine plugin integrating ComfyUI for 3D viewport data

Michael N. developed ViewGen, an open-source Unreal Engine plugin that integrates ComfyUI to make use of 3D viewport and camera data. Users can generate precise depth and color reference images and movies from UE and pipe them into ComfyUI-style graphs directly within the engine.

Guide: Enhance employee 1-on-1s with Claude Cowork for structured feedback

This guide demonstrates how to use Claude Cowork to improve employee 1-on-1s by setting up a project with past transcripts, creating a customized call template, and defining an evaluation rubric. It enables managers to track progress and automate template updates for consistent feedback.

AI workflow for commercial mortgage identification and prospect ranking using Claude

A commercial loan portfolio manager developed an AI workflow using Claude to process public mortgage data, extract commercial filings, enrich contact information, and rank prospects by priority. The system generates an Excel file with an executive summary and targeted suggestions for outreach.

Anthropic introduces 'Reflections' dashboard for Claude usage analysis and optimization

Anthropic has launched a 'Reflections' dashboard to help users analyze their interaction patterns with Claude. This feature provides insights into usage habits and suggests skill creation based on personal data, aiming to optimize user experience and prompt engineering.

Meta to begin manufacturing in-house 'Iris' AI chip in September

Meta is reportedly set to begin manufacturing its proprietary 'Iris' AI chip in September, with plans to double its computing capacity to 14 GW by 2027. This move signifies Meta's increasing investment in custom AI hardware to support its growing AI infrastructure.

RobbyAnt releases LingBot-World 2, a real-time world model for embodied AI

Embodied AI lab RobbyAnt has launched LingBot-World 2, a new world model capable of generating every frame in real-time without requiring a 3D engine. This represents a significant advancement for embodied AI systems, enabling more dynamic and responsive virtual environments.

OpenAI retracts endorsement of SWE-Bench Pro coding benchmark due to task issues

OpenAI has published research retracting its endorsement of the SWE-Bench Pro coding benchmark, after discovering that nearly a third of the benchmark's tasks contained issues. This highlights challenges in accurately evaluating AI coding capabilities and the need for robust benchmarks.

Reve 2.1 image model released, achieving high Arena ranking with less compute

Reve has launched version 2.1 of its native-4K image model, which has re-secured the No. 2 spot on Arena's overall leaderboard. This model achieves its high performance while being trained on less than a tenth of the compute used by its rivals, highlighting significant efficiency gains.

Guide: Optimize Fable token usage with an orchestrator setup for cost efficiency

This guide details how to reduce Fable token consumption by strategically using it as a planner and reviewer, while delegating token-heavy tasks like browsing, coding, and research to lower-cost models such as Codex or other Claude variants. It provides a step-by-step orchestrator workflow.
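
The planner/worker split described above can be sketched as a small routing function. This is a minimal illustration, not the guide's actual workflow: the `Step` type, the tier names, and the stand-in model callables are all hypothetical, and a real setup would wrap actual model clients where the stand-ins are.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

# Token-heavy task kinds named in the guide; the set itself is an assumption
# about how one might encode them.
DELEGATED_KINDS = {"browsing", "coding", "research"}


@dataclass
class Step:
    kind: str    # e.g. "planning", "coding", "review"
    prompt: str


def route(step: Step) -> str:
    """Return the tier for a step: 'worker' for token-heavy work, else 'planner'."""
    return "worker" if step.kind in DELEGATED_KINDS else "planner"


def run_plan(
    steps: List[Step], models: Dict[str, Callable[[str], str]]
) -> List[Tuple[str, str]]:
    """Send each step to the model for its tier and record (tier, output)."""
    results = []
    for step in steps:
        tier = route(step)
        results.append((tier, models[tier](step.prompt)))
    return results


# Hypothetical stand-ins for the expensive planner and the cheaper worker model.
def fake_planner(prompt: str) -> str:
    return f"[planner] {prompt}"


def fake_worker(prompt: str) -> str:
    return f"[worker] {prompt}"


demo_steps = [
    Step("planning", "outline the feature"),
    Step("coding", "implement the feature"),
    Step("review", "check the diff"),
]
demo_results = run_plan(demo_steps, {"planner": fake_planner, "worker": fake_worker})
```

In this sketch only the planning and review steps reach the expensive model; the coding step goes to the cheaper tier, which is where most tokens are spent.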

Microsoft replaces third-party AI models with in-house MAI in Office applications

Microsoft is transitioning tens of thousands of prompts in Excel and Outlook to its proprietary MAI models, aiming to reduce and ultimately eliminate reliance on external AI providers like OpenAI and Anthropic. The shift is meant to give Microsoft more internal control and lower costs.

Anthropic's Claude Cowork expands to mobile and web with background task execution

Claude Cowork, Anthropic's agent, is now available on mobile and web, allowing tasks to run in the background even when devices are offline. Projects can continue across devices, though local file and browser access remain desktop-only.

Cognition releases SWE-1.7, a fast, near-frontier coding model for Devin

Cognition has launched SWE-1.7, a new coding model built on a Kimi K2.7 base that runs at 1,000 tokens per second. Served on Cerebras and integrated into Devin, it achieves performance comparable to frontier models.

Willow offers free, unlimited AI dictation with Frontier Mini model

Willow has launched its Frontier Mini model, providing free and unlimited cloud-based AI dictation with zero data retention. It claims superior speed and accuracy compared to competitors like Wispr Flow, OpenAI, and Deepgram for transcribing speech into text across applications.

OpenAI introduces GPT-Live for natural, full-duplex voice conversations

GPT-Live, OpenAI's next-gen voice model, replaces ChatGPT Voice with a full-duplex architecture that enables simultaneous listening and talking, eliminating awkward pauses. It can delegate complex reasoning to stronger models and supports real-time translation, offering a more human-like conversational experience.

Google DeepMind's AlphaEvolve algorithm-inventing agent now generally available

AlphaEvolve, the DeepMind system capable of writing and improving its own code, has reached general availability. Klarna has already used it to double its machine-learning training throughput.

Google Cloud Run sandboxes enter public preview for secure AI code execution

Google Cloud Run sandboxes are now in public preview, offering isolated, ephemeral environments for AI agents to securely run code. These sandboxes provide zero network access by default, spin up in milliseconds, and are available at no additional cost beyond standard Cloud Run fees.

Mistral Releases Leanstral 1.5 Open Model for Verified Math Proofs

Mistral introduced Leanstral 1.5, an open model specifically designed for the generation and verification of mathematical proofs. This model aims to advance automated reasoning and formal verification in mathematics.

ByteDance Introduces Seed Audio 1.0 for Unified Audio Generation

ByteDance launched Seed Audio 1.0, a versatile model capable of generating speech, music, and sound effects in a single pass. This model offers a unified solution for various audio content creation needs.

Kyutai and General Intuition Release MIRA World Model for Rocket League Simulation

Kyutai and General Intuition, in collaboration with Epic Games, released MIRA, an open-source world model capable of running a live 2v2 Rocket League game entirely within a neural network. Trained on bot footage, it simulates game physics and renders details at 20 FPS on a single Nvidia GPU.

Replit Enables Rapid Mobile App Prototyping with AI Agent

Replit provides tools and a guide for developers to rapidly prototype mobile applications using its AI Agent. This allows quick transformation of app ideas into testable prototypes with Expo Go, facilitating iterative refinement of core user flows.

Tencent Hunyuan Open-Sources Efficient Hy3 Model with Apache 2.0 License

Tencent's Hunyuan released Hy3 as an open-source model under an Apache 2.0 license, achieving high efficiency by using a small subset of parameters per request. It competes with larger models on web research and tool use, offering a compelling option for developers.

Anthropic Research Uncovers 'J-space' Internal Workspace in Claude

Anthropic's research revealed 'J-space,' an internal workspace within Claude that functions like an internal notepad, holding active concepts and directing the model's thinking. This undesigned structure emerged during training and is crucial for multi-step problem completion.

Tufa Labs Wins ARC-AGI-3 Contest with Open-Source Qwen-Based Coding Agent

Tufa Labs secured first place in the ARC-AGI-3 milestone contest by developing an open-sourced coding agent. This agent, which wraps a small Qwen model, demonstrated effective problem-solving capabilities on game-based benchmarks.

DoorDash Introduces DashBench for Evaluating AI Code Reviewers

DoorDash developed DashBench, an internal benchmark to test AI code reviewers against historical code changes. A pairing of Kimi K2.6 and Claude Fable 5 achieved the best results, catching two-thirds of problems and 80% of critical bugs at $3.81 per review, enabling open models in their pipeline.
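
The two headline numbers above (share of all problems caught, share of critical bugs caught) are recall-style metrics. A minimal sketch of how such rates could be computed follows; the `KnownIssue` type and the idea of matching by issue ID are assumptions for illustration, since the blurb does not describe how DashBench scores a match.

```python
from dataclasses import dataclass
from typing import Iterable, Set, Tuple


@dataclass(frozen=True)
class KnownIssue:
    issue_id: str     # hypothetical identifier for a problem in a historical change
    critical: bool    # whether the problem counts as a critical bug


def catch_rates(known: Iterable[KnownIssue], flagged: Set[str]) -> Tuple[float, float]:
    """Return (overall catch rate, critical catch rate) for one reviewer.

    `flagged` holds the IDs of known issues the reviewer reported.
    """
    known = list(known)
    critical = [k for k in known if k.critical]
    caught = sum(k.issue_id in flagged for k in known)
    caught_critical = sum(k.issue_id in flagged for k in critical)
    overall = caught / len(known) if known else 0.0
    critical_rate = caught_critical / len(critical) if critical else 0.0
    return overall, critical_rate


# Made-up example data: six known issues, two of them critical.
example_known = [
    KnownIssue("a", True),
    KnownIssue("b", True),
    KnownIssue("c", False),
    KnownIssue("d", False),
    KnownIssue("e", False),
    KnownIssue("f", False),
]
example_rates = catch_rates(example_known, {"a", "c", "d", "e"})
```

Reporting the critical rate separately matters because a reviewer can score well overall while missing the bugs that are most costly in production.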

Google for Startups Releases Generative Media Technical Guide

Google for Startups launched a technical guide for building production-grade, multimodal creative AI applications using DeepMind models like Veo and Lyria. The blueprint emphasizes deterministic control, programmatic guardrails, cryptographic provenance, and economic scalability.

Researchers Develop 23MB Add-on for Tiny Models to Match Qwen3-32B Performance

Researchers have created a 23MB add-on that enables small AI models to achieve performance comparable to Qwen3-32B on common text tasks. This innovation allows for efficient offline execution on devices like a MacBook, significantly reducing model size requirements.

Cassidy Launches No-Code AI Agent Builder for Enterprise Workflows

Cassidy provides a no-code platform for building model-agnostic AI agents that integrate with enterprise tools like CRMs and chat apps. These agents automate repetitive tasks such as drafting proposals and triaging tickets, deployable in Slack, Teams, or browsers.

Meta's 'Watermelon' Model Reportedly Matches GPT-5.5 Performance

Meta's 'Watermelon' model, currently in training, is reported to achieve performance on par with OpenAI's GPT-5.5, albeit using 10x the compute of Muse Spark. An update to Muse Spark with significant coding and agentic improvements is also planned for Meta AI and its API.

Open-Source pxpipe Tool Reduces Claude API Costs by Up to 70%

The open-source tool pxpipe enables developers to cut Claude Code API costs by up to 70% by converting text to PNG images, leveraging cheaper image token pricing. While it introduces some lossiness and latency, it offers significant savings for high-volume usage.

Meta Superintelligence Labs Releases Muse Image Generation Model

Meta's Muse Image, an in-house AI image generation model from Superintelligence Labs, ranks No. 2 on Arena leaderboards. It features agentic capabilities like web search and self-editing, rolling out across Meta AI, Instagram, and WhatsApp, with a Muse Video model also teased.

xAI Launches Grok Voice Agent Builder, a No-Code Platform for Voice Agents

xAI has launched Grok Voice Agent Builder, a no-code platform designed to enable developers to easily create and deploy voice agents without extensive coding knowledge.

Researchers Solve Open Math Problems Using GPT-5.5 Pro and Opus 4.8 Pipeline

Researchers led by Binghui Peng used a pipeline combining GPT-5.5 Pro and Opus 4.8 to solve nine long-standing open problems in mathematics and theoretical computer science.

Google Design.md and Stitch Skills Enable Agentic Website Design with Claude Code

Google has introduced Design.md, an open-source standard for agent-friendly design briefs, which can be used with Claude Code via the `google-labs-code/stitch-skills` plugin. This enables developers to leverage Google's AI design capabilities to generate consistent website prototypes and brand assets.

ZCode: Agentic Coding Environment Tuned for GLM-5.2

ZCode is an agentic coding environment developed by Z AI, specifically tuned for the GLM-5.2 model, designed to enhance developer workflows and automate coding tasks.

ByteDance's Frontier AI Video Model, Seedance 2.0, Now Generally Available

ByteDance has made its frontier AI video model, Seedance 2.0, generally available, offering advanced video generation capabilities to developers.

Specialized AI Outperforms Frontier Models in Financial Tasks at Lower Cost

Research by Thinking Machines Lab and Bridgewater demonstrated that a small, custom AI model (Qwen3-235B fine-tuned on TML's Tinker platform) significantly outperformed frontier models on specialized financial news filtering tasks, achieving 84.7% accuracy at 13.8 times lower cost. This highlights the power and cost-effectiveness of specialized AI solutions.

Claude Fable 5 Achieves 16.1% Automation Rate on Freelance Tasks in Remote Labor Index

The Remote Labor Index, a benchmark evaluating AI agents on real-world freelance tasks, shows Claude Fable 5 achieving a 16.1% automation rate, meaning its output matched or surpassed that of human professionals on 16.1% of tasks. This rate is double that of the next best model.

Google Gemini Spark Agent Now in macOS Beta with Custom MCP Support

Google's Gemini Spark agent is now available in beta on macOS, enabling local file processing and integrations with various applications. A key developer feature is support for custom MCP (Model Context Protocol) servers, which lets developers connect the agent to a wider range of applications.

Meta Explores Cloud Business to Offer AI Compute and Model Access

Meta is reportedly developing a new cloud business to offer access to its extensive AI compute infrastructure and hosted models, including Muse Spark. This initiative aims to monetize Meta's significant AI investments by renting out spare data center capacity to developers and competing with major cloud providers.

Guide: Create Winning Ads with Claude and Goose Ads Skill

Developers can leverage Claude with the Goose Ads skill to generate ad creatives by analyzing website content and brand DNA. This integration allows for automated ad project creation and generation using specific templates.

Palmier Pro: Free AI-Enhanced Video Editor Integrating with Claude

Palmier Pro is a free AI-enhanced video editor that integrates with Claude, enabling automated transcription, captioning, cutting, and color grading of raw footage. This tool streamlines video post-production workflows for creators.

OpenAI's Codex Adds 'Record & Replay' Tool for macOS Task Automation

OpenAI's "Record & Replay" tool for macOS, integrated with Codex, allows developers to automate repetitive manual tasks by recording screen actions and converting them into reusable skills. This feature enhances developer productivity by streamlining workflows.

Microsoft Makes MAI-Code-1-Flash Coding AI Generally Available

Microsoft has made MAI-Code-1-Flash, its in-house coding AI, generally available for select users. This model provides advanced coding assistance and automation capabilities for developers.

Cognition Launches Devin Fusion, a Cost-Cutting Multi-Model Coding Harness

Cognition has launched Devin Fusion in preview, a new coding harness that combines a frontier AI model with a more cost-effective "sidekick" agent. This multi-model approach aims to maintain high code quality while significantly reducing development costs.

Base44 Introduces Base1, a New In-House AI Model for App Development

Base44 has introduced Base1, its new in-house AI model designed to facilitate the building of applications, providing developers with a dedicated foundation model for their app development workflows.

Meituan Releases Longcat 2.0 Open Coding Model Trained on Chinese Chips

Meituan has released Longcat 2.0, an open coding model specifically trained on Chinese chips, offering a new option for developers in the open-source AI coding landscape.

OpenAI Previews GPT-5.6 Model Family: Sol, Terra, and Luna

OpenAI has launched a limited preview of its GPT-5.6 model family, including Sol (flagship for complex problems with "ultra" mode and subagents), Terra (balanced, matching GPT-5.5 at half the cost), and Luna (fastest, cheapest for high-volume tasks). Sol demonstrates strong reasoning, outperforming Mythos 5 on Terminal-Bench 2.1.

DeepSeek Releases DSpark Framework to Accelerate DeepSeek V4 by 60-85%

DeepSeek has released DSpark, a new open-source speculative decoding framework that accelerates DeepSeek V4's per-user generation speed by 60% to 85% without compromising output quality.
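
For readers unfamiliar with speculative decoding, the toy sketch below shows the generic greedy variant of the technique: a cheap draft model proposes several tokens, the target model checks them, and the output is identical to what the target would have produced alone. It is not DSpark's algorithm, whose details are not described here; the function names and the integer "models" are hypothetical.

```python
from typing import Callable, List

# A "model" here is just a greedy next-token function over integer tokens.
NextToken = Callable[[List[int]], int]


def speculative_decode(
    target: NextToken, draft: NextToken, prompt: List[int], max_new: int, k: int = 4
) -> List[int]:
    """Greedy speculative decoding: output always equals the target's greedy output."""
    out = list(prompt)
    limit = len(prompt) + max_new
    while len(out) < limit:
        # 1. The draft model proposes up to k tokens autoregressively.
        proposal: List[int] = []
        ctx = list(out)
        for _ in range(min(k, limit - len(out))):
            tok = draft(ctx)
            proposal.append(tok)
            ctx.append(tok)
        # 2. The target checks each proposed position. A real system scores
        #    all positions in one batched forward pass, which is the speedup.
        for tok in proposal:
            expected = target(out)
            out.append(expected)
            if expected != tok:
                break  # first mismatch: keep the target's token, drop the rest
        else:
            # Every draft token was accepted; the target adds one bonus token.
            if len(out) < limit:
                out.append(target(out))
    return out


def toy_target(ctx: List[int]) -> int:
    return (ctx[-1] + 1) % 10


def toy_draft(ctx: List[int]) -> int:
    # Agrees with the target except after token 2, where it guesses wrong.
    return 9 if ctx[-1] == 2 else (ctx[-1] + 1) % 10


decoded = speculative_decode(toy_target, toy_draft, [0], max_new=8)
```

Because every emitted token is the target's own choice, draft errors cost speed but never change the result, which is why such frameworks can claim no loss in output quality.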

Eluvio AI Revolutionizes Video Archiving with Frame-by-Frame Tagging and Search

Eluvio offers an AI-powered solution that processes video archives by tagging every face, line of speech, logo, and on-screen text frame-by-frame, making vast video libraries fully searchable directly within the streaming pipeline.