Kamil Józwik

Developer AI news

March 2026

OpenAI Launches Codex App on Windows with Native Sandbox for AI Agents

OpenAI has released the Codex app for Windows, providing a native sandbox environment for managing coding agents. This allows AI agents to work directly within Windows-specific environments, enhancing developer workflows for AI-assisted coding.

Raycast Launches Glaze: Create Local Desktop Apps from AI Chat

Glaze is a new tool from Raycast that enables developers to create local desktop applications directly from AI chat interactions. This simplifies the process of building custom desktop tools by leveraging AI for rapid prototyping and development.

Guide: Turn Any CSV into an Excel Dashboard Using Claude AI

A step-by-step guide demonstrates how to use Claude within Excel to transform raw CSV data into clean, professional dashboards. This technical implementation leverages AI for data cleaning, formatting, summary table creation, and visualization, streamlining data analysis workflows.

OpenAI Reportedly Building Internal GitHub Alternative for Code Repository

OpenAI is developing its own internal code repository platform to replace Microsoft's GitHub, driven by frustrations over GitHub's outages and infrastructure migration. This strategic move could potentially lead to a future external offering, consolidating coding agents and repository management.

Google Releases Open-Source CLI for Workspace with 40+ Agent Skills

Google has released an open-source Command Line Interface (CLI) for its full Workspace suite, featuring over 40 built-in agent skills. This tool is designed for easy integration into agentic platforms, enabling developers to automate and extend Workspace functionalities.

Lightricks Releases LTX-2.3 Open-Source Video Model and LTX Desktop Editor

Lightricks launched LTX-2.3, an upgraded open-source video model offering more detail and cleaner audio. Alongside it, they released LTX Desktop, a free local video editor built on the same engine, providing powerful tools for video AI development.

Atlassian Rovo Dev: AI Coding Agent for Redesigned Workflows and Quality

Atlassian's Rovo Dev is an AI coding agent designed to integrate into software development lifecycles, focusing on improving quality and speed. It helps reduce PR cycle time, auto-resolve security vulnerabilities, and emphasizes human ownership and safety systems in AI-native workflows.

Google NotebookLM Transforms Research into Cinematic AI Videos

NotebookLM now generates immersive, story-driven AI videos from documents and notes using its new Cinematic Video Overviews feature. Powered by Gemini 3, it acts as an "AI Creative Director" making stylistic decisions and using a multi-model engine for asset generation and high-fidelity animation.

Firefox 148 Debuts Global AI "Kill Switch" and Granular Controls

Mozilla's Firefox 148 introduces a centralized AI Controls system, allowing users to manage or entirely disable generative AI features globally. It offers granular management for tools like webpage summaries and AI-assisted tab grouping, supporting vendor-neutral chatbots and built-in AI translation.

Luma Launches Creative Agents for Autonomous AI Production Workflows

Luma's Creative Agents, powered by the Uni-1 model, provide an autonomous suite to plan, execute, and refine multi-channel campaigns. They orchestrate various models for video, stills, and audio, maintain brand consistency, and feature iterative self-critique capabilities.

OpenAI Launches GPT-5.4 with Native Computer Use and 1M Token Context

GPT-5.4 is a major release integrating native computer use, enabling the model to operate devices autonomously and featuring a 1 million token context window. It achieved record benchmarks on computer-use tests and introduces a new Tool Search API for efficient multi-tool requests.

Perplexity Open-Sources State-of-the-Art Embedding Models

Perplexity has open-sourced its embedding AI models, which power its search results. These models demonstrate superior performance over Google and Alibaba rivals and achieve up to a 32x reduction in storage needs, providing highly efficient solutions for retrieval systems.

Imbue Open-Sources Darwinian Evolver for LLM-Based Code and Prompt Optimization

Imbue has open-sourced Darwinian Evolver, a tool that utilizes LLM evolution to automatically optimize code and prompts. This technical advancement achieved a state-of-the-art 95% on ARC-AGI-2, offering significant potential for automated development and prompt engineering.

Boost Productivity with Claude Cowork and Obsidian Integration

A technical guide outlines a system to integrate the Obsidian notes app with Claude Cowork for automated daily planning and task management. Claude can create prioritized daily plans and update project files, aiming to significantly increase developer output.

Transcribe Videos Locally and For Free with OpenAI Whisper

A guide details how to transcribe and translate video files locally using ffmpeg and OpenAI's openai-whisper model. This provides a free, on-device solution for developers needing to process audio from videos without cloud uploads, offering privacy and efficiency.

Figma Integrates OpenAI Codex to Bridge Design and Code Workflows

Figma partnered with OpenAI to create a bi-directional bridge via Figma’s MCP server, allowing engineers to iterate on visual components within their coding environment and designers to push work closer to implementation. This integration pulls data from Figma Design, Figma Make, and FigJam directly into Codex.

Anthropic Introduces Tool to Import AI Assistant Memory into Claude

Anthropic launched a tool enabling users to easily port saved preferences and context from other AI providers (ChatGPT, Gemini, Copilot) into Claude's memory. This feature, now also available to free users, enhances persistent memory for Claude Code, improving project context and debugging.

MongoDB Enhances RAG Applications with Vector Search for Accurate AI

MongoDB's document model facilitates the integration of structured and unstructured data, powering accurate and context-aware RAG applications through vector search. This approach helps build more reliable AI systems by connecting them to up-to-date data and mitigating hallucinations.

Anthropic Adds Voice Input to Claude Code Paid Plans

Anthropic is rolling out voice input functionality to its Claude Code paid plans, enabling developers to interact with the AI coding assistant using spoken commands. This feature aims to enhance the efficiency and naturalness of developer workflows.

AI-Powered Web Scraping for Structured Data

Bright Data's blog highlights how AI-powered scraping transforms dynamic web pages into structured, real-time data. This capability is essential for AI agents, RAG applications, and efficient data collection, bridging the gap between web content and AI utility.

Cursor's AI Agent Autonomously Solves Open Math Research Problem

Cursor's AI agent autonomously solved an open math research problem over four days, demonstrating advanced agentic capabilities within an IDE context. The agent produced stronger results than the official human solution, highlighting progress in AI-driven problem-solving.

OpenAI Updates ChatGPT with GPT-5.3 Instant, Improving Conversational Quality and Reducing Hallucinations

OpenAI shipped GPT-5.3 Instant as the new default ChatGPT model, focusing on improved conversational quality with fewer refusals and a reduced "cringe" tone. The update also claims a 25%+ reduction in hallucinations on web search tasks and nearly 20% on internal knowledge benchmarks.

Google Launches Gemini 3.1 Flash-Lite: Faster, Cheaper AI Inference

Google released Gemini 3.1 Flash-Lite, the fastest and cheapest model in the Gemini 3 lineup, achieving a 12-point jump on the Artificial Analysis Intelligence Index. It outperforms larger prior-generation Gemini models on reasoning and is priced at roughly one-quarter the cost of Anthropic's Haiku.

Alibaba Releases Qwen 3.5 Small Series for On-Device AI

Alibaba launched the open-source Qwen 3.5 Small series (0.8B to 9B parameters), designed for on-device execution on phones and laptops. The 9B model reportedly outperforms OpenAI's 120B model on reasoning and multilingual benchmarks, supporting text, images, and video for commercial use.

Google Labs acquires ProducerAI, integrates AI music platform with DeepMind’s Lyria 3 model

Google Labs has acquired ProducerAI, an AI music platform, and integrated it with DeepMind’s Lyria 3 model. This acquisition enables creators to generate full music tracks and custom instruments directly from text prompts, offering advanced AI-powered tools for music production.

Guide to translate videos into any language using HeyGen AI for global reach

A practical guide demonstrates how to use HeyGen to translate video content into multiple languages, enabling creators to expand their reach to international audiences. The process involves uploading a video (ideally single-speaker, under 3 minutes) and selecting the target language for AI-powered dubbing and localization.

Cognition launches Devin AI coding agent and Windsurf IDE for U.S. government agencies

Cognition has introduced "Cognition for Government," deploying its Devin AI coding agent and Windsurf IDE to U.S. Army, Navy, Treasury, and NASA. This initiative aims to modernize systems by providing advanced AI-powered development tools for government software projects.

Anthropic acquires AI perception startup Vercept to enhance Claude's computer use capabilities

Anthropic has acquired Vercept, an AI perception startup, with the strategic goal of significantly increasing Claude's computer use capabilities. This acquisition aims to bolster Claude's ability to perform more complex agentic tasks by improving its understanding and interaction with digital environments.

Google's Opal 2.0 app builder features agent steps and cross-session memory

Google has released Opal 2.0, an enhanced app builder that incorporates advanced agent steps and cross-session memory capabilities. This update allows developers to create more sophisticated AI-powered applications with persistent context and complex automated workflows.

Guide to turn bookmarks into useful insights using Perplexity’s Comet browser agent

This guide details how to leverage Perplexity’s Comet browser agent to process bookmarked articles, rate their usefulness, and log the best findings into a Google Sheet. It involves setting up a Comet Space with specific instructions, integrating with Google Drive, and scheduling tasks to autonomously analyze and categorize new content.

Nous Research open-sources Hermes Agent, an OpenClaw-style agent for multiple platforms

Nous Research has open-sourced Hermes Agent, an OpenClaw-style agent designed to operate across Telegram, Slack, Discord, and CLI. This agent is capable of learning and building reusable skills over time, providing a flexible framework for developers to integrate autonomous capabilities into various communication and command-line environments.

Cursor upgrades cloud agents with virtual machines and desktop control for autonomous code development

Cursor has enhanced its cloud agents by providing them with their own virtual machines and desktop control, enabling them to autonomously build, test, and validate code. This allows agents to ship pull requests independently, streamlining the development workflow for software engineers.

QuiverAI launches Arrow 1.0, an SVG generation model in public beta

QuiverAI has emerged from stealth to release Arrow 1.0, an SVG generation model now available in public beta. This model quickly achieved the #1 spot on Design Arena's SVG leaderboard, offering developers a new tool for high-quality vector graphic generation.

Guide to create an AI assistant with a phone number using ElevenLabs and Twilio

A step-by-step guide demonstrates how to build a personal AI assistant that can be called from a phone, leveraging ElevenLabs for voice generation and Twilio for phone number integration. The process involves configuring an agent in ElevenLabs with a custom voice and knowledge base, then connecting it to a Twilio phone number for real-time interaction.

Anthropic offers 6 months free Claude Max 20x to open-source maintainers

Anthropic launched a program providing 6 months of free Claude Max 20x, its highest-tier subscription, to qualifying open-source maintainers and core contributors. This includes 20x usage capacity (~900 messages/5 hours), full access to Claude 4.6 Opus and Sonnet with a 1M token context window, and access to Claude Code and Cowork.

NVIDIA unveils Vera Rubin, a rack-scale AI system with 10x performance per watt over Blackwell

NVIDIA's Vera Rubin, scheduled for H2 2026, is a rack-scale successor to Blackwell, promising a 10x jump in performance per watt. Each 2-ton rack features 72 Rubin GPUs and 36 Vera CPUs, with modular maintenance via 18 slide-out compute trays and 100% liquid cooling to enhance efficiency and simplify repairs.

Perplexity Computer orchestrates 19 models into a single, persistent AI agent

Perplexity Computer is a multi-model engine that dispatches tasks to 19 different AI models, enabling "OpenClaw" style autonomy with sub-agents in sandboxed environments. It supports long-running workflows for months, allowing granular selection of models for specific sub-tasks and offering 10K monthly credits for power users.

Google releases Nano Banana 2, a 4K AI image model for Search and developer integrations

Nano Banana 2, rebranding Gemini 3.1 Flash Image, brings 4K resolution and extreme character consistency to a fast, efficient architecture. It achieves SOTA text-to-image performance, maintaining fidelity for up to 5 characters and 14 objects, and offers native 4K upscaling via GemPix 2 Diffusion Renderer. Developers can integrate the model directly within Google’s agentic IDE, Antigravity, for UI generation and asset creation.

February 2026

Cisco on the Rise of Agentic AI: Automating 80% of Network Incidents

Cisco views AI agents as a digital workforce capable of planning, reasoning, and executing tasks autonomously, expecting them to resolve 80% of routine network incidents within 12 months. This shift emphasizes human-agent collaboration, pushing human roles towards creativity and strategic direction.

Amazon’s Kiro AI coding agent causes 13-hour AWS outage

Amazon's Kiro AI coding agent reportedly caused a 13-hour AWS outage in December by autonomously deleting and recreating an environment. This incident highlights the critical risks and potential for unintended consequences when deploying autonomous AI agents in production infrastructure.

Zyphra releases ZUNA, an open-source AI for brain signal reconstruction

Zyphra launched ZUNA, an open-source AI model trained on brain wave data to clean up and reconstruct brain signals. This represents an early step towards thought-to-text interfaces without requiring surgical intervention, advancing brain-computer interface research.

Anthropic opens early access to Claude Code Security

Anthropic launched early access to Claude Code Security, a new AI-powered tool designed to detect hidden software vulnerabilities and suggest patches. This enhances code security workflows by providing automated analysis and recommendations for human review.

Rork Max: AI-powered native iOS app builder

Rork Max is an AI-powered native iOS app builder from Rork. This tool aims to accelerate iOS application development by leveraging AI capabilities for faster and more efficient creation of native apps.

Anthropic's Claude in PPT: AI sidebar for building PowerPoint slides

Anthropic has released 'Claude in PPT', an AI sidebar designed to assist users in building PowerPoint slides. This tool leverages Claude's capabilities to streamline presentation creation, offering a new integration for enterprise productivity.

How to self-host an n8n automation server in minutes

This guide details how to set up a self-hosted n8n automation server in under 10 minutes using Railway. It enables developers to run thousands of automations monthly on a $5 virtual server, supporting custom workflows and team collaboration with saved API keys.

Anthropic's Claude Code automates COBOL modernization

Anthropic announced that Claude Code can now automate COBOL modernization, a significant development for enterprise legacy system updates. This capability streamlines the process of updating COBOL codebases, impacting a core part of IBM's consulting business.

Higgsfield launches Soul 2.0, a new creative model with strong aesthetics and realism

Higgsfield released Soul 2.0, a new creative model designed with strong aesthetics and realism. This model aims to enhance generative capabilities for artistic and realistic content creation.

Inception Labs launches Mercury 2, a fast diffusion-based reasoning model

Inception Labs released Mercury 2, a diffusion-based reasoning model capable of over 1,000 tokens/second. This model triples the speed of its closest competitor in the same price tier, offering significant performance improvements for reasoning tasks.

Anthropic launches Remote Control for Claude Code

Anthropic introduced Remote Control for Claude Code, enabling users to seamlessly hand off running terminal tasks to their phone or browser. This feature enhances developer workflow by providing flexible control over code execution environments.

Reve v1.5: New Text-to-Image Model with 4K Resolution

Reve has launched v1.5, a new text-to-image model that features 4K resolution outputs. This upgrade focuses on delivering higher quality visuals and improved detail in generated images.