Here is your today’s AI Dev Brief from Marktechpost, covering core research, models, infrastructure tools, and applied updates for AI developers and researchers.

NVIDIA Releases Nemotron-Cascade 2: An Open 30B MoE with 3B Active Parameters, Delivering Better Reasoning and Strong Agentic Capabilities

NVIDIA’s Nemotron-Cascade 2 is an open 30B Mixture-of-Experts (MoE) model with 3B activated parameters designed to deliver high intelligence density in reasoning and agentic tasks. It achieved Gold Medal-level performance in the 2025 International Mathematical Olympiad (IMO), International Olympiad in Informatics (IOI), and the ICPC World Finals, utilizing 20x fewer parameters than some frontier-scale models. The model’s performance is driven by a post-training pipeline that integrates Cascade RL—a sequential, domain-wise reinforcement learning framework—with Multi-Domain On-Policy Distillation (MOPD) to stabilize training and recover performance regressions.… Read the full analysis/article here.

Meet Mamba-3: A New State Space Model Frontier with 2x Smaller States and Enhanced MIMO Decoding Hardware Efficiency

Mamba-3 is an advanced State Space Model (SSM) that redefines the performance-efficiency Pareto frontier through an "inference-first" architecture. Developed by researchers from CMU, Princeton, Together AI, and Cartesia AI, the model introduces three core innovations: exponential-trapezoidal discretization for second-order accurate recurrence, complex-valued state updates utilizing a data-dependent "RoPE trick" to solve previously unreachable state-tracking tasks, and a Multi-Input Multi-Output (MIMO) formulation that increases decoding FLOPs by up to 4x while maintaining hardware efficiency . Empirically, Mamba-3 achieves a 1.8-point gain in average downstream accuracy at the 1.5B scale compared to Gated DeltaNet and delivers comparable performance to Mamba-2 while utilizing only half the state size..… Read the full analysis/article here.

Latest Releases in Last 72 Hours

Project Notebooks/Tutorials

▶ A Coding Implementation Showcasing ClawTeam's Multi-Agent Swarm Orchestration with OpenAI Function Calling Codes Tutorial

▶ A Coding Implementation to Design an Enterprise AI Governance System Using OpenClaw Gateway Policy Engines, Approval Workflows and Auditable Agent Execution Codes Tutorial

▶ How to Build an Autonomous Machine Learning Research Loop in Google Colab Using Andrej Karpathy’s AutoResearch Framework for Hyperparameter Discovery and Experiment Tracking Codes Tutorial

▶ How to Design a Streaming Decision Agent with Partial Reasoning, Online Replanning, and Reactive Mid-Execution Adaptation in Dynamic Environments Codes Tutorial

▶ How to Build a Self-Designing Meta-Agent That Automatically Constructs, Instantiates, and Refines Task-Specific AI Agents Codes Tutorial

Upcoming AI Events

How was today’s email?

Awesome  |   Decent    |  Not Great

Keep Reading