Here is today’s AI Dev Brief from Marktechpost, covering core research, models, infrastructure tools, and applied updates for AI developers and researchers.

Baidu Qianfan Team Releases Qianfan-OCR: A 4B-Parameter Unified Document Intelligence Model

Qianfan-OCR is a 4B-parameter unified end-to-end model developed by Baidu that integrates document parsing, layout analysis, and semantic understanding into a single vision-language architecture. By replacing traditional multi-stage pipelines with an image-to-Markdown approach and a "Layout-as-Thought" mechanism, the model preserves critical visual context that is often lost during text extraction. It currently ranks first among end-to-end models on the OmniDocBench v1.5 (93.12) and OlmOCR (79.8) benchmarks, and it outperforms much larger models such as Qwen3-VL-235B and Gemini-3.1-Pro on Key Information Extraction (KIE) tasks. Read the full analysis/article here.

Check out the Model here

Ant Group Researchers Unveil a Five-Layer Lifecycle-Oriented Security Framework to Mitigate Autonomous LLM Agent Vulnerabilities in OpenClaw

Researchers from Tsinghua University and Ant Group conducted a comprehensive security analysis of the OpenClaw autonomous LLM agent framework, identifying critical vulnerabilities across its entire operational lifecycle. Their study reveals that OpenClaw’s "kernel-plugin" architecture, centered on the pi-coding-agent, is susceptible to multi-stage systemic risks such as skill poisoning, indirect prompt injection, memory poisoning, and intent drift. To address these threats, the team proposed a five-layer, lifecycle-oriented defense architecture (Foundational Base, Input Perception, Cognitive State, Decision Alignment, and Execution Control layers) designed to replace fragmented point solutions. The framework relies on technical enablers including eBPF for kernel-level sandboxing, Merkle-tree structures for memory integrity validation, and symbolic solvers for formal plan verification, with the goal of securing an agent’s complete operational trajectory against complex adversarial attacks. Read the full analysis/article here.
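To make the Merkle-tree idea concrete, here is a minimal Python sketch of how an agent's memory entries could be checked against a trusted root hash to detect memory poisoning. This is an illustration only: the brief does not describe the paper's actual data structure, and the names `merkle_root`, `memory_is_intact`, and the sample memory entries are hypothetical.

```python
import hashlib


def _h(data: bytes) -> bytes:
    """SHA-256 digest of the given bytes."""
    return hashlib.sha256(data).digest()


def merkle_root(entries: list[str]) -> bytes:
    """Return a Merkle root over a list of memory entries (hypothetical helper)."""
    if not entries:
        return _h(b"")
    # Leaves and inner nodes use different prefixes so that a leaf hash
    # cannot be confused with an inner-node hash.
    level = [_h(b"\x00" + e.encode("utf-8")) for e in entries]
    while len(level) > 1:
        # Hash adjacent pairs; an unpaired last node is promoted unchanged.
        nxt = [
            _h(b"\x01" + level[i] + level[i + 1])
            for i in range(0, len(level) - 1, 2)
        ]
        if len(level) % 2 == 1:
            nxt.append(level[-1])
        level = nxt
    return level[0]


def memory_is_intact(entries: list[str], trusted_root: bytes) -> bool:
    """Recompute the root and compare it with the trusted one."""
    return merkle_root(entries) == trusted_root


# Assumed example data: three entries in an agent's long-term memory.
memory = [
    "user prefers concise answers",
    "project repo: demo-app",
    "deploy target: staging",
]
trusted_root = merkle_root(memory)  # stored somewhere the agent cannot write

# Simulate memory poisoning by altering a single entry.
tampered = list(memory)
tampered[2] = "deploy target: production"

print(memory_is_intact(memory, trusted_root))    # True
print(memory_is_intact(tampered, trusted_root))  # False
```

Any change to a single entry changes the root, so a verifier only needs to keep one 32-byte hash outside the agent's reach. A fuller design would also store per-entry proofs so that individual entries can be verified without rehashing the whole memory.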

Check out the Paper here

Latest Releases in Last 72 Hours

MiMo-V2-TTS (Xiaomi)
Capy (Capy AI)
MiMo-V2-Omni (Xiaomi)
Mistral Forge (Mistral)
GPT-5.4 mini and nano (OpenAI)
Nia CLI (Nozomio Labs)
MiroThinker (MiroMindAI)
Slate (Random Labs)
Leanstral 119B A6B (Mistral)
and many more…

Project Notebooks/Tutorials

▶ How to Design an Agentic AI Architecture with LangGraph and OpenAI Using Adaptive Deliberation, Memory Graphs, and Reflexion Loops [Codes | Tutorial]

▶ A Coding Guide to Design and Orchestrate Advanced ReAct-Based Multi-Agent Workflows with AgentScope and OpenAI [Codes | Tutorial]

▶ How to Build a Production-Ready Multi-Agent Incident Response System Using OpenAI Swarm and Tool-Augmented Agents [Codes | Tutorial]

▶ A Coding Implementation to Build a Self-Testing Agentic AI System Using Strands to Red-Team Tool-Using Agents and Enforce Safety at Runtime [Codes | Tutorial]

▶ How to Design Transactional Agentic AI Systems with LangGraph Using Two-Phase Commit, Human Interrupts, and Safe Rollbacks [Codes | Tutorial]

200+ more open codes/notebooks here ➡️

Upcoming AI Events

NVIDIA GTC 2026 [March 2026]
ICLR 2026 [April 2026]
AI Now Summit - Mistral AI [May 2026]
CVPR 2026 [June 2026]
ACL 2026 [July 2026]
ICML 2026 [July 2026]
EMNLP 2026 [Oct 2026]
and many more…
