LlamaIndex ‘legal-kb’: Agentic Retrieval over Index v2 with retrieve, find, read, and grep Tools
Source: MarkTechPost LlamaIndex has published legal-kb, a public reference application on GitHub. It is described as a knowledge...
Structured PDF-to-JSON: A Guide to Open-Source Extraction Models in 2026
Source: MarkTechPost Most enterprise data still sits inside PDFs, scans, and slide decks. Large language models and agents...
Qwen’s Former Lead on What Hybrid Thinking Got Wrong — and Why He Now Backs Agents
Source: MarkTechPost Junyang Lin was the technical lead of Alibaba’s Qwen project. He announced he was stepping down...
Anthropic Launches Claude Science Beta: A Multi-Agent AI Workbench for Reproducible Genomics, Proteomics, and Cheminformatics Pipelines
Source: MarkTechPost This week, Anthropic released Claude Science. It is an app for scientists, available in beta. It...
NVIDIA HORIZON: A Hands-Free Agent that Evolves Git Worktrees and Hits 100% RTL Benchmark Completion
Source: MarkTechPost NVIDIA Research introduced HORIZON, a hands-free agent framework for hardware design. It treats hardware design as...
NVIDIA AI Introduces ASPIRE: A Self-Improving Robotics Framework Reaching 31% Zero-Shot on LIBERO-Pro Long Tasks
Source: MarkTechPost Traditional robot programming is hard to scale. It requires orchestrating multimodal perception, physical contact dynamics, diverse...
Mistral AI Releases Leanstral 1.5: An Apache-2.0 Lean 4 Code Agent Model Solving 587 of 672 PutnamBench Problems
Source: MarkTechPost Today, Mistral AI released Leanstral 1.5. It is a code agent model built for Lean 4....
Designing a Schema-Guided Invoice Intelligence Pipeline with lift-pdf for Accounts-Payable Extraction, Validation, and Ledger Generation
Source: MarkTechPost In this tutorial, we build an end-to-end accounts-payable extraction pipeline with lift-pdf, using synthetic invoice PDFs...
Meet WebBrain: An Open-Source, Local-First AI Browser Agent That Reads Pages and Automates Tasks in Chrome and Firefox
Source: MarkTechPost WebBrain is a free, open-source browser agent for Chrome and Firefox. It reads pages, extracts data,...
Interfaze Ships diffusion-gemma-asr-small, an Open-Source Diffusion ASR Model Transcribing Six Languages via DiffusionGemma’s Parallel Denoising Decoder
Source: MarkTechPost Interfaze, a young YC’s startup, has open-sourced a new speech recognition model. It is called diffusion-gemma-asr-small....