Offline Video-LLMs Can Now Understand Real-Time Streams: Apple Researchers Introduce StreamBridge to Enable Multi-Turn and Proactive Video Understanding
Source: MarkTechPost Video-LLMs process whole pre-recorded videos at once. However, applications like robotics and autonomous driving need causal...
AG-UI (Agent-User Interaction Protocol): An Open, Lightweight, Event-based Protocol that Standardizes How AI Agents Connect to Front-End Applications
Source: MarkTechPost The current generation of AI agents has made significant progress in automating backend tasks such as...
This AI Paper Introduces Effective State-Size (ESS): A Metric to Quantify Memory Utilization in Sequence Models for Performance Optimization
Source: MarkTechPost In machine learning, sequence models are designed to process data with temporal structure, such as language,...
LightOn AI Released GTE-ModernColBERT-v1: A Scalable Token-Level Semantic Search Model for Long-Document Retrieval and Benchmark-Leading Performance
Source: MarkTechPost Semantic retrieval focuses on understanding the meaning behind text rather than matching keywords, allowing systems to...

A Coding Implementation of Accelerating Active Learning Annotation with Adala and Google Gemini
Source: MarkTechPost In this tutorial, we’ll learn how to leverage the Adala framework to build a modular active...
Tencent Released PrimitiveAnything: A New AI Framework That Reconstructs 3D Shapes Using Auto-Regressive Primitive Generation
Source: MarkTechPost Shape primitive abstraction, which breaks down complex 3D forms into simple, interpretable geometric units, is fundamental...
Huawei Introduces Pangu Ultra MoE: A 718B-Parameter Sparse Language Model Trained Efficiently on Ascend NPUs Using Simulation-Driven Architecture and System-Level Optimization
Source: MarkTechPost Sparse large language models (LLMs) based on the Mixture of Experts (MoE) framework have gained traction...
ZeroSearch from Alibaba Uses Reinforcement Learning and Simulated Documents to Teach LLMs Retrieval Without Real-Time Search
Source: MarkTechPost Large language models are now central to various applications, from coding to academic tutoring and automated...
Microsoft Researchers Introduce ARTIST: A Reinforcement Learning Framework That Equips LLMs with Agentic Reasoning and Dynamic Tool Use
Source: MarkTechPost LLMs have made impressive gains in complex reasoning, primarily through innovations in architecture, scale, and training...

AI That Teaches Itself: Tsinghua University’s ‘Absolute Zero’ Trains LLMs With Zero External Data
Source: MarkTechPost LLMs have shown advancements in reasoning capabilities through Reinforcement Learning with Verifiable Rewards (RLVR), which relies...