Offline Video-LLMs Can Now Understand Real-Time Streams: Apple Researchers Introduce StreamBridge to Enable Multi-Turn and Proactive Video Understanding
Source: MarkTechPost Video-LLMs process whole pre-recorded videos at once. However, applications like robotics and autonomous driving need causal...
PrimeIntellect Releases INTELLECT-2: A 32B Reasoning Model Trained via Distributed Asynchronous Reinforcement Learning
Source: MarkTechPost As language models scale in parameter count and reasoning complexity, traditional centralized training pipelines face increasing...
AG-UI (Agent-User Interaction Protocol): An Open, Lightweight, Event-based Protocol that Standardizes How AI Agents Connect to Front-End Applications
Source: MarkTechPost The current generation of AI agents has made significant progress in automating backend tasks such as...
NVIDIA AI Introduces Audio-SDS: A Unified Diffusion-Based Framework for Prompt-Guided Audio Synthesis and Source Separation without Specialized Datasets
Source: MarkTechPost Audio diffusion models have achieved high-quality speech, music, and Foley sound synthesis, yet they predominantly excel...
This AI Paper Introduces Effective State-Size (ESS): A Metric to Quantify Memory Utilization in Sequence Models for Performance Optimization
Source: MarkTechPost In machine learning, sequence models are designed to process data with temporal structure, such as language,...
LightOn AI Released GTE-ModernColBERT-v1: A Scalable Token-Level Semantic Search Model for Long-Document Retrieval and Benchmark-Leading Performance
Source: MarkTechPost Semantic retrieval focuses on understanding the meaning behind text rather than matching keywords, allowing systems to...
A Coding Implementation of Accelerating Active Learning Annotation with Adala and Google Gemini
Source: MarkTechPost In this tutorial, we’ll learn how to leverage the Adala framework to build a modular active...
Tencent Released PrimitiveAnything: A New AI Framework That Reconstructs 3D Shapes Using Auto-Regressive Primitive Generation
Source: MarkTechPost Shape primitive abstraction, which breaks down complex 3D forms into simple, interpretable geometric units, is fundamental...
Huawei Introduces Pangu Ultra MoE: A 718B-Parameter Sparse Language Model Trained Efficiently on Ascend NPUs Using Simulation-Driven Architecture and System-Level Optimization
Source: MarkTechPost Sparse large language models (LLMs) based on the Mixture of Experts (MoE) framework have gained traction...
ZeroSearch from Alibaba Uses Reinforcement Learning and Simulated Documents to Teach LLMs Retrieval Without Real-Time Search
Source: MarkTechPost Large language models are now central to various applications, from coding to academic tutoring and automated...