RoR-Bench: Revealing Recitation Over Reasoning in Large Language Models Through Subtle Context Shifts
Source: MarkTechPost In recent years, the rapid progress of LLMs has given the impression that we are nearing...
Boson AI Introduces Higgs Audio Understanding and Higgs Audio Generation: An Advanced AI Solution with Real-Time Audio Reasoning and Expressive Speech Synthesis for Enterprise Applications
Source: MarkTechPost In today’s enterprise landscape, especially in insurance and customer support, voice and audio data are more than...
OpenAI Open Sources BrowseComp: A New Benchmark for Measuring the Ability for AI Agents to Browse the Web
Source: MarkTechPost Despite advances in large language models (LLMs), AI agents still face notable limitations when navigating the...

Google AI Introduces Ironwood: A Google TPU Purpose-Built for the Age of Inference
Source: MarkTechPost At the 2025 Google Cloud Next event, Google introduced Ironwood, its latest generation of Tensor Processing...

ByteDance Introduces VAPO: A Novel Reinforcement Learning Framework for Advanced Reasoning Tasks
Source: MarkTechPost In reinforcement learning (RL) training for large language models (LLMs), value-free methods like GRPO and DAPO have shown...
T* and LV-Haystack: A Spatially-Guided Temporal Search Framework for Efficient Long-Form Video Understanding
Source: MarkTechPost Understanding long-form videos—ranging from minutes to hours—presents a major challenge in computer vision, especially as video...
This AI Paper Introduces a Machine Learning Framework to Estimate the Inference Budget for Self-Consistency and GenRMs (Generative Reward Models)
Source: MarkTechPost Large Language Models (LLMs) have demonstrated significant advancements in reasoning capabilities across diverse domains, including mathematics...
Unveiling Attention Sinks: The Functional Role of First-Token Focus in Stabilizing Large Language Models
Source: MarkTechPost LLMs often show a peculiar behavior where the first token in a sequence draws unusually high...

TorchSim: A Next-Generation PyTorch-Native Atomistic Simulation Engine for the MLIP Era
Source: MarkTechPost Radical AI has released TorchSim, a next-generation PyTorch-native atomistic simulation engine for the MLIP era. It...
Salesforce AI Released APIGen-MT and xLAM-2-fc-r Model Series: Advancing Multi-Turn Agent Training with Verified Data Pipelines and Scalable LLM Architectures
Source: MarkTechPost AI agents are quickly becoming core components in handling complex human interactions, particularly in business environments where...