
ReasonFlux-PRM: A Trajectory-Aware Reward Model Enhancing Chain-of-Thought Reasoning in LLMs
Source: MarkTechPost Understanding the Role of Chain-of-Thought in LLMs Large language models are increasingly being used to solve...
Baidu Researchers Propose AI Search Paradigm: A Multi-Agent Framework for Smarter Information Retrieval
Source: MarkTechPost The Need for Cognitive and Adaptive Search Engines Modern search systems are evolving rapidly as the...

Baidu Open Sources ERNIE 4.5: LLM Series Scaling from 0.3B to 424B Parameters
Source: MarkTechPost Baidu has officially open-sourced its latest ERNIE 4.5 series, a powerful family of foundation models designed...

OMEGA: A Structured Math Benchmark to Probe the Reasoning Limits of LLMs
Source: MarkTechPost Introduction to Generalization in Mathematical Reasoning Large-scale language models with long CoT reasoning, such as DeepSeek-R1,...

TabArena: Benchmarking Tabular Machine Learning with Reproducibility and Ensembling at Scale
Source: MarkTechPost Understanding the Importance of Benchmarking in Tabular ML Machine learning on tabular data focuses on building...
LongWriter-Zero: A Reinforcement Learning Framework for Ultra-Long Text Generation Without Synthetic Data
Source: MarkTechPost Introduction to Ultra-Long Text Generation Challenges Generating ultra-long texts that span thousands of words is becoming...
MDM-Prime: A generalized Masked Diffusion Models (MDMs) Framework that Enables Partially Unmasked Tokens during Sampling
Source: MarkTechPost Introduction to MDMs and Their Inefficiencies Masked Diffusion Models (MDMs) are powerful tools for generating discrete...
University of Michigan Researchers Propose G-ACT: A Scalable Machine Learning Framework to Steer Programming Language Bias in LLMs
Source: MarkTechPost LLMs and the Need for Scientific Code Control LLMs have rapidly evolved into complex natural language...
UC San Diego Researchers Introduced Dex1B: A Billion-Scale Dataset for Dexterous Hand Manipulation in Robotics
Source: MarkTechPost Challenges in Dexterous Hand Manipulation Data Collection Creating large-scale data for dexterous hand manipulation remains a...

Build Custom AI Tools for Your AI Agents that Combine Machine Learning and Statistical Analysis
Source: MarkTechPost The ability to build custom tools is critical for building customizable AI Agents. In this tutorial,...