ZeroSearch from Alibaba Uses Reinforcement Learning and Simulated Documents to Teach LLMs Retrieval Without Real-Time Search
Source: MarkTechPost Large language models are now central to various applications, from coding to academic tutoring and automated...
Microsoft Researchers Introduce ARTIST: A Reinforcement Learning Framework That Equips LLMs with Agentic Reasoning and Dynamic Tool Use
Source: MarkTechPost LLMs have made impressive gains in complex reasoning, primarily through innovations in architecture, scale, and training...

AI That Teaches Itself: Tsinghua University’s ‘Absolute Zero’ Trains LLMs With Zero External Data
Source: MarkTechPost LLMs have shown advancements in reasoning capabilities through Reinforcement Learning with Verifiable Rewards (RLVR), which relies...

NVIDIA Open-Sources Open Code Reasoning Models (32B, 14B, 7B)
Source: MarkTechPost NVIDIA continues to push the boundaries of open AI development by open-sourcing its Open Code Reasoning...
Hugging Face Releases nanoVLM: A Pure PyTorch Library to Train a Vision-Language Model from Scratch in 750 Lines of Code
Source: MarkTechPost In a notable step toward democratizing vision-language model development, Hugging Face has released nanoVLM, a compact...
Google Launches Gemini 2.5 Pro I/O: Outperforms GPT-4 Turbo in Coding, Supports Native Video Understanding and Leads WebDev Arena
Source: MarkTechPost Just ahead of its annual I/O developer conference, Google has released an early preview of Gemini...
Researchers from Fudan University Introduce Lorsa: A Sparse Attention Mechanism That Recovers Atomic Attention Units Hidden in Transformer Superposition
Source: MarkTechPost Large Language Models (LLMs) have gained significant attention in recent years, yet understanding their internal mechanisms...
This AI Paper Introduces WebThinker: A Deep Research Agent that Empowers Large Reasoning Models (LRMs) for Autonomous Search and Report Generation
Source: MarkTechPost Large reasoning models (LRMs) have shown impressive capabilities in mathematics, coding, and scientific reasoning. However, they...

Is Automated Hallucination Detection in LLMs Feasible? A Theoretical and Empirical Investigation
Source: MarkTechPost Recent advancements in LLMs have significantly improved natural language understanding, reasoning, and generation. These models now...