Microsoft Researchers Introduce ARTIST: A Reinforcement Learning Framework That Equips LLMs with Agentic Reasoning and Dynamic Tool Use
Source: MarkTechPost. LLMs have made impressive gains in complex reasoning, primarily through innovations in architecture, scale, and training...

AI That Teaches Itself: Tsinghua University’s ‘Absolute Zero’ Trains LLMs With Zero External Data
Source: MarkTechPost. LLMs have shown advancements in reasoning capabilities through Reinforcement Learning with Verifiable Rewards (RLVR), which relies...
Google Redefines Computer Science R&D: A Hybrid Research Model that Merges Innovation with Scalable Engineering
Source: MarkTechPost. Computer science research has evolved into a multidisciplinary effort involving logic, engineering, and data-driven experimentation. With...
ServiceNow AI Released Apriel-Nemotron-15b-Thinker: A Compact Yet Powerful Reasoning Model Optimized for Enterprise-Scale Deployment and Efficiency
Source: MarkTechPost. AI models today are expected to handle complex tasks such as solving mathematical problems, interpreting logical...
Multimodal LLMs Without Compromise: Researchers from UCLA, UW–Madison, and Adobe Introduce X-Fusion to Add Vision to Frozen Language Models Without Losing Language Capabilities
Source: MarkTechPost. LLMs have made significant strides in language-related tasks such as conversational AI, reasoning, and code generation....

NVIDIA Open-Sources Open Code Reasoning Models (32B, 14B, 7B)
Source: MarkTechPost. NVIDIA continues to push the boundaries of open AI development by open-sourcing its Open Code Reasoning...
Hugging Face Releases nanoVLM: A Pure PyTorch Library to Train a Vision-Language Model from Scratch in 750 Lines of Code
Source: MarkTechPost. In a notable step toward democratizing vision-language model development, Hugging Face has released nanoVLM, a compact...
Google Launches Gemini 2.5 Pro I/O: Outperforms GPT-4 Turbo in Coding, Supports Native Video Understanding and Leads WebDev Arena
Source: MarkTechPost. Just ahead of its annual I/O developer conference, Google has released an early preview of Gemini...
Researchers from Fudan University Introduce Lorsa: A Sparse Attention Mechanism That Recovers Atomic Attention Units Hidden in Transformer Superposition
Source: MarkTechPost. Large Language Models (LLMs) have gained significant attention in recent years, yet understanding their internal mechanisms...