Huawei Introduces Pangu Ultra MoE: A 718B-Parameter Sparse Language Model Trained Efficiently on Ascend NPUs Using Simulation-Driven Architecture and System-Level Optimization
Source: MarkTechPost Sparse large language models (LLMs) based on the Mixture of Experts (MoE) framework have gained traction...

ZeroSearch from Alibaba Uses Reinforcement Learning and Simulated Documents to Teach LLMs Retrieval Without Real-Time Search
Source: MarkTechPost Large language models are now central to various applications, from coding to academic tutoring and automated...

Microsoft Researchers Introduce ARTIST: A Reinforcement Learning Framework That Equips LLMs with Agentic Reasoning and Dynamic Tool Use
Source: MarkTechPost LLMs have made impressive gains in complex reasoning, primarily through innovations in architecture, scale, and training...

ByteDance Open-Sources DeerFlow: A Modular Multi-Agent Framework for Deep Research Automation
Source: MarkTechPost ByteDance has released DeerFlow, an open-source multi-agent framework designed to enhance complex research workflows by integrating...

Enterprise AI Without GPU Burn: Salesforce’s xGen-small Optimizes for Context, Cost, and Privacy
Source: MarkTechPost Language processing in enterprise environments faces critical challenges as business workflows increasingly depend on synthesising information...

AI That Teaches Itself: Tsinghua University’s ‘Absolute Zero’ Trains LLMs With Zero External Data
Source: MarkTechPost LLMs have shown advancements in reasoning capabilities through Reinforcement Learning with Verifiable Rewards (RLVR), which relies...

Google Redefines Computer Science R&D: A Hybrid Research Model that Merges Innovation with Scalable Engineering
Source: MarkTechPost Computer science research has evolved into a multidisciplinary effort involving logic, engineering, and data-driven experimentation. With...

ServiceNow AI Released Apriel-Nemotron-15b-Thinker: A Compact Yet Powerful Reasoning Model Optimized for Enterprise-Scale Deployment and Efficiency
Source: MarkTechPost AI models today are expected to handle complex tasks such as solving mathematical problems, interpreting logical...

Ming-Lite-Uni: An Open-Source AI Framework Designed to Unify Text and Vision through an Autoregressive Multimodal Structure
Source: MarkTechPost Multimodal AI rapidly evolves to create systems that can understand, generate, and respond using multiple data...

Meta AI Open-Sources LlamaFirewall: A Security Guardrail Tool to Help Build Secure AI Agents
Source: MarkTechPost As AI agents become more autonomous—capable of writing production code, managing workflows, and interacting with untrusted...