Machine Learning – Page 21 – aifuturefront.com

JetBrains Open Sources Mellum: A Developer-Centric Language Model for Code-Related Tasks

Source: MarkTechPost JetBrains has officially open-sourced Mellum, a purpose-built 4-billion-parameter language model tailored for software development tasks. Developed...

May 2, 2025

Training LLM Agents Just Got More Stable: Researchers Introduce StarPO-S and RAGEN to Tackle Multi-Turn Reasoning and Collapse in Reinforcement Learning

Source: MarkTechPost Large language models (LLMs) face significant challenges when trained as autonomous agents in interactive environments. Unlike...

May 2, 2025

Xiaomi introduced MiMo-7B: A Compact Language Model that Outperforms Larger Models in Mathematical and Code Reasoning through Rigorous Pre-Training and Reinforcement Learning

Source: MarkTechPost With rising demand for AI systems that can handle tasks involving multi-step logic, mathematical proofs, and...

May 2, 2025

Building the Internet of Agents: A Technical Dive into AI Agent Protocols and Their Role in Scalable Intelligence Systems

Source: MarkTechPost As large language model (LLM) agents gain traction across enterprise and research ecosystems, a foundational gap...

May 2, 2025

DeepSeek-AI Released DeepSeek-Prover-V2: An Open-Source Large Language Model Designed for Formal Theorem, Proving through Subgoal Decomposition and Reinforcement Learning

Source: MarkTechPost Formal mathematical reasoning has evolved into a specialized subfield of artificial intelligence that requires strict logical...

May 1, 2025

Microsoft AI Released Phi-4-Reasoning: A 14B Parameter Open-Weight Reasoning Model that Achieves Strong Performance on Complex Reasoning Tasks

Source: MarkTechPost Despite notable advancements in large language models (LLMs), effective performance on reasoning-intensive tasks—such as mathematical problem...

May 1, 2025

meta-ai-introduces-reasonir-8b:-a-reasoning-focused-retriever-optimized-for-efficiency-and-rag-performance

Alibaba Qwen Team Just Released Qwen3: The Latest Generation of Large Language Models in Qwen Series, Offering a Comprehensive Suite of Dense and Mixture-of-Experts (MoE) Models

Source: MarkTechPost Despite the remarkable progress in large language models (LLMs), critical challenges remain. Many models exhibit limitations...

Apr 29, 2025

JetBrains Open Sources Mellum: A Developer-Centric Language Model for Code-Related Tasks

Training LLM Agents Just Got More Stable: Researchers Introduce StarPO-S and RAGEN to Tackle Multi-Turn Reasoning and Collapse in Reinforcement Learning

Xiaomi introduced MiMo-7B: A Compact Language Model that Outperforms Larger Models in Mathematical and Code Reasoning through Rigorous Pre-Training and Reinforcement Learning

Building the Internet of Agents: A Technical Dive into AI Agent Protocols and Their Role in Scalable Intelligence Systems

DeepSeek-AI Released DeepSeek-Prover-V2: An Open-Source Large Language Model Designed for Formal Theorem, Proving through Subgoal Decomposition and Reinforcement Learning

Microsoft AI Released Phi-4-Reasoning: A 14B Parameter Open-Weight Reasoning Model that Achieves Strong Performance on Complex Reasoning Tasks

Meta AI Introduces ReasonIR-8B: A Reasoning-Focused Retriever Optimized for Efficiency and RAG Performance

Making AI models more trustworthy for high-stakes settings

ThinkPRM: A Generative Process Reward Models for Scalable Reasoning Verification

Alibaba Qwen Team Just Released Qwen3: The Latest Generation of Large Language Models in Qwen Series, Offering a Comprehensive Suite of Dense and Mixture-of-Experts (MoE) Models