Machine Learning – Page 19 – aifuturefront.com

LightOn AI Released GTE-ModernColBERT-v1: A Scalable Token-Level Semantic Search Model for Long-Document Retrieval and Benchmark-Leading Performance

Source: MarkTechPost Semantic retrieval focuses on understanding the meaning behind text rather than matching keywords, allowing systems to...

May 11, 2025

ZeroSearch from Alibaba Uses Reinforcement Learning and Simulated Documents to Teach LLMs Retrieval Without Real-Time Search

Source: MarkTechPost Large language models are now central to various applications, from coding to academic tutoring and automated...

May 10, 2025

Microsoft Researchers Introduce ARTIST: A Reinforcement Learning Framework That Equips LLMs with Agentic Reasoning and Dynamic Tool Use

Source: MarkTechPost LLMs have made impressive gains in complex reasoning, primarily through innovations in architecture, scale, and training...

May 10, 2025

AI That Teaches Itself: Tsinghua University’s ‘Absolute Zero’ Trains LLMs With Zero External Data

Source: MarkTechPost LLMs have shown advancements in reasoning capabilities through Reinforcement Learning with Verifiable Rewards (RLVR), which relies...

May 9, 2025

NVIDIA Open-Sources Open Code Reasoning Models (32B, 14B, 7B)

Source: MarkTechPost NVIDIA continues to push the boundaries of open AI development by open-sourcing its Open Code Reasoning...

May 8, 2025

Hugging Face Releases nanoVLM: A Pure PyTorch Library to Train a Vision-Language Model from Scratch in 750 Lines of Code

Source: MarkTechPost In a notable step toward democratizing vision-language model development, Hugging Face has released nanoVLM, a compact...

May 8, 2025

Google Launches Gemini 2.5 Pro I/O: Outperforms GPT-4 Turbo in Coding, Supports Native Video Understanding and Leads WebDev Arena

Source: MarkTechPost Just ahead of its annual I/O developer conference, Google has released an early preview of Gemini...

May 7, 2025

Google Launches Gemini 2.5 Pro I/O: Outperforms GPT-4 in Coding, Supports Native Video Understanding and Leads WebDev Arena

Source: MarkTechPost Just ahead of its annual I/O developer conference, Google has released an early preview of Gemini...

May 7, 2025

Researchers from Fudan University Introduce Lorsa: A Sparse Attention Mechanism That Recovers Atomic Attention Units Hidden in Transformer Superposition

Source: MarkTechPost Large Language Models (LLMs) have gained significant attention in recent years, yet understanding their internal mechanisms...

May 7, 2025

This AI Paper Introduce WebThinker: A Deep Research Agent that Empowers Large Reasoning Models (LRMs) for Autonomous Search and Report Generation

Source: MarkTechPost Large reasoning models (LRMs) have shown impressive capabilities in mathematics, coding, and scientific reasoning. However, they...

May 7, 2025