Machine Learning – Page 11 – aifuturefront.com

DeepSeek AI Releases DeepSeek-V4: Compressed Sparse Attention and Heavily Compressed Attention Enable One-Million-Token Contexts

Source: MarkTechPost

DeepSeek-AI has released a preview version of the DeepSeek-V4 series: two Mixture-of-Experts (MoE) language models built...

Apr 24, 2026

MIT scientists build the world’s largest collection of Olympiad-level math problems, and open it to everyone

Source: MIT News – Artificial intelligence

Every year, the countries competing in the International Mathematical Olympiad (IMO) arrive...

Apr 24, 2026

Google DeepMind Introduces Decoupled DiLoCo: An Asynchronous Training Architecture Achieving 88% Goodput Under High Hardware Failure Rates

Source: MarkTechPost

Training frontier AI models is, at its core, a coordination problem. Thousands of chips must communicate...

Apr 24, 2026

Mend Releases AI Security Governance Framework: Covering Asset Inventory, Risk Tiering, AI Supply Chain Security, and Maturity Model

Source: MarkTechPost

There’s a pattern playing out inside almost every engineering organization right now. A developer installs GitHub...

Apr 24, 2026

OpenAI Releases GPT-5.5, a Fully Retrained Agentic Model That Scores 82.7% on Terminal-Bench 2.0 and 84.9% on GDPval

Source: MarkTechPost

OpenAI has released GPT-5.5, its most capable model to date and the first fully retrained base...

Apr 23, 2026

A Coding Tutorial on OpenMythos on Recurrent-Depth Transformers with Depth Extrapolation, Adaptive Computation, and Mixture-of-Experts Routing

Source: MarkTechPost

In this tutorial, we explore the implementation of OpenMythos, a theoretical reconstruction of the Claude Mythos...

Apr 23, 2026

Google Cloud AI Research Introduces ReasoningBank: A Memory Framework that Distills Reasoning Strategies from Agent Successes and Failures

Source: MarkTechPost

Most AI agents today have a fundamental amnesia problem. Deploy one to browse the web, resolve...

Apr 23, 2026

Xiaomi Releases MiMo-V2.5-Pro and MiMo-V2.5: Matching Frontier Model Benchmarks at Significantly Lower Token Cost

Source: MarkTechPost

The Xiaomi MiMo team publicly released two new models: MiMo-V2.5-Pro and MiMo-V2.5. The benchmarks, combined with some...

Apr 23, 2026

Alibaba Qwen Team Releases Qwen3.6-27B: A Dense Open-Weight Model Outperforming 397B MoE on Agentic Coding Benchmarks

Source: MarkTechPost

Alibaba’s Qwen Team has released Qwen3.6-27B, the first dense open-weight model in the Qwen3.6 family...

Apr 22, 2026

Teaching AI models to say “I’m not sure”

Source: MIT News – Artificial intelligence

Confidence is persuasive. In artificial intelligence systems, it is often misleading. Today’s...

Apr 22, 2026