Zyphra Releases ZAYA1-8B-Diffusion-Preview: The First MoE Diffusion Model Converted From an Autoregressive LLM With Up to 7.7x Speedup
Source: MarkTechPost Zyphra, the San Francisco-based AI lab behind the ZAYA1 model family, released ZAYA1-8B-Diffusion-Preview — a preview...
Nous Research Releases Token Superposition Training to Speed Up LLM Pre-Training by Up to 2.5x Across 270M to 10B Parameter Models
Source: MarkTechPost Pre-training large language models is expensive enough that even modest efficiency improvements can translate into meaningful...
Mira Murati’s Thinking Machines Lab Introduces Interaction Models: A Native Multimodal Architecture for Real-Time Human-AI Collaboration
Source: MarkTechPost Most AI systems today work in turns. You type or speak, the model waits, processes your...
Universal AI is “a pathway to AI fluency that’s accessible and approachable to anyone, anywhere”
Source: MIT News – Artificial intelligence “Artificial intelligence is not just for computer scientists anymore; it’s going to...
Tilde Research Introduces Aurora: A Leverage-Aware Optimizer That Fixes a Hidden Neuron Death Problem in Muon
Source: MarkTechPost Researchers at Tilde Research have released Aurora, a new optimizer for training neural networks that addresses...
Meta and Stanford Researchers Propose Fast Byte Latent Transformer That Reduces Inference Memory Bandwidth by Over 50% Without Tokenization
Source: MarkTechPost A team of researchers from Meta, Stanford University, and the University of Washington has introduced three...
Sakana AI and NVIDIA Introduce TwELL with CUDA Kernels for 20.5% Inference and 21.9% Training Speedup in LLMs
Source: MarkTechPost Scaling large language models (LLMs) is expensive. Every token processed during inference and every gradient computed...
NVIDIA AI Just Released cuda-oxide: An Experimental Rust-to-CUDA Compiler Backend that Compiles SIMT GPU Kernels Directly to PTX
Source: MarkTechPost NVIDIA AI researchers recently released cuda-oxide, an experimental compiler that allows developers to write CUDA SIMT...
NVIDIA AI Releases Star Elastic: One Checkpoint that Contains 30B, 23B, and 12B Reasoning Models with Zero-Shot Slicing
Source: MarkTechPost Training a family of large language models (LLMs) has always come with a painful multiplier: every...
OpenAI Adds Chrome Extension to Codex, Letting Its AI Agent Access LinkedIn, Salesforce, Gmail, and Internal Tools via Signed-In Sessions
Source: MarkTechPost OpenAI has launched a Codex Chrome extension for Mac and PC to streamline browser-based workflows that...