DeepSeek AI Researchers Introduce Engram: A Conditional Memory Axis For Sparse LLMs
Source: MarkTechPost Transformers use attention and Mixture-of-Experts to scale computation, but they still lack a native way to...
Google AI Releases MedGemma-1.5: The Latest Update to their Open Medical AI Models for Developers
Source: MarkTechPost Google Research has expanded its Health AI Developer Foundations program (HAI-DEF) with the release of MedGemma-1.5....
Understanding the Layers of AI Observability in the Age of LLMs
Source: MarkTechPost Artificial intelligence (AI) observability refers to the ability to understand, monitor, and evaluate AI systems by...
Google AI Releases Universal Commerce Protocol (UCP): An Open-Source Standard Designed to Power the Next Generation of Agentic Commerce
Source: MarkTechPost Can AI shopping agents move beyond sending product links and actually complete trusted purchases end to...
How This Agentic Memory Research Unifies Long Term and Short Term Memory for LLM Agents
Source: MarkTechPost How do you design an LLM agent that decides for itself what to store in long...
Meta and Harvard Researchers Introduce the Confucius Code Agent (CCA): A Software Engineering Agent that can Operate at Large-Scale Codebases
Source: MarkTechPost How far can a mid-sized language model go if the real innovation moves from the...
Stanford Researchers Build SleepFM Clinical: A Multimodal Sleep Foundation AI Model for 130+ Disease Prediction
Source: MarkTechPost A team of Stanford Medicine researchers has introduced SleepFM Clinical, a multimodal sleep foundation model that...
TII Abu-Dhabi Released Falcon H1R-7B: A New Reasoning Model Outperforming Others in Math and Coding with only 7B Params with 256k Context Window
Source: MarkTechPost Technology Innovation Institute (TII), Abu Dhabi, has released Falcon-H1R-7B, a 7B-parameter, reasoning-specialized model that...
Liquid AI Releases LFM2.5: A Compact AI Model Family For Real On Device Agents
Source: MarkTechPost Liquid AI has introduced LFM2.5, a new generation of small foundation models built on the LFM2...
Tencent Researchers Release Tencent HY-MT1.5: A New Translation Model Family Featuring 1.8B and 7B Models Designed for Seamless on-Device and Cloud Deployment
Source: MarkTechPost Tencent Hunyuan researchers have released HY-MT1.5, a multilingual machine translation family that targets both mobile devices...