Google Drops Gemini 3.1 Flash-Lite: A Cost-efficient Powerhouse with Adjustable Thinking Levels Designed for High-Scale Production AI
Source: MarkTechPost Google has released Gemini 3.1 Flash-Lite, the most cost-efficient entry in the Gemini 3 model series....
Alibaba Releases OpenSandbox to Provide Software Developers with a Unified, Secure, and Scalable API for Autonomous AI Agent Execution
Source: MarkTechPost Alibaba has released OpenSandbox, an open-source tool designed to provide AI agents with secure, isolated environments...
Google AI Introduces STATIC: A Sparse Matrix Framework Delivering 948x Faster Constrained Decoding for LLM Based Generative Retrieval
Source: MarkTechPost In industrial recommendation systems, the shift toward Generative Retrieval (GR) is replacing traditional embedding-based nearest neighbor...
Alibaba Team Open-Sources CoPaw: A High-Performance Personal Agent Workstation for Developers to Scale Multi-Channel AI Workflows and Memory
Source: MarkTechPost As the industry moves from simple Large Language Model (LLM) inference toward autonomous agentic systems, the...
Google DeepMind Introduces Unified Latents (UL): A Machine Learning Framework that Jointly Regularizes Latents Using a Diffusion Prior and Decoder
Source: MarkTechPost Generative AI’s current trajectory relies heavily on Latent Diffusion Models (LDMs) to manage the computational cost...
Sakana AI Introduces Doc-to-LoRA and Text-to-LoRA: Hypernetworks that Instantly Internalize Long Contexts and Adapt LLMs via Zero-Shot Natural Language
Source: MarkTechPost Customizing Large Language Models (LLMs) currently presents a significant engineering trade-off between the flexibility of In-Context...
Perplexity Just Released pplx-embed: New SOTA Qwen3 Bidirectional Embedding Models for Web-Scale Retrieval Tasks
Source: MarkTechPost Perplexity has released pplx-embed, a collection of multilingual embedding models optimized for large-scale retrieval tasks. These...
Microsoft Research Introduces CORPGEN To Manage Multi Horizon Tasks For Autonomous AI Agents Using Hierarchical Planning and Memory
Source: MarkTechPost Microsoft researchers have introduced CORPGEN, an architecture-agnostic framework designed to manage the complexities of realistic organizational...
Google AI Just Released Nano-Banana 2: The New AI Model Featuring Advanced Subject Consistency and Sub-Second 4K Image Synthesis Performance
Source: MarkTechPost In the escalating ‘race of “smaller, faster, cheaper’ AI, Google just dropped a heavy-hitting payload. The...
Nous Research Releases ‘Hermes Agent’ to Fix AI Forgetfulness with Multi-Level Memory and Dedicated Remote Terminal Access Support
Source: MarkTechPost In the current AI landscape, we’ve become accustomed to the ‘ephemeral agent’—a brilliant but forgetful assistant...