Researchers from Renmin University and Huawei Propose MemEngine: A Unified Modular AI Library for Customizing Memory in LLM-Based Agents
Source: MarkTechPost LLM-based agents are increasingly used across various applications because they handle complex tasks and assume multiple...
Salesforce AI Researchers Introduce UAEval4RAG: A New Benchmark to Evaluate RAG Systems’ Ability to Reject Unanswerable Queries
Source: MarkTechPost While RAG enables responses without extensive model retraining, current evaluation frameworks focus on accuracy and relevance...

Chain-of-Thought May Not Be a Window into AI’s Reasoning: Anthropic’s New Study Reveals Hidden Gaps
Source: MarkTechPost Chain-of-thought (CoT) prompting has become a popular method for improving and interpreting the reasoning processes of...
Reinforcement Learning Makes LLMs Search-Savvy: Ant Group Researchers Introduce SEM to Optimize Tool Usage and Reasoning Efficiency
Source: MarkTechPost Recent progress in LLMs has shown their potential in performing complex reasoning tasks and effectively using...
LLMs Struggle to Act on What They Know: Google DeepMind Researchers Use Reinforcement Learning Fine-Tuning to Bridge the Knowing-Doing Gap
Source: MarkTechPost Language models trained on vast internet-scale datasets have become prominent language understanding and generation tools. Their...
SWE-Bench Performance Reaches 50.8% Without Tool Use: A Case for Monolithic State-in-Context Agents
Source: MarkTechPost Recent advancements in LM agents have shown promising potential for automating intricate real-world tasks. These agents...
Google Researchers Introduce LightLab: A Diffusion-Based AI Method for Physically Plausible, Fine-Grained Light Control in Single Images
Source: MarkTechPost Manipulating lighting conditions in images post-capture is challenging. Traditional approaches rely on 3D graphics methods that...
This AI paper from DeepSeek-AI Explores How DeepSeek-V3 Delivers High-Performance Language Modeling by Minimizing Hardware Overhead and Maximizing Computational Efficiency
Source: MarkTechPost The growth in developing and deploying large language models (LLMs) is closely tied to architectural innovations,...
Salesforce AI Releases BLIP3-o: A Fully Open-Source Unified Multimodal Model Built with CLIP Embeddings and Flow Matching for Image Understanding and Generation
Source: MarkTechPost Multimodal modeling focuses on building systems to understand and generate content across visual and textual formats....
DanceGRPO: A Unified Framework for Reinforcement Learning in Visual Generation Across Multiple Paradigms and Tasks
Source: MarkTechPost Recent advances in generative models, especially diffusion models and rectified flows, have revolutionized visual content creation...