A-MEM: A Novel Agentic Memory System for LLM Agents that Enables Dynamic Memory Structuring without Relying on Static, Predetermined Memory Operations
Source: MarkTechPost
Current memory systems for large language model (LLM) agents often struggle with rigidity and a lack...
Microsoft AI Released LongRoPE2: A Near-Lossless Method to Extend Large Language Model Context Windows to 128K Tokens While Retaining Over 97% Short-Context Accuracy
Source: MarkTechPost
Large Language Models (LLMs) have advanced significantly, but a key limitation remains their inability to process...
Tencent AI Lab Introduces Unsupervised Prefix Fine-Tuning (UPFT): An Efficient Method that Trains Models on Only the First 8-32 Tokens of Single Self-Generated Solutions
Source: MarkTechPost
A more efficient approach to fine-tuning reasoning in large language models comes from recent work by researchers...