Salesforce AI Released APIGen-MT and xLAM-2-fc-r Model Series: Advancing Multi-Turn Agent Training with Verified Data Pipelines and Scalable LLM Architectures
Source: MarkTechPost AI agents quickly become core components in handling complex human interactions, particularly in business environments where...
Huawei Noah’s Ark Lab Released Dream 7B: A Powerful Open Diffusion Reasoning Model with Advanced Planning and Flexible Inference Capabilities
Source: MarkTechPost LLMs have revolutionized artificial intelligence, transforming various applications across industries. Autoregressive (AR) models dominate current text...
This AI Paper from ByteDance Introduces MegaScale-Infer: A Disaggregated Expert Parallelism System for Efficient and Scalable MoE-Based LLM Serving
Source: MarkTechPost Large language models are built on transformer architectures and power applications like chat, code generation, and...
This AI Paper Introduces an LLM+FOON Framework: A Graph-Validated Approach for Robotic Cooking Task Planning from Video Instructions
Source: MarkTechPost Robots are increasingly being developed for home environments, specifically to enable them to perform daily activities...
This AI Paper Introduces Inference-Time Scaling Techniques: Microsoft’s Deep Evaluation of Reasoning Models on Complex Tasks
Source: MarkTechPost Large language models are often praised for their linguistic fluency, but a growing area of focus...
RARE (Retrieval-Augmented Reasoning Modeling): A Scalable AI Framework for Domain-Specific Reasoning in Lightweight Language Models
Source: MarkTechPost LLMs have demonstrated strong general-purpose performance across various tasks, including mathematical reasoning and automation. However, they...

MMSearch-R1: End-to-End Reinforcement Learning for Active Image Search in LMMs
Source: MarkTechPost Large Multimodal Models (LMMs) have demonstrated remarkable capabilities when trained on extensive visual-text paired data, advancing...
Scalable and Principled Reward Modeling for LLMs: Enhancing Generalist Reward Models RMs with SPCT and Inference-Time Optimization
Source: MarkTechPost Reinforcement Learning RL has become a widely used post-training method for LLMs, enhancing capabilities like human...

Transformer Meets Diffusion: How the Transfusion Architecture Empowers GPT-4o’s Creativity
Source: MarkTechPost OpenAI’s GPT-4o represents a new milestone in multimodal AI: a single model capable of generating fluent...
This AI Paper from Anthropic Introduces Attribution Graphs: A New Interpretability Method to Trace Internal Reasoning in Claude 3.5 Haiku
Source: MarkTechPost While the outputs of large language models (LLMs) appear coherent and useful, the underlying mechanisms guiding...