IBM AI Releases Granite 4.0 Tiny Preview: A Compact Open-Language Model Optimized for Long-Context and Instruction Tasks
Source: MarkTechPost IBM has introduced a preview of Granite 4.0 Tiny, the smallest member of its upcoming Granite...

Vision Foundation Models: Implementation and Business Applications
Source: MarkTechPost In this tutorial, we’ll explore implementing various vision foundation models for business applications. We’ll focus on...
Oversight at Scale Isn’t Guaranteed: MIT Researchers Quantify the Fragility of Nested AI Supervision with New Elo-Based Framework
Source: MarkTechPost Frontier AI companies show advancement toward artificial general intelligence (AGI), creating a need for techniques to...
LLMs Can Now Reason in Parallel: UC Berkeley and UCSF Researchers Introduce Adaptive Parallel Reasoning to Scale Inference Efficiently Without Exceeding Context Windows
Source: MarkTechPost Large language models (LLMs) have made significant strides in reasoning capabilities, exemplified by breakthrough systems like...
LLMs Can Learn Complex Math from Just One Example: Researchers from University of Washington, Microsoft, and USC Unlock the Power of 1-Shot Reinforcement Learning with Verifiable Reward
Source: MarkTechPost Recent advancements in LLMs such as OpenAI-o1, DeepSeek-R1, and Kimi-1.5 have significantly improved their performance on...
Subject-Driven Image Evaluation Gets Simpler: Google Researchers Introduce REFVNLI to Jointly Score Textual Alignment and Subject Consistency Without Costly APIs
Source: MarkTechPost Text-to-image (T2I) generation has evolved to include subject-driven approaches, which enhance standard T2I models by incorporating...

From ELIZA to Conversation Modeling: Evolution of Conversational AI Systems and Paradigms
Source: MarkTechPost TL;DR: Conversational AI has transformed from ELIZA’s simple rule-based systems in the 1960s to today’s sophisticated...

JetBrains Open Sources Mellum: A Developer-Centric Language Model for Code-Related Tasks
Source: MarkTechPost JetBrains has officially open-sourced Mellum, a purpose-built 4-billion-parameter language model tailored for software development tasks. Developed...
Meta and Booz Allen Deploy Space Llama: Open-Source AI Heads to the ISS for Onboard Decision-Making
Source: MarkTechPost In a significant step toward enabling autonomous AI systems in space, Meta and Booz Allen Hamilton...
Training LLM Agents Just Got More Stable: Researchers Introduce StarPO-S and RAGEN to Tackle Multi-Turn Reasoning and Collapse in Reinforcement Learning
Source: MarkTechPost Large language models (LLMs) face significant challenges when trained as autonomous agents in interactive environments. Unlike...