IBM released two new open speech recognition models— Granite Speech 4.1 2B and Granite Speech 4.1 2B-NAR …
releases
-
-
TECH
Photon Releases Spectrum: An Open-Source TypeScript Framework that Deploys AI Agents Directly to iMessage, WhatsApp, and Telegram
by Techaiappby Techaiapp 6 minutes readFor all the progress made in AI agent development over the past few years, one fundamental problem …
-
TECH
MiniMax Releases MMX-CLI: A Command-Line Interface That Gives AI Agents Native Access to Image, Video, Speech, Music, Vision, and Search
by Techaiappby Techaiapp 5 minutes readMiniMax, the AI research company behind the MiniMax omni-modal model stack, has released MMX-CLI — Node.js-based command-line …
-
TECH
Hugging Face Releases TRL v1.0: A Unified Post-Training Stack for SFT, Reward Modeling, DPO, and GRPO Workflows
by Techaiappby Techaiapp 5 minutes readHugging Face has officially released TRL (Transformer Reinforcement Learning) v1.0, marking a pivotal transition for the library …
-
TECH
Liquid AI Releases LocalCowork Powered By LFM2-24B-A2B to Execute Privacy-First Agent Workflows Locally Via Model Context Protocol (MCP)
by Techaiappby Techaiapp 3 minutes readLiquid AI has released LFM2-24B-A2B, a model optimized for local, low-latency tool dispatch, alongside LocalCowork, an open-source …
-
TECH
FireRedTeam Releases FireRed-OCR-2B Utilizing GRPO to Solve Structural Hallucinations in Tables and LaTeX for Software Developers
by Techaiappby Techaiapp 4 minutes readDocument digitization has long been a multi-stage problem: first detect the layout, then extract the text, and …
-
TECH
ByteDance Releases Protenix-v1: A New Open-Source Model Achieving AF3-Level Performance in Biomolecular Structure Prediction
by Techaiappby Techaiapp 4 minutes readHow close can an open model get to AlphaFold3-level accuracy when it matches training data, model scale …
-
TECH
Moonshot AI Releases Kimi K2.5: An Open Source Visual Agentic Intelligence Model with Native Swarm Execution
by Techaiappby Techaiapp 5 minutes readMoonshot AI has released Kimi K2.5 as an open source visual agentic intelligence model. It combines a …
-
TECH
Google AI Releases TranslateGemma: A New Family of Open Translation Models Built on Gemma 3 with Support for 55 Languages
by Techaiappby Techaiapp 7 minutes readGoogle AI has released TranslateGemma, a suite of open machine translation models built on Gemma 3 and …
-
TECH
StepFun AI Releases Step-Audio-R1: A New Audio LLM that Finally Benefits from Test Time Compute Scaling
by Techaiappby Techaiapp 7 minutes readWhy do current audio AI models often perform worse when they generate longer reasoning instead of grounding …