Tencent Hunyuan has released HunyuanOCR, a 1B parameter vision language model that is specialized for OCR and …
Tag: VLM
-
TECH
Liquid AI’s LFM2-VL-3B Brings a 3B Parameter Vision Language Model (VLM) to Edge-Class Devices
by Techaiapp, 5 minutes read
Liquid AI released LFM2-VL-3B, a 3B parameter vision language model for image-text-to-text tasks. It …
-
TECH
DeepSeek Just Released a 3B OCR Model: A 3B VLM Designed for High-Performance OCR and Structured Document Conversion
by Techaiapp, 6 minutes read
DeepSeek-AI released 3B DeepSeek-OCR, an end-to-end OCR and document parsing Vision-Language Model (VLM) system that …
-
TECH
Hugging Face Releases Smol2Operator: A Fully Open-Source Pipeline to Train a 2.2B VLM into an Agentic GUI Coder
by Techaiapp, 3 minutes read
Hugging Face (HF) has released Smol2Operator, a reproducible, end-to-end recipe that turns a small vision-language model (VLM) …
-
TECH
Salesforce AI Research Propose Programmatic VLM Evaluation (PROVE): A New Benchmarking Paradigm for Evaluating VLM Responses to Open-Ended Queries
by Techaiapp, 4 minutes read
Vision-Language Models (VLMs) are increasingly used for generating responses to queries about visual content. Despite their progress, …