dots.ocr: Multilingual Document Layout Parsing in a Single Vision-Language Model Paper • 2512.02498 • Published Dec 2, 2025 • 2
InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery Paper • 2602.08990 • Published 8 days ago • 68
DFlash: Block Diffusion for Flash Speculative Decoding Paper • 2602.06036 • Published 12 days ago • 41
Privasis: Synthesizing the Largest "Public" Private Dataset from Scratch Paper • 2602.03183 • Published 15 days ago • 11
Build error Featured 101 Qwen3-ASR Demo 🎙 101 Transcribe audio to text with multi-language timestamps
LightOnOCR-2 🦉 Collection LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family • 12 items • Updated 28 days ago • 22