LongCat-Next Open Source Release: A native multimodal model that unifies text, image, and audio
- Abstract LongCat-Next is an open-source, discrete, native autoregressive multimodal model from Meituan's LongCat team, with the goal of unifying text...
1. Abstract HY3D-Bench is an open-sourced unified 3D asset data ecosystem by Tencent's Hunyuan team, with the goal of alleviating the common pain poin...
1. Abstract Qwen3-Coder-Next is an open-weight code model released by the Qwen team, which is suitable for coding agents and local development sc...
1. Abstract Youtu-VL-4B-Instruct is a compact vision-language model (4B parameters) open-sourced by Tencent Youtu, which proposes VLUAS (Vision-Languag...
1. Abstract PaddleOCR-VL-1.5 is an open-source 0.9B-parameter multimodal document model from PaddlePaddle, which provides integrated capabilities...
1. Abstract PaddleOCR is an open-source OCR and document parsing toolbox based on PaddlePaddle, which provides "text recognition + structured extracti...