Back to AI is open source
Tencent Hy-MT1.5-1.8B-1.25bit open source: 440MB mobile phone offline translation model analysis

Tencent Hy-MT1.5-1.8B-1.25bit open source: 440MB mobile phone offline translation model analysis

AI is open source Admin 197 views

1. Abstract

Hy-MT1.5-1.8B-1.25bit is a low-bit offline translation model based on HY-MT1.5-1.8B launched by Tencent's hybrid team, with a parameter size of 1.8B and a quantized volume of about 440MB. It is deployed for mobile and edge devices, supporting 33 languages, 5 dialects/minority languages, and 1056 translation directions. According to the official technical report, the model has strong parameter efficiency in standard Chinese, foreign and English-foreign translation tasks, and can compete with some commercial translation APIs and larger-scale open source models. It is still recommended to review the specific effect according to the business language and test set.

2. Core features

  1. Extremely low volume: Compress the FP16 approximately 3.3GB model to approximately 440MB through Sherry 1.25-bit ternary quantization.
  2. Mobile offline: The official Android Demo APK is provided to experience offline translation in a network-free environment.
  3. Multilingual coverage: Support Chinese, English, Japanese, Korean, French, Spanish, Tibetan, Mongolian, Uyghur, Cantonese, etc.
  4. Translation Capabilities Enhancement: Support tasks such as terminology intervention, contextual translation, and format maintenance.
  5. Compression framework support: Relying on the AngelSlim toolchain, it covers quantization, deployment, and subsequent compression algorithm iterations.

3. Installation

  1. Android experience: Download the official Demo APK and experience offline translation directly after installation.
  2. Model weights: Download the 1.25-bit weights or GGUF version from Hugging Face.
  3. AngelSlim toolchain: can be installed by pip install angelslim; The source code method can be cloned from GitHub and then run python setup.py install.
  4. Note: The official documentation shows that the STQ kernel related to llama.cpp is still in progress, and the project deployment must be subject to the latest repository description.

4. Typical use cases

  1. Mobile offline translation: suitable for travel, cross-border communication, weak or no network scenarios.
  2. Privacy-sensitive text translation: Text can be processed locally, reducing the need to upload external services.
  3. Mutual translation of minor languages: covering Tibetan, Mongolian, Uyghur and other ethnic minority languages.
  4. Enterprise terminology translation: It can be combined with terminology intervention templates to improve the consistency of professional vocabulary.
  5. Cross-app word picking: The Android demo demonstrates the background word picking mode, which is suitable for translating emails, web pages, and chat content.

5. Ecology and competing products

  1. Ecosystem: The project also provides Hugging Face weights, GGUF version, Android demo, AngelSlim documentation, and GitHub repositories.
  2. Competing products: It can be compared with commercial services such as Google Translate, Microsoft Translator, DeepL, and Doubao Translation, as well as Tower-Plus, Qwen, and other open source MT models.
  3. Differences: The core advantages of Hy-MT1.5-1.8B-1.25bit are its small size, offline accessibility, and wide language coverage, making it suitable for end-side translation rather than only for cloud APIs.

6. Limitations and precautions

  1. The official benchmark is not the same as all real business scenarios, and needs to be retested according to the direction, field and text length.
  2. Low-bit quantization may produce unstable results in complex long sentences, technical terms, or low-resource language.
  3. Android APK is suitable for experience, and production deployment still needs to pay attention to permissions, update mechanisms, and security audits.
  4. GGUF and the end-side kernel ecosystem are still iterating, and the current inference backend support should be confirmed before deployment.
  5. When involving high-risk texts such as law, medical care, and contracts, manual review is recommended.

7. Project address

https://github.com/tencent/AngelSlim

8. Frequently asked questions

Q: Can Hy-MT1.5-1.8B-1.25bit be translated completely offline?

A: The official Android demo is for offline translation experience, and the model can be run locally after downloading; For actual deployment, please refer to the latest documentation.

Q: What languages does Hy-MT1.5-1.8B-1.25bit support?

A: There are 33 official languages and 5 types of dialects/minority languages, including Chinese, English, Japanese, Korean, Tibetan, Mongolian, Uyghur, Cantonese, etc.

Q: Which is better, Hy-MT1.5-1.8B-1.25bit or Google Translate?

A: The official report highlights its strong competitiveness on several standard benchmarks; However, Google Translate is a continuously updated commercial service, and it is recommended to test it according to specific language directions and domain text.

Q: Does 1.25-bit quantization lose translation quality?

A: Officially, Sherry quantifies near-lossless performance while compressing volume; The actual effect still depends on the language direction and the input content.

Hy-MT1.5-1.8B-1.25bit open source interpretation: 440MB mobile phone offline translation model Tencent Hy-MT1.5 low-bit translation model released: 33 languages and device-side deployment What is Hy-MT1.5-1.8B-1.25bit: 1.25-bit quantization and offline translation analysis 440MB translation large model on mobile phone: Hy-MT1.5-1.8B-1.25bit technical interpretation Hy-MT1.5-1.8B-1.25bit Getting Started Guide: Hugging Face Weights and Android Demo Tencent's AngelSlim new model: Hy-MT1.5-1.8B-1.25bit offline translation capability analysis 1.8B Parameter Translation on Mobile Devices: Hy-MT1.5-1.25bit Detailed Explanation Hy-MT1.5-1.8B-1.25bit vs. Business Translation API: Advantages and Limitations Sherry 1.25-bit Quantitative Analysis: How Hy-MT1.5 Compresses to 440MB What does it mean that Hy-MT1.5-1.8B-1.25bit supports 1056 translation directions New Selection of Offline Translation Models: Hy-MT1.5-1.8B-1.25bit Project Introduction Translation models that can be run on mobile phones: Hy-MT1.5-1.8B-1.25bit installation and experience Hy-MT1.5-1.8B-1.25bit and GGUF: Analysis of key points of device-side deployment Tencent Hunyuan Translation Model Hy-MT1.5-1.8B-1.25bit open-source information summary How AngelSlim supports Hy-MT1.5 low-bit quantization and deployment Hy-MT1.5-1.8B-1.25bit covers 33 languages and application scenarios From 3.3GB to 440MB: Hy-MT1.5-1.8B-1.25bit compression principle Hy-MT1.5-1.8B-1.25bit Android Demo Experience Guide Which offline translation scenarios is Hy-MT1.5-1.8B-1.25bit suitable for? Tencent's 440MB translation model is open source: Hy-MT1.5-1.8B-1.25bit highlights Is Hy-MT1.5-1.8B-1.25bit a substitute for online translation services? Hy-MT1.5-1.8B-1.25bit supports translation into Tibetan, Mongolian, and other minor languages 1.25-bit ternary quantitative model Hy-MT1.5-1.8B technical popularization What to pay attention to before deploying Hy-MT1.5-1.8B-1.25bit on-premises What is the difference between Hy-MT1.5-1.8B-1.25bit and the 2-bit version Hy-MT1.5-1.8B-1.25bit open source repository and model download address collation End-side model for privacy translation: Hy-MT1.5-1.8B-1.25bit How to achieve a webless translation experience with Hy-MT1.5-1.8B-1.25bit Tencent Hy-MT1.5-1.8B-1.25bit model card information interpretation Hy-MT1.5-1.8B-1.25bit terminology intervention and contextual translation capabilities Introduction to Hy-MT1.5-1.8B-1.25bit, a low-resource language translation model Hy-MT1.5-1.8B-1.25bit Offline Translation APK Download & Usage Instructions The application value of Hy-MT1.5-1.8B-1.25bit on edge devices Hy-MT1.5-1.8B-1.25bit and Microsoft Translator comparison ideas Hy-MT1.5-1.8B-1.25bit vs. DeepL: A Localized Deployment Perspective Comparison points of Hy-MT1.5-1.8B-1.25bit and Qwen translation capabilities AngelSlim quantitative toolchain with Hy-MT1.5 translation model practice Why is the model size of Hy-MT1.5-1.8B-1.25bit only 440MB? Is Hy-MT1.5-1.8B-1.25bit suitable for enterprise privatization translation? Hy-MT1.5-1.8B-1.25bit FAQs: Language, Deployment, and Accuracy Tencent's open source 1.25-bit translation model: Hy-MT1.5-1.8B-1.25bit full analysis What do you think of the FLORES benchmark and real effect of Hy-MT1.5-1.8B-1.25bit? Advantages and risks of Hy-MT1.5-1.8B-1.25bit small volume translation model How Hy-MT1.5-1.8B-1.25bit covers 1056 translation directions Mobile AI translation model Hy-MT1.5-1.8B-1.25bit open source highlights Does Hy-MT1.5-1.8B-1.25bit support background word translation? Hy-MT1.5-1.8B-1.25bit local translation model deployment route Hy-MT1.5-1.8B-1.25bit Summary of Papers and Project Materials Hy-MT1.5-1.8B-1.25bit: Quantization from Sherry to offline translation Is Hy-MT1.5-1.8B-1.25bit suitable for secondary integration by developers?

Recommended Tools

More