Back to AI is open source
Tencent Hunyuan Tencent-HY-MT1.5 Open Source: 1.8B End-Side Translation and 7B Cloud Collaborative Practical Interpretation

Tencent Hunyuan Tencent-HY-MT1.5 Open Source: 1.8B End-Side Translation and 7B Cloud Collaborative Practical Interpretation

AI is open source Admin 268 views

1. Abstract

Tencent-HY-MT1.5 (HY-MT) is Tencent's open source machine translation model suite, which includes two scales: 1.8B (partial side/low resource) and 7B (partial cloud/high quality). The official emphasizes its collaborative deployment for "device-side + cloud-side": low latency and low memory usage on the device, stronger quality and more robust complex scene performance on the cloud, and covering 33+ languages/dialects (including some Min-Chinese and Chinese dialects) mutual translation capabilities.

2. Core features

1. Dual-model coverage of the end cloud: 1.8B adapts to consumer-grade hardware and offline/real-time translation; 7B is a higher quality version and is suitable for cloud batch and high-demand scenarios.

2. Speed and resource friendliness: 1.8B provides a quantized version, with an official caliber of about 1GB of memory usage and low-latency data of 50 tokens (depending on your hardware and inference framework).

3. Production enhancement capabilities: Natively support term intervention (custom term comparison), long conversation contextual translation, and formatted text translation (keep labels/typography as much as possible).

4. Multilingual coverage: In addition to common Chinese, English, Japanese, etc., it also covers a variety of minor languages; It is suitable for cross-border e-commerce, content internationalization and multilingual customer service.

3. Installation

  1. Environment preparation: It is recommended to give priority to using the officially recommended Transformers version (the repository example is a fixed version number) and prepare the GPU/CPU inference environment.
  2. Obtain the model: Download the corresponding weights (1.8B/7B, FP8, GPTQ Int4, etc.) from Hugging Face.
  3. Inference method: Construct a translation prompt template according to the model card/warehouse example (different translation templates for Chinese and foreign translations, foreign translation, and term/context/format translation templates are different), and then call the generation interface to output the translation.

4. Typical use cases

  1. Offline translation on the device side: low-latency scenarios such as mobile, desktop, browser plug-ins, and input method/stroke translation.
  2. High-quality translation in the cloud: batch translation of documents, international content production, and multilingual knowledge base construction.
  3. Consistency in industry terminology: Medical, legal, financial, software engineering documents, etc. require "no drift in terminology" text.
  4. Multi-round dialogue and customer service: Use historical dialogue as the context to reduce pronoun referencing errors and style breaks.
  5. Web page/tag text: HTML/tagged text translation, try to maintain the original structure to facilitate backfilling and rendering.

5. Ecology and competing products

  1. Ecosystem: Provide GitHub engineering examples and technical reports; Hugging Face offers a variety of precision/quantization versions to facilitate the choice of different inference costs on the device side and in the cloud.
  2. Competitive product reference: the open source side can be compared with MarianMT, NLLB series, M2M100, SeamlessM4T, etc.; The closed-source side is commonly used to translate the translation capabilities of various translation APIs or general-purpose large models. Actual selection recommendations are A/B tested based on your language coverage, format retention, terminology consistency, and throughput/latency metrics.

6. Limitations and precautions

  1. Indicator transferability: Official speed/memory data and effect rankings usually rely on specific hardware, quantification, and inference configurations, and need to be retested on the target device before launch.
  2. Prompt Dependence: Term/context/format translation needs to organize input strictly according to the template, otherwise there may be explanatory output or format shifts.
  3. Small languages and colloquial styles: Long-tail languages, slang, and strong field texts may still be mistranslated/omitted, so it is recommended to introduce a glossary and manual sampling to close the loop.
  4. Consistency between the device and the cloud: If the device side and the cloud side use different versions/quantization accuracy, the output style may not be completely consistent, and it needs to be converged through prompt words and terminology strategies.

7. Project address

https://github.com/Tencent-Hunyuan/HY-MT

8. Frequently asked questions

Q: What "end-side translation" scenarios is HY-MT1.5-1.8B suitable for?

A: It is suitable for applications that are sensitive to latency, have limited device resources, and require offline availability, such as mobile translation, IM embedded translation, browser stroke translation, etc.

Q: How to choose between HY-MT1.5-7B and 1.8B? Do you have to choose one or the other?

A: Device-side priority 1.8B, cloud priority 7B; You can also output results on the device side and review/re-translate in the cloud to obtain more stable quality and consistency.

Q: How does HY-MT1.5's "Termbase/Terminology Intervention" work?

A: According to the official terminology prompt template, the comparison of "source term → target term" is injected as a constraint, and then the body text is translated to improve the consistency of the terminology.

Q: How does HY-MT1.5 do long conversation context translation?

A: Use the historical dialogue as a context block input and use the context translation template to let the model refer to the context before translating the current sentence.

Q: What texts does HY-MT1.5 use for format-preserving translations?

A: Suitable for text that contains tags or tags (e.g., web page snippets/structured snippets). It is recommended to use small samples to verify that the labels are stable and retained, and then expand to the batch process.

Tencent Hunyuan HY-MT1.5 Dual-Model Device-Cloud Translation Guide Tencent-HY-MT1.5 Device 1.8B Deployment Practice Tencent HY-MT1.5 Cloud 7B high-quality translation solution HY-MT1.5 covers 33+ language translation capabilities Tencent Open Source HY-MT Terminology Intervention Usage HY-MT1.5 Long Conversation Context Translation Skills Tencent HY-MT1.5 format retention translation tutorial HY-MT1.5 quantitative version memory and latency evaluation HY-MT1.5 device-cloud collaborative translation architecture design The whole process of Tencent-HY-MT1.5 installation and inference HY-MT1.5 prompt template and translation effect optimization Tencent HY-MT1.5 is an offline translation application on the device side HY-MT1.5 cloud batch document translation implemented HY-MT1.5 Multilingual Customer Service Conversation Translation Solution Tencent HY-MT1.5 cross-border e-commerce multilingual practice HY-MT1.5 Terminology Consistency in Medical Translation Applications HY-MT1.5 Legal Financial Terminology No Drift Scheme HY-MT1.5 Software Engineering Document Translation Best Practices HY-MT1.5 web page HTML tag translation and backfilling HY-MT1.5 end-side low-resource device inference configuration Comparative analysis of HY-MT1.5 and MarianMT models Differences and selection between HY-MT1.5 and NLLB series HY-MT1.5 compared with M2M100 translation quality evaluation HY-MT1.5 vs. SeamlessM4T capabilities HY-MT1.5 device-cloud consistency issues and solutions HY-MT1.5 Notes on minor language and colloquial body translation A/B test indicator list before HY-MT1.5 is launched HY-MT1.5 first translates the cloud review solution HY-MT1.5 Model Card Prompt and Engineering Example HY-MT1.5 HuggingFace Weight Download Guide HY-MT1.5 FP8 and GPTQ Int4 quantitative selection HY-MT1.5 inference framework selection and performance tuning HY-MT1.5 low latency 50tokens benchmark interpretation HY-MT1.5 takes up about 1GB of memory and takes up the implementation path HY-MT1.5 multilingual knowledge base construction process HY-MT1.5 content international production pipeline design HY-MT1.5 input method dividing word translation end-side solution HY-MT1.5 browser plug-in offline translation implementation HY-MT1.5 IM Embedded Real-time Translation Landing Guide HY-MT1.5 format translation to avoid label shift tips HY-MT1.5 glossary construction and injection practice HY-MT1.5 Context block organization and pronoun disambiguation HY-MT1.5 Robust Improvement Method for Complex Scenarios HY-MT1.5 device cloud different precision styles converge HY-MT1.5 production environment monitoring and manual sampling inspection HY-MT1.5 Strategies for Dealing with Long-tail Language Mistranslations and Omissions Tencent HY-MT1.5 project address and ecological resources HY-MT1.5 is fully analyzed from installation to typical use cases

Recommended Tools

More