
Machine Translation

MT, Neural Machine Translation, NMT
Automatically translating text from one language to another. Modern neural machine translation (NMT) uses encoder-decoder Transformers trained on parallel corpora (texts paired with their translations). Google Translate, DeepL, and LLM-based translation all use variants of this approach. Quality has improved dramatically: for common language pairs, MT now approaches professional human translation on routine content.

Why It Matters

Machine translation breaks down language barriers at scale. It enables global commerce, cross-language search, real-time communication, and access to information across languages. For AI, MT is how models trained mostly on English can serve users in 100+ languages, and it is why multilingual tokenizer efficiency matters for cost.

Deep Dive

Modern NMT uses the encoder-decoder Transformer architecture: the encoder processes the source sentence, and the decoder generates the target sentence token by token, attending to the encoded source through cross-attention. Training requires parallel corpora — millions of sentence pairs in both languages. Data quality and domain match are critical: a model trained on EU Parliament proceedings translates legal text well but informal chat poorly.
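
To make this concrete, here is a minimal sketch of running a pretrained encoder-decoder NMT model through the Hugging Face transformers library. The MarianMT checkpoint name (Helsinki-NLP/opus-mt-en-fr) is one published OPUS-MT model among many; any pair-specific checkpoint would follow the same encode, generate, decode flow described above.

```python
# Minimal sketch: translating with a pretrained encoder-decoder NMT model.
# Assumes the `transformers` and `torch` packages are installed and the
# Helsinki-NLP/opus-mt-en-fr checkpoint can be downloaded.
from transformers import MarianMTModel, MarianTokenizer

model_name = "Helsinki-NLP/opus-mt-en-fr"
tokenizer = MarianTokenizer.from_pretrained(model_name)
model = MarianMTModel.from_pretrained(model_name)

source = ["Machine translation breaks down language barriers at scale."]

# The tokenizer encodes the source sentence for the encoder...
batch = tokenizer(source, return_tensors="pt", padding=True)

# ...and generate() runs the decoder token by token, attending to the
# encoded source through cross-attention at every step.
generated = model.generate(**batch)
print(tokenizer.batch_decode(generated, skip_special_tokens=True))
```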

LLMs as Translators

Large language models have become competitive translators, sometimes exceeding dedicated MT systems for high-resource language pairs. Their advantage: they understand context, idioms, and cultural nuances better because they've seen language used in diverse contexts. Their disadvantage: they're much slower and more expensive per sentence than dedicated MT models. For real-time translation of millions of sentences, dedicated models (like those behind Google Translate) are necessary. For quality-critical translation of smaller volumes, LLMs often produce more natural results.
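
For the LLM route, translation is just a prompt. The sketch below uses the OpenAI Python client; the model name and prompt wording are illustrative assumptions, and any chat-style LLM endpoint would work the same way.

```python
# Minimal sketch: using a general-purpose LLM as a translator via the
# OpenAI Python client (openai >= 1.0). The model name is an assumption;
# substitute whatever chat model you have access to.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def llm_translate(text: str, target_language: str) -> str:
    response = client.chat.completions.create(
        model="gpt-4o-mini",  # assumed model name
        messages=[
            {"role": "system",
             "content": f"Translate the user's message into {target_language}. "
                        "Preserve tone and idioms; output only the translation."},
            {"role": "user", "content": text},
        ],
    )
    return response.choices[0].message.content

print(llm_translate("It's raining cats and dogs.", "German"))
```

Note the per-sentence cost trade-off described above: each call here pays full LLM inference for one sentence, which is why dedicated models still win for bulk translation.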

The Long Tail of Languages

MT quality varies enormously across language pairs. English-French, English-Spanish, and English-Chinese are well-served (abundant training data). But for the world's 7,000+ languages, most pairs have little or no parallel training data. Low-resource translation remains an active research area, with approaches including: zero-shot translation through multilingual models, back-translation (using the MT system itself to generate synthetic training data), and transfer learning from related languages.
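
Of these approaches, back-translation is the easiest to illustrate: take monolingual text in the target language, machine-translate it into the source language, and treat the (synthetic source, real target) pairs as extra training data for the forward model. A minimal sketch, assuming a pretrained reverse-direction OPUS-MT checkpoint; a real low-resource pipeline would use a reverse model for its own language pair.

```python
# Minimal sketch of back-translation: generate synthetic parallel data by
# translating monolingual *target-side* text back into the source language.
# Assumes the Helsinki-NLP/opus-mt-fr-en checkpoint is available.
from transformers import MarianMTModel, MarianTokenizer

reverse_name = "Helsinki-NLP/opus-mt-fr-en"  # target -> source direction
tokenizer = MarianTokenizer.from_pretrained(reverse_name)
reverse_model = MarianMTModel.from_pretrained(reverse_name)

# Monolingual French text (the target side we actually care about).
monolingual_target = [
    "La traduction automatique progresse rapidement.",
    "Les corpus parallèles restent rares pour beaucoup de langues.",
]

batch = tokenizer(monolingual_target, return_tensors="pt", padding=True)
synthetic_source = tokenizer.batch_decode(
    reverse_model.generate(**batch), skip_special_tokens=True
)

# (synthetic English, real French) pairs become extra training examples
# for the forward en->fr model.
for src, tgt in zip(synthetic_source, monolingual_target):
    print(f"{src!r} -> {tgt!r}")
```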
