Zubnet AI学习Wiki › Data Augmentation
Training

Data Augmentation

通过创造现有样本的修改版本来人为扩大训练数据集的技术。图像:翻转、旋转、裁剪、颜色偏移。文本:改写、回译、同义词替换。音频:速度改变、噪声注入。目标是教模型不变性 — 猫就是猫,无论图像被翻转、变暗还是裁剪。

为什么重要

数据增强是在数据有限时提升模型性能最便宜的方法。它通过给模型看每个样本的多个变体减少过拟合,教它关注本质特征而不是表面细节。在计算机视觉里,增强常规地免费提供 2–5% 精度提升。

Deep Dive

The key principle: augmentations must preserve the label. Flipping a cat image horizontally still shows a cat (valid augmentation). Flipping a "turn left" sign makes it a "turn right" sign (invalid augmentation). Choosing appropriate augmentations requires understanding what invariances matter for your task.

Modern Augmentation

AutoAugment and its successors (RandAugment, TrivialAugment) learn or randomize augmentation policies instead of hand-designing them. Cutout/CutMix randomly masks or mixes patches from different images. MixUp interpolates between pairs of examples, creating synthetic training points that smooth decision boundaries. These techniques are now standard in vision training pipelines.

AI-Powered Augmentation

With generative models, augmentation goes beyond geometric transforms. You can use LLMs to paraphrase text training data, use diffusion models to generate variant images, or use models to create entirely new training examples (synthetic data). The line between "augmentation" (modifying existing examples) and "synthetic data" (generating new examples) is blurring, and both are becoming essential parts of modern training pipelines.

相关概念

← 所有术语
← Cursor Data Centers →