Zubnet AI學習Wiki › Super Resolution
Using AI

Super Resolution

Upscaling, Image Enhancement, SR
透過生成原圖中沒有的合理細節來提高影像的解析度。一張 256×256 的照片變成清晰的 1024×1024 影像。AI 超解析度不只是內插像素(那會產生模糊) — 它基於從高解析度訓練影像中學到的東西,「幻覺」出逼真的紋理、邊緣和細節。

為什麼重要

超解析度有立即的實用應用:增強老照片、升級電玩遊戲紋理、改善安防攝影機畫面、為印刷準備低解析度影像,以及作為 AI 影像生成 pipeline 的後處理步驟。Real-ESRGAN 和類似模型能用單次推理大幅改善影像品質。

Deep Dive

Classical upscaling (bilinear, bicubic interpolation) produces smooth, blurry results because it averages neighboring pixels. AI super resolution models (ESRGAN, Real-ESRGAN, SwinIR) learn to predict what high-frequency detail (sharp edges, textures, fine patterns) should look like given the low-resolution input. They're trained on pairs of high-res images and their downscaled versions, learning the mapping from low to high resolution.

The Hallucination Trade-off

AI upscaling necessarily invents detail that isn't in the original image. A blurry face gets plausible-looking features that may not match the actual person. Text becomes readable but may contain wrong letters. This is fine for artistic enhancement but problematic for forensic applications (security footage, medical imaging) where invented detail could be mistaken for real evidence. The output looks convincing but isn't faithful.

In Image Generation Pipelines

Many image generation workflows use a two-stage approach: generate at a lower resolution (faster, cheaper) then upscale with a super resolution model. Stable Diffusion's "hires fix" does exactly this. The base generation handles composition and content; the upscaler adds fine detail and sharpness. This is more efficient than generating at high resolution directly, especially for models that are compute-intensive per pixel.

相關概念

← 所有術語
ESC