Zubnet AIApprendreWiki › DALL-E
Models

DALL-E

DALL-E 2, DALL-E 3
La famille de modèles de génération d'images d'OpenAI. DALL-E 1 (2021) utilisait une approche discrete VAE + Transformer. DALL-E 2 (2022) utilisait CLIP + diffusion. DALL-E 3 (2023) est intégré à ChatGPT et met l'emphase sur le suivi de prompt — il utilise un LLM pour réécrire les prompts utilisateurs en descriptions d'images détaillées avant génération, améliorant significativement le match entre ce que tu demandes et ce que tu obtiens.

Pourquoi c'est important

DALL-E a été le modèle qui a fait connaître la génération d'images IA au grand public. Le lancement de DALL-E 2 en 2022 est devenu viral et a déclenché à la fois l'excitation et les inquiétudes sur l'imagerie générée par IA. L'intégration de DALL-E 3 avec ChatGPT a rendu la génération d'images accessible à des centaines de millions d'utilisateurs. Son innovation de réécriture de prompts a influencé comment d'autres modèles gèrent la conversion text-to-image.

Deep Dive

DALL-E 3's key innovation: instead of feeding user prompts directly to the image model, it uses GPT-4 to expand vague prompts into detailed image descriptions. "A cat" becomes "A fluffy orange tabby cat sitting on a windowsill, afternoon sunlight streaming in, photorealistic style, warm tones." This prompt rewriting dramatically improves output quality because diffusion models respond better to detailed descriptions than to short prompts.

Safety Measures

DALL-E has the most aggressive safety filters in the industry: it refuses to generate images of real public figures, violent content, and sexual content. It also uses C2PA metadata (Content Credentials) to mark images as AI-generated. These safety choices limit DALL-E's flexibility compared to open alternatives (Stable Diffusion, Flux) but reflect OpenAI's approach to responsible deployment. The trade-off between safety and creative freedom is a defining tension in image generation.

API and Integration

DALL-E 3 is available through OpenAI's API and through ChatGPT. The API provides more control (image size, quality settings, style parameter) but the ChatGPT integration is more popular because it handles prompt engineering automatically. The integration model — LLM + image generator as a unified experience rather than separate tools — influenced competitors and is becoming the standard for consumer image generation.

Concepts liés

In The News

Google's LangExtract Turns Document Processing Into Assembly Line Code
Apr 10, 2026
Florida AG targets OpenAI over FSU shooting, escalating AI accountability wars
Apr 10, 2026
Anthropic Grabs 73% of New Enterprise AI Spend as OpenAI Scrambles
Apr 10, 2026
Musk Wants Altman Fired as OpenAI's Legal War Escalates
Apr 09, 2026
OpenAI's child safety blueprint: real protection or performative policy?
Apr 08, 2026
See all 23 articles about DALL-E →
← Tous les termes
ESC