Your First AI Image.
Step by step, from blank prompt to finished visual. No experience needed.
After this lesson you'll know
- How to write your first image prompt from scratch
- The basic anatomy of a good prompt: subject, style, details, mood
- How to iterate quickly when the first result is not quite right
- How to save, download, and use your generated images
Let's use ChatGPT for this walkthrough. It is the simplest starting point.
Open chatgpt.com and sign in (a free account works for a few image generations). If you prefer another tool from Lesson 2, the principles are identical — only the interface changes. We are using ChatGPT here because you can type your prompt in natural language, just like having a conversation.
Your first prompt should be one clear sentence.
Resist the urge to write a novel. Start with something direct and visual. Here is your first prompt:
Look at what you got. Then ask: what would make it better?
Your first image will appear in seconds. It will probably look good — maybe even great. But now is when the real skill begins. Look at the image carefully. Is the dog the right breed? Is the sunset the color you imagined? Are there enough wildflowers? Is the mood right?
This is the core loop of AI image creation: generate, evaluate, refine. Every professional who uses these tools does exactly this. The magic is not in the first result. It is in knowing how to guide the next one.
Now layer in specifics to steer the image closer to your vision.
Take your original prompt and add details. Each detail gives the AI more information to work with:
Every great prompt has four parts.
Subject: What is in the image? (A golden retriever puppy)
Setting: Where is it? (In a field of purple and yellow wildflowers)
Mood/Lighting: How does it feel? (Golden hour, soft warm lighting)
Style: What does it look like? (Photograph style, shallow depth of field)
A Japanese maple bonsai tree in full autumn color,
sitting on a weathered stone pedestal in a misty zen garden,
soft diffused morning light with gentle fog,
fine art photography, shallow depth of field, 8K detail
You do not need all four every time, but including more of them gives the AI a clearer picture of what you want. Think of it as painting with words — the more specific your words, the more specific the result.
The third or fourth version is usually the one you love.
Do not expect perfection on attempt one. Professional designers using these tools typically generate 3-10 variations before they find the one that works. Each iteration teaches you something about how the AI interprets language. Maybe "cinematic" gives you a different mood than "dramatic." Maybe "watercolor" gives you something softer than "painted." This is how you build your prompt vocabulary.
Try it now
Create three images using the same subject but different styles. Start with:
- "A cat sitting on a windowsill, photograph"
- "A cat sitting on a windowsill, oil painting style"
- "A cat sitting on a windowsill, Studio Ghibli anime style"
Notice how the same subject transforms completely based on the style keyword. Save all three — you just created your first intentional variation set. That is a real skill.
Download everything you like. Build a personal library from day one.
In ChatGPT, click the image to expand it, then use the download button. In Midjourney, upscale the image first, then save it. Create a folder on your computer called "AI Visuals" and start organizing by project or style. This library will become incredibly valuable as you develop your eye for what works.
Small prompt changes create big visual differences.
As you practice, you will notice that certain words have outsized impact on results. Here are practical tips that immediately improve your images:
Front-load the important stuff: AI tools pay more attention to words at the beginning of your prompt. Put your most important element first. "A golden retriever in a field" prioritizes the dog. "A vast wildflower field with a small golden retriever" prioritizes the landscape.
Use quality modifiers: Adding "high resolution," "detailed," "8K," or "professional photography" nudges the AI toward higher quality output. These modifiers act as quality floor-raisers — they do not guarantee perfection, but they reduce the chance of getting something blurry or low-effort.
Specify what you do not want: If you keep getting something in your images that you do not want — text overlays, borders, watermarks — add "no text, no watermark, no border" to your prompt. In ChatGPT, you can also just say "please do not include any text in the image."
Be specific about quantity: "Some flowers" is vague. "A field of hundreds of purple wildflowers" is specific. "Three birds in the sky" is specific. The more precise your numbers and quantities, the more control you have.
Describe relationships: "A cat and a ball" might put them anywhere. "A cat batting a red ball with its paw on a hardwood floor" tells the AI exactly how the elements relate to each other spatially.
Avoid these and you will be ahead of 90% of beginners.
Mistake 1 — Being too vague: "A nice picture" gives the AI almost nothing to work with. "A cozy reading nook with warm lamp light, a stack of books, and rain on the window" gives it a clear vision. The more specific you are, the better your results.
Mistake 2 — Cramming too many subjects: "A dog, a cat, three birds, a mountain, a river, a castle, and a dragon" overwhelms the AI. Start with one main subject and build outward. Complex scenes work better when you add detail gradually through iteration rather than trying to get everything in one prompt.
Mistake 3 — Giving up after one try: Your first generation is a starting point, not the final result. Professional AI artists rarely use their first attempt. The iteration process — generate, evaluate, refine — is where the real magic happens. Give yourself permission to experiment.
Mistake 4 — Ignoring aspect ratio: If you need a vertical image for Instagram, say so in the prompt. If you need a wide landscape for a presentation, specify 16:9. Getting the right dimensions from the start saves you from awkward cropping later.
Mistake 5 — Forgetting about lighting: Lighting is the single most impactful thing you can add to a prompt. "A portrait" and "a portrait with soft golden hour lighting" produce dramatically different results. Always think about light.
Let's build a complete image together, step by step.
Here is a real walkthrough showing how iteration works in practice. Follow along with any AI image tool:
Attempt 1: Start minimal. Type "A coffee shop." Look at what you get. The AI will give you something — probably a generic coffee shop interior. Notice the style, the colors, the perspective.
Attempt 2: Add setting and mood. "A cozy independent coffee shop on a rainy afternoon, warm interior lighting." Now the AI has more to work with. Compare this to attempt 1 — see how the added context changed the mood and atmosphere.
Attempt 3: Add style and composition. "A cozy independent coffee shop on a rainy afternoon, warm interior lighting glowing through foggy windows, watercolor painting style, soft muted palette." Now you are directing both what it looks like and how it feels.
Attempt 4: Fine-tune with specific details. "A cozy independent coffee shop on a rainy afternoon, warm interior lighting glowing through foggy windows, a single empty chair by the window with a steaming cup, watercolor painting style, soft muted palette of amber, cream, and dusty blue." Each iteration gets you closer to the image in your mind.
This is the process. Four attempts, each building on the last. By attempt 4, you have something you could frame on your wall — and you built the skill of intentional prompt construction along the way.
The Four Prompt Elements
The same prompt behaves differently in each tool. Here is what to expect.
If you try the same prompt in different tools, the results will vary. Understanding these differences helps you choose the right tool for each project:
ChatGPT (DALL-E): Tends to be literal. "A golden retriever in a field" gives you exactly that — a golden retriever in a field. It follows instructions precisely. Conversational iteration is its superpower — say "make the sky more dramatic" and it adjusts just that element.
Midjourney: Tends to be artistic. The same prompt might give you a more painterly, cinematic, or stylized result. Midjourney adds its own aesthetic interpretation on top of your description. This is beautiful when you want art, but less ideal when you need exact precision.
Stable Diffusion: Depends heavily on which model and settings you use. The base model is more neutral, but community fine-tuned models can be specialized for anything — anime, photorealism, landscapes, portraits. The learning curve is steeper but the control is unmatched.
The best approach: pick one tool to learn first, get comfortable with it, and then explore others. You will carry your prompt skills across all tools — the fundamentals of clear description and intentional detail are universal.
Start organized from day one and thank yourself later.
Even on your very first day of creating AI images, start building good habits:
Save your prompts: Every time you write a prompt that produces a result you like, copy it into a document. This becomes your personal prompt library. In a month, you will have a collection of tested, proven prompts you can remix and reuse.
Name your files meaningfully: Instead of "image_001.png," save files with descriptive names: "golden-retriever-sunset-watercolor.png." When you have hundreds of images, these names become searchable.
Create project folders: Even if you only have one project now, create a folder structure: AI Visuals > [Project Name] > Drafts / Finals. This 30-second setup prevents the chaos of a single folder with thousands of unnamed images.
Note what works and what does not: Keep a running list of discoveries. "Adding 'shallow depth of field' makes portraits look professional." "The word 'cinematic' adds dramatic contrast." These notes compound over time into genuine expertise.
Your image is only valuable when you actually use it.
Generating a beautiful image is satisfying. Using it in your actual life is where the real value lives. Here are immediate uses for the images you create today:
Social media: Post your best generation on your social platform of choice. Share the prompt in the caption. The AI art community loves seeing both the image and the prompt that created it — it is a form of creative sharing that is unique to this medium.
Phone and desktop wallpaper: Your own custom wallpaper, created from your own imagination. This is one of the simplest and most satisfying uses of AI images. Change it whenever the mood strikes.
Messages and communication: Send a generated image to a friend that captures an inside joke, a shared memory, or just a mood. AI images make casual communication more expressive and fun.
Work presentations: Even a single well-chosen AI-generated slide background elevates an otherwise plain presentation. Use one of your new images in your next meeting deck.
The habit of using your creations, not just creating them, is what transforms AI image generation from a novelty into a genuine life skill.
You just created something from nothing. That is worth recognizing.
If you have followed along with the exercises in this lesson, you have already generated your first AI images. You have written prompts, evaluated results, iterated, and started building your visual vocabulary. That is real creative skill, developed in a single lesson.
In the next lesson, we go deeper into prompt craft — the art and science of writing prompts that produce exactly what you see in your mind. The basic four-part anatomy you learned here (subject, setting, mood, style) expands into a full framework with composition, lighting, color, and quality controls. Your images are about to get dramatically better.
Take your best image from today's exercises and save it somewhere you will see it. In a month, compare it to what you are creating then. The growth will surprise you.
Remember: you are not learning a tool. You are learning a new form of creative expression. The tool is just the instrument. The creative vision — what you see in your mind and how you describe it — that is the real skill. And you just took the first real step in developing it.
Before moving on, make sure you have at least three images saved in a folder on your computer. These are the foundation of your personal AI visual library — a collection that will grow with every lesson and every experiment. Label the folder "AI Visuals" and you are already building the organizational habit that Lesson 10 will formalize into a complete workflow system.
Also save the prompts that created your favorite images. Copy them into a note or document labeled "My Prompts." This tiny habit — saving what works — is the seed of the prompt library that will become your most valuable creative asset. Future you will be grateful you started on day one.
One more thing: show someone what you made. Share an image with a friend, post it on social media, or just show it to someone nearby.
The reaction you get — the surprise that you created something this beautiful from a text description — is one of the best parts of this whole journey. It never gets old. And sharing your work is not just about getting validation. It is about joining a community of creators who are all exploring this new medium together. Your first AI image is your entry ticket.