How to use AI for ad creatives: where Gemini, GPT, and Claude each win

Five AI tools that actually ship ad creative. Where each one shines, where each one collapses, and the 4-step workflow that uses three of them without doubling the cost.

By Maddy

Founder, WeActive8Published May 8, 202612 min read

Summarize this article with:

AI Summary

Five AI tools that actually ship ad creative. Where each one shines, where each one collapses, and the 4-step workflow that uses three of them in sequence without doubling the cost.

Claude wins on copy. Long-form brand voice consistency. Ad scripts that stay on-tone across 20 variants. Loses on image generation entirely.
GPT wins on concepting + headline ideation. Brainstorms 50 hooks in 5 minutes. Loses on brand voice (drifts after 3 variants) and long-form copy (gets generic).
Gemini wins on multimodal reference + image-to-image. Best in 2026 at 'do this style but switch the subject.' Loses on copy nuance.
Midjourney wins on hero visuals. Loses everything text-related. Ideogram wins on text-in-image. Loses everything else.
The 4-step workflow: GPT concepting → Claude copywriting → Midjourney visual → Ideogram text overlay. 90 minutes per finished ad. Half the cost of in-house production.

Pick the wrong AI tool for the wrong job and you burn $200 a month and 8 hours a week on a workflow that produces fine-but-not-great ad creative. Pick the right tool for the right job and you ship 30 high-quality variants in a week. This piece is the working playbook from running ad-creative for 12 SMB engagements over the last 14 months.

A SaaS client of mine ran 80% of his Meta ads through ChatGPT alone for 6 months. Cost per result climbed 40%. The ads were not bad. They were sameish. Same hook structure. Same call-to-action shape. Same hero image vibe. The algorithm caught it and the audience caught it too.

He switched to a 3-tool workflow last quarter. Claude for copy. Midjourney for hero visuals. Ideogram for the in-image text. Cost per result dropped 30% in 6 weeks. Same offer. Same audience. Same budget. Different tool stack.

This piece is the working version of that workflow, plus where each of the five major AI tools wins and loses in 2026. Skip to the 4-step workflow at the bottom if you just want the recipe.

Where Claude wins (and where it fails)

Claude is the best LLM in 2026 for long-form copy that has to stay on brand voice across many variants. We have shipped 200+ ad variants through Claude. Voice drift across the full set: minimal. ChatGPT cannot match this. GPT drifts after 3 to 5 variants and starts producing generic 2018-Buzzfeed phrasing.

Where Claude shines:

Long-form ad copy (50+ words). Stays on tone. Sentences vary in length naturally.
Ad scripts for video. Handles the conversational rhythm better than GPT.
Brand voice projection from a 500-word style guide. Read once, apply consistently.
Editing AI-drafted copy from other tools. Best 'de-AI this draft' partner we have used.

Where Claude fails:

Image generation. Cannot.
Short punchy headlines (under 8 words). Tends to over-explain. GPT is better here.
Generating 50 different concepts fast. Slower at divergent thinking than GPT.

Where GPT wins (and where it fails)

GPT (ChatGPT) wins on speed and breadth of ideation. Ask for 50 ad hooks and 50 land in 5 minutes. Some are gold. Most are filler. The filtering is the work.

Where GPT shines:

Concepting at volume. 50 hook ideas in 5 minutes. The top 3 to 5 are usable.
Headline variants under 10 words. Short and punchy is GPT's natural rhythm.
Quick translation of one concept into 10 cultural variants. Decent at localization.
Structured outputs (JSON, tables, scripts) when you want copy back in a format.

Where GPT fails:

Long-form ad copy past 100 words. Voice drifts. Cliches creep in. Em-dashes everywhere.
Brand voice consistency across 20+ variants. Reverts to generic GPT voice by variant 5.
Nuanced critique of existing copy. Tends to over-edit. Says "tighten this" even when the original is good.

Where Gemini wins (and where it fails)

Gemini in 2026 has caught up on most LLM tasks and gone past the others on one thing: multimodal reference work. "Take this image style and apply it to a different subject" is a Gemini task. Neither Claude nor GPT match it.

Where Gemini shines:

Reference-image-driven generation. Upload a brand example, ask for variants in the same style.
Huge-CSV analysis of ad performance data. Handles bigger inputs than the others without choking.
Web-access research for competitor scans. Built-in search is solid.
Image editing prompts that need precise composition adjustments.

Where Gemini fails:

Copy nuance. The output reads slightly stilted compared to Claude.
Brand voice consistency across long sessions. Better than GPT but worse than Claude.
Aesthetic style for hero visuals. Midjourney remains ahead on look quality.

Where Midjourney wins (and where it fails)

Midjourney is still the king of hero visuals in 2026. Subject quality, lighting, atmosphere, none of the LLM-attached image models match it for pure aesthetic. Where it falls apart: anything text-related, anything photo-realistic with hands, anything requiring exact composition.

Where Midjourney shines:

Hero visuals for top-of-funnel ads. Aesthetic over functional.
Mood pieces, lifestyle imagery, abstract concept visuals.
Variations on a strong prompt. The 4-image grid is still the fastest variant generator.

Where Midjourney fails:

Text inside images. Use Ideogram instead.
Realistic product photography. Use a real photographer + retouching.
Exact-composition needs. Use Gemini or Photoshop's generative fill.

Where Ideogram wins

Ideogram is a single-use-case tool. Text inside images. That is what it does. Better than every other AI image generator at this. If your ad needs the words "50% off" inside the hero, Ideogram is the tool.

Beyond text-in-image, Ideogram is fine but not class-leading. Use it for what it wins.

The 4-step workflow

This is the working version we use on every client engagement. 90 minutes per finished ad. Half the cost of in-house production.

Step 1. GPT for concepting (15 minutes)

Open GPT. Paste the offer, the audience, the brand voice in 3 sentences. Ask for 50 ad hook concepts. Get 50 back in 5 minutes. Read all 50. Star the top 5.

Why GPT for this: speed and breadth. The 50-in-5-minutes is the right shape for divergent ideation. Claude can do this but is slower and produces a tighter set that filters less interestingly.

Step 2. Claude for copywriting (30 minutes)

Take the top 5 hooks to Claude. Provide the brand voice doc (or 3 example posts in your voice). Ask Claude to write 3 ad variants per hook. 15 variants total. Read all. Pick the top 5.

Why Claude for this: voice consistency. Across 15 variants Claude will hold the voice tightly. GPT would drift by variant 5.

Step 3. Midjourney for hero visual (30 minutes)

For each of the top 5 ad concepts, write a visual prompt. Run 2 rounds in Midjourney. First round: 4-image grid, pick the best. Second round: variations on the best. Final pick is your hero.

Why Midjourney for this: aesthetic. Other tools can produce technically correct images but the brand polish is harder to get.

Step 4. Ideogram for in-image text (15 minutes)

If the ad concept needs text inside the image (price, offer, urgency), Ideogram. Bring the Midjourney image as reference, prompt for the text overlay you want.

Why Ideogram for this: it is the only AI image tool that gets text inside images right at production quality.

What this workflow costs

Tool subscriptions (2026 pricing):

ChatGPT Plus: $20/month
Claude Pro: $20/month
Midjourney Standard: $30/month
Ideogram Plus: $20/month
Gemini Advanced: $20/month (if you add it)

Total: $90 to $110 a month for the full stack. For an SMB shipping 8 to 20 ads a month, that is $5 to $14 per finished ad in tool cost. Compare to in-house production at $100 to $400 per finished ad. The economics are obvious.

The non-obvious cost: 90 minutes per ad in human time to operate the workflow. If your time is worth $100 an hour, the human cost is $150 per ad. Still cheaper than in-house. Still more expensive than the "ChatGPT alone" workflow that produces sameish creative.

Frequently asked questions

Why not just use ChatGPT for everything?

We tried. So did our clients. Output quality is fine for the first 5 variants and degrades after. Same hook structure, same closing sentence pattern, same metaphor space. The algorithm catches it. The audience catches it. Mixing 3 tools (Claude + GPT + Midjourney) keeps the creative fresh because the tools fail in different directions.

Which subscription is most worth it if I can only afford one?

Claude Pro. It is the best at the highest-leverage task (long-form copy with brand voice consistency). You can fake your way through hero visuals with stock photography + Canva edits. You cannot fake your way through 30 ad variants that all need to hold the same voice.

What about DALL-E 3 (inside ChatGPT) instead of Midjourney?

DALL-E 3 is acceptable for quick concepting. Midjourney remains ahead on the aesthetic polish that makes top-of-funnel ads stop the scroll. The gap closes every quarter but in 2026-Q2 Midjourney is still the right choice for finished hero visuals. Use DALL-E for sketching the concept; finalize in Midjourney.

What about Adobe Firefly?

Useful when you need integrated Photoshop workflow (drop into a comp, retouch). The pure generation quality lags Midjourney for hero visuals. If you already pay for Creative Cloud, Firefly is included and worth using for the integration. If you don't, Midjourney is the better standalone pick.

Can I automate the 4-step workflow with an agent?

Partially. Concepting (GPT step) automates well. Copy generation (Claude step) automates well but the variant selection still needs human judgment. Visual generation (Midjourney) does not automate cleanly yet because the prompt engineering still requires iteration. Realistic state of automation in 2026: 30 to 40% of the workflow, with humans on the judgment calls.

How does this workflow handle video ads?

Different tool stack. ChatGPT or Claude for the script. Runway or Pika for short-form video generation. Adobe Premiere or CapCut for editing. The 4-step workflow above is for static ads. Video gets its own playbook.

How to use AI for ad creatives: where Gemini, GPT, and Claude each win

Five AI tools that actually ship ad creative. Where each one shines, where each one collapses, and the 4-step workflow that uses three of them without doubling the cost.

By Maddy

Founder, WeActive8Published May 8, 202612 min read

Summarize this article with:

AI Summary

Five AI tools that actually ship ad creative. Where each one shines, where each one collapses, and the 4-step workflow that uses three of them in sequence without doubling the cost.

Claude wins on copy. Long-form brand voice consistency. Ad scripts that stay on-tone across 20 variants. Loses on image generation entirely.
GPT wins on concepting + headline ideation. Brainstorms 50 hooks in 5 minutes. Loses on brand voice (drifts after 3 variants) and long-form copy (gets generic).
Gemini wins on multimodal reference + image-to-image. Best in 2026 at 'do this style but switch the subject.' Loses on copy nuance.
Midjourney wins on hero visuals. Loses everything text-related. Ideogram wins on text-in-image. Loses everything else.
The 4-step workflow: GPT concepting → Claude copywriting → Midjourney visual → Ideogram text overlay. 90 minutes per finished ad. Half the cost of in-house production.

This piece is the working version of that workflow, plus where each of the five major AI tools wins and loses in 2026. Skip to the 4-step workflow at the bottom if you just want the recipe.

Where Claude wins (and where it fails)

Where Claude shines:

Long-form ad copy (50+ words). Stays on tone. Sentences vary in length naturally.
Ad scripts for video. Handles the conversational rhythm better than GPT.
Brand voice projection from a 500-word style guide. Read once, apply consistently.
Editing AI-drafted copy from other tools. Best 'de-AI this draft' partner we have used.

Where Claude fails:

Image generation. Cannot.
Short punchy headlines (under 8 words). Tends to over-explain. GPT is better here.
Generating 50 different concepts fast. Slower at divergent thinking than GPT.

Where GPT wins (and where it fails)

GPT (ChatGPT) wins on speed and breadth of ideation. Ask for 50 ad hooks and 50 land in 5 minutes. Some are gold. Most are filler. The filtering is the work.

Where GPT shines:

Concepting at volume. 50 hook ideas in 5 minutes. The top 3 to 5 are usable.
Headline variants under 10 words. Short and punchy is GPT's natural rhythm.
Quick translation of one concept into 10 cultural variants. Decent at localization.
Structured outputs (JSON, tables, scripts) when you want copy back in a format.

Where GPT fails:

Long-form ad copy past 100 words. Voice drifts. Cliches creep in. Em-dashes everywhere.
Brand voice consistency across 20+ variants. Reverts to generic GPT voice by variant 5.
Nuanced critique of existing copy. Tends to over-edit. Says "tighten this" even when the original is good.

Where Gemini wins (and where it fails)

Where Gemini shines:

Reference-image-driven generation. Upload a brand example, ask for variants in the same style.
Huge-CSV analysis of ad performance data. Handles bigger inputs than the others without choking.
Web-access research for competitor scans. Built-in search is solid.
Image editing prompts that need precise composition adjustments.

Where Gemini fails:

Copy nuance. The output reads slightly stilted compared to Claude.
Brand voice consistency across long sessions. Better than GPT but worse than Claude.
Aesthetic style for hero visuals. Midjourney remains ahead on look quality.

Where Midjourney wins (and where it fails)

Where Midjourney shines:

Hero visuals for top-of-funnel ads. Aesthetic over functional.
Mood pieces, lifestyle imagery, abstract concept visuals.
Variations on a strong prompt. The 4-image grid is still the fastest variant generator.

Where Midjourney fails:

Text inside images. Use Ideogram instead.
Realistic product photography. Use a real photographer + retouching.
Exact-composition needs. Use Gemini or Photoshop's generative fill.

Where Ideogram wins

Beyond text-in-image, Ideogram is fine but not class-leading. Use it for what it wins.

The 4-step workflow

This is the working version we use on every client engagement. 90 minutes per finished ad. Half the cost of in-house production.

Step 1. GPT for concepting (15 minutes)

Open GPT. Paste the offer, the audience, the brand voice in 3 sentences. Ask for 50 ad hook concepts. Get 50 back in 5 minutes. Read all 50. Star the top 5.

Why GPT for this: speed and breadth. The 50-in-5-minutes is the right shape for divergent ideation. Claude can do this but is slower and produces a tighter set that filters less interestingly.

Step 2. Claude for copywriting (30 minutes)

Take the top 5 hooks to Claude. Provide the brand voice doc (or 3 example posts in your voice). Ask Claude to write 3 ad variants per hook. 15 variants total. Read all. Pick the top 5.

Why Claude for this: voice consistency. Across 15 variants Claude will hold the voice tightly. GPT would drift by variant 5.

Step 3. Midjourney for hero visual (30 minutes)

For each of the top 5 ad concepts, write a visual prompt. Run 2 rounds in Midjourney. First round: 4-image grid, pick the best. Second round: variations on the best. Final pick is your hero.

Why Midjourney for this: aesthetic. Other tools can produce technically correct images but the brand polish is harder to get.

Step 4. Ideogram for in-image text (15 minutes)

If the ad concept needs text inside the image (price, offer, urgency), Ideogram. Bring the Midjourney image as reference, prompt for the text overlay you want.

Why Ideogram for this: it is the only AI image tool that gets text inside images right at production quality.

What this workflow costs

Tool subscriptions (2026 pricing):

ChatGPT Plus: $20/month
Claude Pro: $20/month
Midjourney Standard: $30/month
Ideogram Plus: $20/month
Gemini Advanced: $20/month (if you add it)

Frequently asked questions

Why not just use ChatGPT for everything?

Which subscription is most worth it if I can only afford one?

What about DALL-E 3 (inside ChatGPT) instead of Midjourney?

What about Adobe Firefly?

Can I automate the 4-step workflow with an agent?

How does this workflow handle video ads?

How to use AI for ad creatives: where Gemini, GPT, and Claude each win

Summarize this article with:

Where Claude wins (and where it fails)

Where GPT wins (and where it fails)

Where Gemini wins (and where it fails)

Where Midjourney wins (and where it fails)

Where Ideogram wins

The 4-step workflow

Step 1. GPT for concepting (15 minutes)

Step 2. Claude for copywriting (30 minutes)

Step 3. Midjourney for hero visual (30 minutes)

Step 4. Ideogram for in-image text (15 minutes)

What this workflow costs

Frequently asked questions

Related reads

Maddy

How to use AI for ad creatives: where Gemini, GPT, and Claude each win

Summarize this article with:

Where Claude wins (and where it fails)

Where GPT wins (and where it fails)

Where Gemini wins (and where it fails)

Where Midjourney wins (and where it fails)

Where Ideogram wins

The 4-step workflow

Step 1. GPT for concepting (15 minutes)

Step 2. Claude for copywriting (30 minutes)

Step 3. Midjourney for hero visual (30 minutes)

Step 4. Ideogram for in-image text (15 minutes)

What this workflow costs

Frequently asked questions

Related reads

Maddy