Claude + Nano Banana 2: How I Changed My Creative Workflow Overnight
Three use cases for automating the image generation process end-to-end.
Every newsletter I publish needs a thumbnail. And for months, my process for creating one looked like this:
Open my newsletter draft inside Claude Projects.
Ask Claude to analyze the post — the hook, the key themes, the emotional angle.
Tell Claude to generate a detailed text-to-image prompt based on what it understood.
Copy that prompt.
Open ChatGPT in a new tab.
Paste the prompt into ChatGPT’s image generator.
Wait for the result.
Hate it.
Tweak the prompt. Regenerate. Still not right.
Adjust again. Maybe three, four, five rounds until I get something that actually captures what the post is about.
Download the image. Drag it into my project folder. Move on.
That was the process for one thumbnail. Multiply that by weekly posts, and you start to feel the weight of it.
Then Google released Nano Banana 2 last month.
If you haven’t seen what this model can do — it’s built on Gemini 3.1 Flash and it’s a genuine leap. 4K resolution. Text rendering that actually works across multiple languages. Subject consistency that can maintain up to 5 characters and 14 objects across images. And it’s fast — Pro-level quality at Flash speed.
You might want to check out some use cases along with prompts created by Justine Moore 👇
The image quality was so good that I started doing something new: creating infographics for specific sections within my newsletters. If you’ve read my recent posts (including this one), you’ve probably noticed those visual breakdowns that explain key concepts, frameworks, or workflows. They help readers understand complex ideas faster than text alone.
But even with better image quality, the workflow was still the same.
I was still analyzing my newsletter in Claude, generating prompts in Claude, then copying those prompts into Nano Banana 2 to actually create the images. For thumbnails. For every infographic. The same copy-paste loop, just with a different tool on the receiving end.
The thinking happened in one place. The creation happened in another. And the gap between them kept killing my momentum.
Then I connected Nano Banana directly to Claude using MCP:
Now when I finish writing a newsletter post, I share the draft with Claude and say: “Generate a thumbnail for this post.” Claude already understands the content — the themes, the hook, the emotional tone — because it just helped me write it. It crafts the perfect prompt and generates the image right there. No tab-switching. No copy-pasting. No five rounds of “not quite right.”
Same thing for section infographics. I highlight a paragraph and tell Claude: “Turn this into a visual breakdown.” It reads the context, understands what needs to be communicated visually, and generates the infographic using Nano Banana Pro — all without me leaving the conversation.
What used to take 20-30 minutes of back-and-forth across multiple tools now happens in the same space where I’m already writing.
This works in Claude Desktop, Claude Code, and Claude Cowork. Anywhere you can connect an MCP server, you can give Claude the ability to generate and edit images on command.
If you followed my NotebookLM + Claude MCP guide — where we connected Claude to Google’s research tool and turned it into a research partner — this is the same idea, different superpower. That guide gave Claude the ability to think with your sources. This one gives Claude the ability to see what you’re writing and create visuals that match.
In this guide, I’ll show you exactly how to set this up, the custom instructions that make Claude generate better images, and three use cases that changed how I create visual content for my newsletter.
Let’s build it.
What the Nano Banana + Claude Integration Unlocks
Before we get into the setup, let me show you what’s actually possible once these two are connected.
Everything you can do with Nano Banana’s image generation — you can now trigger directly from Claude.
Inside a single Claude conversation, you can tell it to:
Generate images from scratch — thumbnails, social headers, blog visuals, concept art. Describe what you want in plain language and Claude handles the prompt engineering for you.
Edit existing images — hand Claude a photo and say “change the background to a coffee shop” or “make this darker and more moody.” It understands context and makes targeted edits.
Add text overlays that actually work — this is where most image models fall apart. Nano Banana handles text rendering cleanly, including multiple languages and proper positioning.
Generate at any size and aspect ratio — 1:1 for social posts, 16:9 for newsletter headers, 9:16 for stories, 4K resolution when you need production-ready output.
Batch-generate multiple images — up to 4 images per request, with consistent style across the set.
Maintain subject consistency — Nano Banana 2 can keep up to 5 characters and 14 objects consistent across multiple generations. Huge for branded content series.
But here’s what makes this actually powerful:
You’re combining Nano Banana’s image capabilities with Claude’s understanding of your content.
Claude doesn’t just generate a generic image from a prompt. It reads your newsletter draft, understands the themes, and creates visuals that match what you’re actually trying to communicate. It can look at a framework you’ve written and turn it into an infographic. It can read your carousel outline and generate slide visuals that follow the narrative.
The image intelligence of Nano Banana meets the content intelligence of Claude.
That’s the unlock.
What It Costs
I want to be upfront: image generation through the Gemini API requires a paid tier. The free Gemini API key gives you access to text models, but image generation is pay-as-you-go only.
The good news — it’s cheap. NB2 (the default) runs about $0.04–$0.05 per image at standard resolution, scaling up to ~$0.15 per image at 4K. Pro costs roughly $0.13 per image at 2K and $0.24 at 4K.
In practice, generating all the visuals for one newsletter — a thumbnail plus 2-3 infographics — costs me less than a dollar. You’ll need to add billing to your Google AI Studio account, but there’s no monthly minimum or subscription required. You only pay for what you generate. We’ll get to this setup process later.
What You’ll Learn in This Guide
By the end of this post, you’ll have:
Complete technical setup (10 minutes): Nano Banana MCP server installed and connected to Claude Code, Claude Desktop, or Claude Cowork.
Use Case #1 — Newsletter Thumbnail Generation: How I use my existing Claude project knowledge to craft the perfect text-to-image prompt, then generate the thumbnail without ever leaving the conversation.
Use Case #2 — Spreadsheet-Driven Bulk Image Generation: How to use a simple CSV file inside Claude Cowork to batch-generate 10+ images at once — each with its own platform, aspect ratio, style, and color palette. Includes a downloadable template you can customize.
Use Case #3 — A Visual Intelligence Skill: A downloadable Claude Skill that analyzes your content, selects the right visual archetype (timeline, comparison, pyramid, iceberg, and 8 more), maps visual metaphors, and engineers the prompt — all before generating. Includes 20+ styles, structured prompt templates, and iterative refinement logic.
These cases will help you master the end-to-end image generation workflow using Claude, so you can create any images you want.
Now, it’s time to dive in.
How to Set Up Nano Banana MCP with Claude
Same as my previous NotebookLM + Claude integration guide, this requires using the terminal. And if this is your first time, the whole thing takes about 10 minutes.







