Nano Banana Tutorial: Editing Complex Scenes in Natural Language
Brief Description: This tutorial shows how to use natural language with Nano Banana in Gemini to edit complex scenes: replacing backgrounds, blending multiple subjects, and matching light, shadow, and perspective. It also covers drafting high-quality prompts with ChatGPT and Claude so that the workflow becomes more automated and repeatable.
1. Workflow Overview
1. Entry point and material preparation
Start by selecting Nano Banana, switching to image editing mode, and importing a clear image of the subject together with a reference background. Use ChatGPT or Claude to generate scene keywords and style words so that the model understands the goals and constraints. This gives you the first link of an automated tool chain.
2. Natural language editing principles
Instructions should be specific and verifiable: state that the subject stays unchanged, that only the background is replaced, that skin tone is preserved, and that clothing and proportions stay consistent. Have ChatGPT and Claude output three to five prompt variants, test each of them several times in Nano Banana, and converge on the variant that gives the most stable results.
(1) Prompt Structure Template
Subject description + action requirements + background style + light and shadow direction + restrictions. Example: Keep the details of the person and clothing, change the background to an overcast city street, light the scene from the rear left, keep the overall color temperature cool, and do not change facial features.
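The five-slot template above can be sketched as a small Python helper. This is only an illustration of the structure: the class name `EditPrompt` and its field names are assumptions made for this example, and the code only builds a text prompt to paste into the editor. It does not call any Nano Banana or Gemini API.

```python
from dataclasses import dataclass


@dataclass
class EditPrompt:
    """Five slots from the template; the field names are illustrative."""
    subject: str
    action: str
    background: str
    lighting: str
    restrictions: str

    def render(self) -> str:
        # Join the non-empty slots in template order, one sentence per slot.
        parts = [self.subject, self.action, self.background,
                 self.lighting, self.restrictions]
        return " ".join(p.strip().rstrip(".") + "." for p in parts if p.strip())


prompt = EditPrompt(
    subject="Keep the details of the person and clothing",
    action="Replace only the background",
    background="Overcast city street",
    lighting="Light from the rear left, cool overall color temperature",
    restrictions="Do not change facial features",
)
print(prompt.render())
```

Keeping the slots separate makes it easier to change one element, such as the lighting, while the rest of the prompt stays fixed between test runs.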
(2) Consistency and safety
For complex scenes, the prompt should emphasize character consistency, lens focal length, and depth of field. The AI adds a watermark to generated content. The output is suitable for scenarios such as e-commerce, short videos, and social media.
2. Four-step method for complex scenes
1. Separate the subject with the "green screen transfer method"
Before replacing the background, first have the tool change the background to pure green or pure gray, and only then switch to the target background in a second step. This can reduce color spill and clipping artifacts. For this step, ChatGPT or Claude supplies standardized prompt templates, and Nano Banana carries out the edits.
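A minimal sketch of the two-pass idea, assuming the prompts are written as plain text and run one after the other by hand. The function name `two_pass_prompts` and the exact wording of the sentences are illustrative, not a fixed Nano Banana syntax.

```python
def two_pass_prompts(target_background: str, neutral: str = "pure green") -> list[str]:
    """Return the two prompts for the green-screen transfer, in run order."""
    if neutral not in ("pure green", "pure gray"):
        raise ValueError("neutral must be 'pure green' or 'pure gray'")
    keep = "Keep the subject unchanged."
    return [
        # Pass 1: isolate the subject on a flat neutral backdrop.
        f"{keep} Replace only the background with a flat {neutral} backdrop.",
        # Pass 2: swap the neutral backdrop for the real target scene.
        f"{keep} Replace the {neutral} backdrop with {target_background}.",
    ]


for step, text in enumerate(two_pass_prompts("a beach at dusk"), start=1):
    print(f"Pass {step}: {text}")
```

Run the second prompt on the output image of the first pass, not on the original photo.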
2. Light and shadow and color matching
Write the direction, intensity, and white balance of the light source into the prompt, for example cool light from the rear right with a little ambient bounce fill, and ask for softened shadows and penumbra transitions so that the composite does not look out of place.
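One way to make sure the three lighting properties named above are always stated is to build the lighting sentence from explicit arguments. The helper below is a hypothetical sketch; `lighting_clause` and its parameter names are assumptions made for this example.

```python
def lighting_clause(direction: str, intensity: str, white_balance: str,
                    soften_shadows: bool = True) -> str:
    """Build one lighting sentence from direction, intensity and white balance."""
    clause = f"{intensity.capitalize()} {white_balance} light from the {direction}"
    if soften_shadows:
        clause += ", with softened shadows and smooth penumbra transitions"
    return clause + "."


print(lighting_clause("rear right", "soft", "cool"))
```

Because every argument is required, a prompt cannot be generated with the light direction or white balance left out.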
3. Unified perspective and depth of field
Describe the camera feel: wide-angle or medium focal length, a sharp foreground with a blurred background, and the background bokeh radius. Ask the AI to keep the horizon height consistent and to keep the subject's feet from appearing to "float".
4. Style and batch processing
Make a list of style words: cinematic gray-blue, film grain, commercial minimalism, Nordic home. Have ChatGPT and Claude generate ten to twenty scene combinations in a batch, and let Nano Banana cycle through them with one click to form a stable, automated production line.
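The combinations can also be enumerated locally before any prompts are sent. The sketch below crosses the style words from the list above with three scenes taken from elsewhere in this tutorial. The pairing format is an assumption, and in practice the scene list would come from ChatGPT or Claude.

```python
from itertools import product

styles = ["cinematic gray-blue", "film grain", "commercial minimalism", "Nordic home"]
scenes = ["overcast city street", "beach at dusk", "city night scene"]

# Every style paired with every scene: 4 x 3 = 12 combinations.
combos = [f"{scene}, {style} style" for style, scene in product(styles, scenes)]

print(len(combos))  # 12
for line in combos[:3]:
    print(line)
```

Four styles and three scenes already land inside the ten-to-twenty range, so short lists are enough for a first batch.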
3. Practical examples: three typical scenarios
1. E-commerce product image
Requirements for the AI: replace only the background with a softly lit solid color, add a slight countertop reflection and soft cast shadows, and lock the product colors and metallic highlights. ChatGPT outputs five brand-color backgrounds, Claude provides lighting setups, and Nano Banana produces the final images.
2. Multi-person group photo
Requirements for the AI: retain the positions and height ratios of the three people on the left, in the middle, and on the right; replace the background with a beach at dusk; light the scene from the right; keep clothing colors unchanged; and preserve skin texture. If necessary, split the edit into two rounds: first change the background to pure green, then bring in the seaside.
3. Portrait to city night scene
Requirements for the AI: add neon reflections, a warm street-light color temperature, blue and purple rim light on the edges of the subject, and an f/2.0-style depth of field. Emphasize that changing the facial structure or hair density is forbidden, to avoid "redrawing the face".
4. Quality control and pitfall checklist
1. Edges and hair strands
Add edge refinement, color fringe removal, and hair strand reconstruction to the instructions. When jagged edges appear, iterate in small steps while keeping the same prompt skeleton.
2. Color and noise
Specify a uniform white balance, slightly visible grain, and noise reduction that preserves detail. Compare the skin tone of the exported image with the original image to check that the AI output is consistent.
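The skin tone comparison can be made measurable with a rough check like the one below. It is a sketch under stated assumptions: the pixel samples are hypothetical RGB values taken by hand from the same skin region in both images, and the tolerance of 8 on a 0-255 scale is an arbitrary starting point, not a recommended standard.

```python
def mean_rgb(pixels):
    """Average a list of (r, g, b) tuples channel by channel."""
    n = len(pixels)
    return tuple(sum(p[i] for p in pixels) / n for i in range(3))


def skin_tone_drift(original, exported):
    """Largest per-channel difference between the two mean colors."""
    a, b = mean_rgb(original), mean_rgb(exported)
    return max(abs(x - y) for x, y in zip(a, b))


# Hypothetical samples from the same skin region in both images.
original_patch = [(224, 172, 150), (220, 170, 148), (226, 176, 152)]
exported_patch = [(222, 171, 151), (219, 168, 150), (225, 174, 155)]

drift = skin_tone_drift(original_patch, exported_patch)
TOLERANCE = 8  # assumed threshold on a 0-255 scale; tune per project
print("consistent" if drift <= TOLERANCE else "re-run the edit")
```

A mean-color check only catches overall shifts. Local problems such as blotchy skin still need a visual inspection.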
(1) Correcting failures
Example: change "Replace the background with a city night scene" to "Replace only the background with a city night scene, keep the subject locked, warm light from the rear right, and preserve clothing color and skin tone" to reduce errors.
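The difference between the weak and the corrected prompt in the example above can be checked mechanically before a prompt is sent. The keyword lists below are assumptions chosen for this illustration, and simple substring matching is crude, so treat the result as a reminder rather than a guarantee.

```python
# Hypothetical checklist: constraint name -> keywords that signal it is present.
REQUIRED = {
    "scope": ("only",),
    "subject lock": ("locked", "unchanged", "keep the subject"),
    "lighting": ("light",),
    "color": ("skin tone", "color"),
}


def missing_constraints(prompt: str) -> list[str]:
    """List the constraint names that no keyword in the prompt covers."""
    text = prompt.lower()
    return [name for name, keys in REQUIRED.items()
            if not any(k in text for k in keys)]


weak = "Replace the background with a city night scene"
strong = ("Replace only the background with a city night scene, keep the subject "
          "locked, warm light from the rear right, and preserve clothing color "
          "and skin tone")

print(missing_constraints(weak))
print(missing_constraints(strong))
```

An empty list means every constraint on the checklist is at least mentioned. It does not mean the edit will succeed.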
(2) Batch consistency
Use a fixed template plus variable slots: location, time, weather. ChatGPT and Claude generate the variable table, and Nano Banana runs the batch to output a consistent style.
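A fixed template with variable slots maps directly onto `string.Template` from the Python standard library. In this sketch the template wording and the two table rows are made-up examples. In practice the rows would be the variable table produced by ChatGPT or Claude.

```python
from string import Template

# Fixed part of the prompt; $location, $time and $weather are the variable slots.
TEMPLATE = Template(
    "Replace only the background with $location at $time in $weather weather. "
    "Keep the subject, clothing color and skin tone unchanged."
)

# Hypothetical variable table, one dict per output image.
variable_table = [
    {"location": "a city street", "time": "dusk", "weather": "cloudy"},
    {"location": "a beach", "time": "noon", "weather": "clear"},
]

prompts = [TEMPLATE.substitute(row) for row in variable_table]
for text in prompts:
    print(text)
```

`substitute` raises `KeyError` when a row lacks one of the slots, which surfaces an incomplete variable table before the batch is run.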
(3) Export specifications
Specify the resolution, long-edge pixels, and compression ratio, and keep the source file and prompt log to make backtracking and reproduction easier.
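A prompt log can be as simple as one JSON line per edit run. The sketch below assumes a record layout invented for this example. The field names, the file name, and the way the compression setting is written are all hypothetical and should be adapted to the project.

```python
import json
from datetime import datetime, timezone


def make_log_entry(prompt: str, source_file: str, width: int, height: int,
                   compression: str) -> str:
    """Serialize one edit run as a single JSON line."""
    entry = {
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "source_file": source_file,
        "prompt": prompt,
        "resolution": f"{width}x{height}",
        "long_edge_px": max(width, height),
        "compression": compression,
    }
    return json.dumps(entry, ensure_ascii=False)


line = make_log_entry(
    prompt="Replace only the background with a city night scene.",
    source_file="portrait_001.png",  # hypothetical file name
    width=2048,
    height=1365,
    compression="JPEG quality 90",  # assumed way of stating the ratio
)
print(line)
```

Appending each line to a text file kept next to the source images gives a record that can be searched later to reproduce a specific result.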
Frequently Asked Questions (Q&A)
Q: How should Nano Banana, ChatGPT, and Claude divide the work most efficiently when editing complex scenes?
A: Nano Banana handles image generation and editing, while ChatGPT and Claude handle prompt design, the style vocabulary, and the variable tables. Connected together, they form an automated workflow with higher stability and consistency.
Q: How can natural-language instructions be used to fix problems when replacing a background?
A: Use the green screen transfer first, and then bring in the target background. Add edge refinement, color fringe removal, and a requirement to keep proportions and perspective consistent to the prompt, so that Nano Banana's results gradually converge.
Q: How can ChatGPT and Claude work with Nano Banana to produce e-commerce images in batches?
A: ChatGPT generates product selling points and brand color lists, Claude generates lighting and scene parameters, and Nano Banana reads the templates to replace backgrounds and lighting in batches, exporting an image set in a unified style.
Q: What are the advantages and limitations of AI tools compared with traditional Photoshop cutouts?
A: AI is faster and handles multi-subject semantic understanding, light and shadow consistency, and style unity well, but it still needs explicit constraints and inspection. Combining Nano Banana with ChatGPT and Claude can significantly improve efficiency on batch production lines.