Prompting

How to Write Prompts for Gemini (Google Style)

Gemini's edge is a very large context window and native multimodality. Feed it whole documents and images, then point it precisely at what you need.

Setuproll Team8 min read2026-06-20

Gemini's strengths shape how you should prompt it. It accepts very long inputs and reads images, audio and video alongside text. So instead of trimming your context down to fit, you can often hand it the whole thing and then ask a precise question against it.

Lead with the long context

When you paste a large document, a transcript or several files, put that material first and your instruction last. The model reads the context, then the ask. Label each chunk so you can refer back to it, the same way you would name sections of a brief.

prompt.txt
[DOCUMENT A: Q3 board deck]
...full text...

[DOCUMENT B: Q3 finance memo]
...full text...

Task: Using only DOCUMENT A and DOCUMENT B, list every claim
about revenue that appears in one document but not the other.
Cite which document each claim came from.
Anchor the answer to the source
With a big context, the risk is the model blending sources or drifting to its own knowledge. Tell it to use only the provided material and to cite which chunk each point came from. That keeps it grounded and makes the answer checkable.

Use multimodal inputs directly

You do not have to describe an image in words first. Attach it and ask about it. The same goes for a chart, a screenshot of an error, or a slide. Phrase the question as if the model can see the thing, because it can.

InputA good multimodal ask
A screenshot of a UIList every accessibility issue you can see in this screen.
A photo of a receiptExtract the vendor, date and total as JSON.
A chart imageDescribe the trend and name the two largest months.
Gemini - grounded extraction
You
[image: receipt.jpg] Extract vendor, date, total. Return JSON only.
Agent
{"vendor":"Blue Bottle","date":"2026-06-14","total":"18.40"}
Point at the image and pin the output shape.

Be explicit about scope

A large window is powerful but it is not a license to be vague. The more material you provide, the more important it is to scope the question tightly. Tell it which sections to consider, what to ignore, and how long the answer should be.

  1. Name the inputs the answer should draw from.
  2. Say what to exclude, such as appendices or boilerplate.
  3. Ask for citations back to the labeled chunks.
  4. Set a length budget so a huge context does not produce a huge answer.
Long context is not free attention
Even with a big window, a question buried in the middle of a giant paste can get diluted. Keep the instruction at the end, keep it short, and refer to your chunk labels so the model knows exactly where to look.

A Gemini prompt pattern

  1. Paste or attach all context first, each piece labeled.
  2. Add any image, audio or video inputs in place.
  3. End with a single, scoped instruction.
  4. Require grounding: use only the provided material, cite the source.
  5. Set the output format and a length limit.
Google prompting guidanceGoogle's strategies for prompting Gemini, including long-context and multimodal tips.ai.google.dev

The mental shift with Gemini is from compressing your input to organizing it. Give it the whole picture, label the parts, then aim a precise question at exactly what you need.

0 Comments

Sign in to post

Loading discussion...