"Advanced Prompting (2026) — Roles, Examples, and Chaining in Practice"
The 4-block role formula, four chaining patterns, and a working self-critique loop
ํต์ฌ ์์ฝ
- Audience: Anyone who's already comfortable with the five-pattern basics and wants to push output quality with multi-step prompting.
- What you'll get: 1) The 4-block role formula, 2) what makes few-shot examples actually work (and fail), 3) the four standard chaining patterns, 4) how to wire up a self-critique loop, 5) meta-prompting — letting the AI write your prompts.
- Key shift: Basics was "ask well in one shot." Advanced is "break it into steps" + "let the model refine its own answer."
1. Role prompting — the 4-block formula
You are an expert alone is too thin. Use four blocks.
Block 1: Role
Who you are + what you're great at.
"You are a contract lawyer with 12 years of experience specializing in IT-startup SaaS and B2B disputes, with a focus on protecting suppliers (the service-provider side)."
Block 2: Audience
Who the answer is for.
"Your audience is a solo founder with a technical background but limited legal training. Briefly define legal terms on first appearance."
Block 3: Task
Exactly what to produce.
"Review the SaaS contract draft below and produce: 1) clauses risky to us, 2) ambiguous language, 3) protective clauses we should add."
Block 4: Constraints
What not to do + format.
"Don't speculate. Cite the contract clause for each claim. Output in plain English, Markdown table, 1–2 lines per row."
Save this 4-block as a system prompt and you can reuse the persona across many questions: ChatGPT Custom GPTs, Claude Projects, Gemini Gems.
2. Few-shot — what actually makes examples work
The basics piece established that 1–2 examples can lift accuracy ~7.48%. Advanced is about example quality.
2.1 Diversity beats quantity
Five similar examples < two distinct ones. The model latches onto whatever all examples share.
Bad set (all alike)
Five "diary → tweet" examples, all about food and waiting in line.
Good set (varied)
1) Food/queue, 2) Movie/emotion, 3) Workout/pain, 4) Commute/exhaustion.
2.2 Contrastive (X vs Y) examples
Showing what to do and what to avoid sharpens the signal.
"Of the two responses below, follow style B. Response A (avoid): formal, long, stiff Response B (target): casual, short, point-first"
2.3 The trap: too-strong examples kill diversity
If your task itself requires diverse output (brainstorming, ideation), use zero or one example. Examples = answer keys = the model converges.
3. Prompt chaining — break it up instead of all-in-one
Complex tasks split into 3–5 steps gain accuracy and become much easier to debug (Prompt chaining 2026 guide).
3.1 The four standard patterns
| Pattern | Steps | Best for |
|---|---|---|
| Analyze → Plan → Draft → Refine | Analyze → plan → draft → polish | Writing, presentations |
| Extract → Transform → Summarize | Extract → transform → summarize | Data and document processing |
| Classify → Route → Generate | Classify → branch → generate | Triage, customer support |
| Generate → Critique → Improve | Generate → critique → refine | Code, design, copy |
3.2 Walk-through — blog post (Analyze → Plan → Draft → Refine)
[Step 1 — Analyze]
"My topic is 'AI automation for beginners.'
Profile the readers, their search intent,
and what existing posts get right vs. miss."
[Step 2 — Plan]
"Using that analysis, draft an outline
with 7 H2 sections. For each, state the core
message and target word count."
[Step 3 — Draft]
"Write section 3 from the outline.
Markdown, ~700 words."
[Step 4 — Refine]
"From the previous draft:
1) Remove vague phrases ('many people')
2) Mark numbers needing sources with '[source needed]'
3) Strengthen the opening sentence."
Output of one step is the input of the next. Even web UIs flow this naturally inside a single chat.
3.3 Mix models per step
A subtle benefit: different model per step. Cheap model (Haiku, GPT-4o-mini) for extraction; reasoning model (Sonnet, Opus) for analysis; creative model for generation. Documented savings of up to ~70% vs. running the strongest model end-to-end (SurePrompts 2026).
4. Self-critique loops
Generate → Critique → Improve, but folded into a single ask.
4.1 Single-prompt version
"Do this in three steps: Step 1: Produce a first draft of [task]. Step 2: As a critic, name three weaknesses. Step 3: Rewrite the draft addressing those weaknesses.
Task: 30-second ad copy for an eco-friendly tumbler."
The model drafts, criticizes itself, and improves — all in one response.
4.2 Multi-call version (stronger)
Splitting the steps into separate calls produces deeper critique. Boost it further by giving the critic a different persona.
"[Call 1] You are a copywriter. Write the ad." "[Call 2] You are a marketing director. Identify three weaknesses in the ad above." "[Call 3] You are the copywriter again. Rewrite v2 addressing those weaknesses."
4.3 Self-Consistency
Sample N answers with temperature ≥ 0.7 and pick the majority (Wang et al., 2022, arXiv:2203.11171). Strong on simple reasoning but multiplies cost. Use only on critical decisions.
5. Meta-prompting — let the AI write your prompts
The laziest and often most effective: ask the AI for better prompts.
"Generate three prompt versions for the task below. Each takes a different angle: (1) maximally specific, (2) example-rich, (3) role-based.
Task: Reply to a customer complaint email."
Pick the best output, or merge the best parts. Anthropic Console's Prompt Generator does exactly this (official).
When meta-prompting really shines
- A task you'll repeat 100+ times (automation, batch jobs)
- System prompts going into production
- Domains you don't deeply know (legal, medical, finance personas)
6. Summary — when to reach for what
| Situation | First try | Add if needed |
|---|---|---|
| One-off question | Five basic patterns | "Think step by step" |
| Long writing or research | 4-block role | Analyze → Plan → Draft → Refine chaining |
| Tight reasoning | Chain of thought | Self-Consistency (cost permitting) |
| Code review / debug | Generate → Critique → Improve | Multi-persona critique |
| Automation / batch | System prompt + few-shot | Meta-prompting for the system prompt |
Developer notes
For API users:
- Chaining = N× tokens. A 4-step chain = 4 calls, with previous outputs feeding the next. Use prompt caching (Anthropic, OpenAI, Google all support it) aggressively.
- Chaining ≠ agents. You define the steps; agents (ReAct, function-calling loops) let the model decide. Start with chains; graduate to agents.
- Self-Consistency implementation: call N times at temperature ≥ 0.7, then majority-vote — or use an LLM-as-judge to pick the best.
- Mix-model orchestration: LangChain or LlamaIndex routers let you assign models per step explicitly.
- Meta-prompting at scale: Anthropic Prompt Improver returns improved versions of an existing prompt.
- Evals are non-negotiable. Chaining and meta-prompting introduce regressions easily. Maintain a 10–20-item golden set + LLM-judge eval pipeline.
References
- Anthropic — Prompting best practices
- Anthropic — Prompt Improver
- Prompt Engineering Guide — Prompt Chaining
- Prompt Engineering Guide — Self-Consistency
- Self-Consistency paper (Wang et al., 2022)
- SurePrompts — Prompt Chaining 2026 Guide
This is part 3-2 of 11 in the AI Basics series. Next: AI Learning Tools — NotebookLM + Perplexity in practice.
๋๊ธ
๋๊ธ ์ฐ๊ธฐ