ResearchBenchmark critique

Critique a benchmark or ranking methodology

Use to judge whether a published ranking or benchmark actually measures what it claims.

The prompt
prompt.txt
Role: measurement and metrics auditor.

Benchmark / ranking: {{benchmark}}
What it claims to measure: {{claim}}
Methodology (paste or summarize): {{methodology}}

Audit it:
1. Construct validity: do the metrics actually capture the claimed concept?
2. Data and sampling: where could the inputs be biased, gamed, or incomplete?
3. Weighting and aggregation: how sensitive are the results to arbitrary choices?
4. Incentives: who produced it and could that shape the design?
5. What the benchmark genuinely tells you versus what it is wrongly used to imply.

Conclude: how much should I trust this ranking, and for what decisions is it fit or unfit? Suggest one improvement to the methodology.

Click the copy button in the top right of the block to grab the full prompt.

Variables

Replace each placeholder below with your own values before you run the prompt.

  • {{benchmark}}
  • {{claim}}
  • {{methodology}}
Recommended models
Claude Opus 4.8GPT-5Gemini 2.5 Pro
Tags
#benchmark#metrics#critique#validity

Related prompts

ResearchLiterature review
Write a structured literature review

You are a research analyst writing a literature review. Topic: {{topic}} Field / discipline: {{field}} Time window to focus on: {{time_window}} Produce a literature review that: 1....

Claude Opus 4.xGPT-5Gemini 2.5 Pro
#literature-review#academic#synthesis
View
ResearchCompare options
Compare options with a decision matrix

Act as a neutral research assistant helping me choose between options. Options to compare: {{options}} My use case: {{use_case}} What matters most to me, in order: {{priorities}} M...

Claude Opus 4.xGPT-5Gemini 2.5 Pro
#comparison#decision#evaluation
View
ResearchFact-check
Fact-check a specific claim

You are a careful fact-checker. Claim to check: {{claim}} Context where I saw it: {{context}} Assess the claim and return: 1. Verdict: True, Mostly true, Mixed, Mostly false, False...

Claude Opus 4.xGPT-5Perplexity
#fact-checking#verification#claims
View
ResearchTopic deep dive
Deep dive into an unfamiliar topic

I want to deeply understand a topic I am new to. Topic: {{topic}} Why I am learning it: {{goal}} My current level: {{level}} Walk me through it in layers: 1. The one-paragraph big...

Claude Opus 4.xGPT-5Gemini 2.5 Pro
#deep-dive#explainer#overview
View
ResearchCited summary
Summarize a source with citations

Summarize the source below so a reader can trust where each point comes from. Return: 1. A 2-sentence overview of what the source is and its main argument. 2. {{point_count}} key p...

Claude Opus 4.xGPT-5Gemini 2.5 Pro
#summary#citations#sources
View
ResearchSource synthesis
Synthesize findings across several sources

You are synthesizing several sources into one clear picture. Question I am trying to answer: {{question}} Sources are pasted below, each marked with a label. Do the following: 1. S...

Claude Opus 4.xGPT-5Gemini 2.5 Pro
#synthesis#sources#analysis
View

0 Comments

Sign in to post

Loading discussion...