Goodman's New Riddle of Induction
You've examined thousands of emeralds. Every single one has been green. It seems perfectly reasonable to conclude: "All emeralds are green."
But philosopher Nelson Goodman (1955) invented a strange new color called grue. And here's the problem: every emerald you've ever seen is also grue.
So why predict future emeralds will be green rather than grue?
Definition (grue): an object is grue if it is first examined before some future time T and is green, or is not examined before T and is blue.
Its mirror image, bleen: first examined before T and blue, or not examined before T and green.
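These definitions can be sketched as predicates. A minimal sketch, assuming an arbitrary cutoff year T = 2030 (Goodman's T can be any future time; the names and dates here are purely illustrative):

```python
T = 2030  # hypothetical cutoff year, chosen arbitrarily for illustration

def grue(color, first_examined):
    # grue: first examined before T and green, OR not examined before T and blue
    return color == "green" if first_examined < T else color == "blue"

def bleen(color, first_examined):
    # bleen: first examined before T and blue, OR not examined before T and green
    return color == "blue" if first_examined < T else color == "green"

# Every emerald examined so far is green and was examined before T,
# so each one is green AND grue: the two predicates agree on all the evidence.
examined = [1900, 1965, 2020]
assert all(grue("green", year) for year in examined)

# They diverge only on emeralds not examined before T:
assert grue("blue", 2050) and not grue("green", 2050)
```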
All observed emeralds are green → All emeralds are green → Future emeralds will be green.
All observed emeralds are grue → All emeralds are grue → Future emeralds will be grue, that is, blue (unexamined grue things are blue)!
Both inductions use the exact same evidence (observed green/grue emeralds) but reach opposite conclusions about unobserved emeralds!
What makes "green" a better predicate for induction than "grue"?
You might think grue is artificial because it's defined in terms of green, blue, and time. But here's the twist:
Grue and bleen can be defined via green and blue plus time T.
But green and blue can just as well be defined via grue and bleen plus time T: an object is green if it is first examined before T and grue, or not so examined and bleen.
The symmetry is perfect. If we started with grue and bleen as "basic," then green would be the weird, time-dependent predicate! There's no objective reason to prefer one set of color concepts over the other.
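The interdefinability can be checked mechanically. A small round-trip sketch (again with an arbitrary illustrative cutoff T = 2030): starting from green/blue we compute an object's grue/bleen values, then recover its original color using only those values plus T.

```python
T = 2030  # arbitrary cutoff year, purely illustrative

# Direction 1: grue/bleen defined from green/blue plus time.
def grue(color, year):  return color == "green" if year < T else color == "blue"
def bleen(color, year): return color == "blue" if year < T else color == "green"

# Direction 2: green/blue defined from grue/bleen plus time -- same shape!
def is_green(g, b, year): return g if year < T else b   # grue before T, else bleen
def is_blue(g, b, year):  return b if year < T else g   # bleen before T, else grue

# Round trip: translating into grue/bleen and back loses nothing.
for color in ("green", "blue"):
    for year in (1999, 2050):
        g, b = grue(color, year), bleen(color, year)
        assert is_green(g, b, year) == (color == "green")
        assert is_blue(g, b, year) == (color == "blue")
```

Neither direction is more "time-dependent" than the other; the asymmetry only appears once you have chosen a vocabulary.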
David Hume formulates the original problem of induction: we can't justify the assumption that the future will resemble the past.
Nelson Goodman publishes "Fact, Fiction, and Forecast," introducing grue/bleen and the "new riddle of induction" — a successor problem that's arguably harder to solve.
Philosophers attempt various solutions: natural kinds, simplicity, conventionalism. None fully succeed.
The problem connects to machine learning and Bayesian inference. Solomonoff induction and Kolmogorov complexity offer partial solutions based on description length.
Goodman's own answer was entrenchment: some predicates have been successfully used in past projections, while others haven't.
But critics note this is descriptive, not justificatory. It explains what we do, not why we're right to do it.
Green picks out a "natural kind" — a real category in nature. Grue is gerrymandered and doesn't carve nature at its joints. But what makes a kind "natural"?
Green is "simpler" than grue. But simplicity is relative to the language you start with. In grue-language, grue is simpler!
Prefer hypotheses with shorter descriptions in a universal programming language. Green has lower complexity than grue. This is the basis of Solomonoff induction.
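A toy way to see the language-relativity worry: measure "description length" as the number of symbols needed to spell a predicate out of a chosen set of primitives. This is a caricature of Kolmogorov complexity, not the real thing; the expression encoding and the `if_before_T` connective are invented for illustration.

```python
def description_length(expr):
    """Symbols needed to write `expr`: a primitive name counts as 1;
    a compound expression (tuple) is the sum over its parts."""
    if isinstance(expr, str):
        return 1
    return sum(description_length(part) for part in expr)

# In a green/blue vocabulary, green is primitive but grue needs unpacking:
green_in_green_lang = "green"
grue_in_green_lang  = ("if_before_T", "green", "blue")

# In a grue/bleen vocabulary the situation exactly reverses:
grue_in_grue_lang   = "grue"
green_in_grue_lang  = ("if_before_T", "grue", "bleen")

assert description_length(green_in_green_lang) == 1
assert description_length(grue_in_green_lang)  == 3
assert description_length(grue_in_grue_lang)   == 1
assert description_length(green_in_grue_lang)  == 3
```

Solomonoff's move is to fix one universal machine up front; the invariance theorem then bounds how much description lengths can differ across machines, but only up to an additive constant, and for any finite case that constant can still favor either vocabulary.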
Green corresponds to a physical property (wavelength). Grue doesn't map onto any single physical feature. Induction should track causal/physical structure.
We have strong prior probability for "all emeralds are green" and low prior for "all are grue." But this just pushes the question back: why those priors?
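A minimal Bayesian sketch of why the data alone can't help (the prior numbers are made up): both hypotheses assign probability 1 to every green-looking emerald examined before T, so conditioning leaves their ratio exactly where the priors put it.

```python
prior = {"all green": 0.75, "all grue": 0.25}  # illustrative priors

def posterior(prior, n_pre_T_observations):
    # Each pre-T observation of a green-looking emerald has likelihood 1
    # under BOTH hypotheses (a grue emerald examined before T looks green too).
    likelihood = {h: 1.0 ** n_pre_T_observations for h in prior}
    evidence = sum(prior[h] * likelihood[h] for h in prior)
    return {h: prior[h] * likelihood[h] / evidence for h in prior}

post = posterior(prior, 10_000)
assert post == prior  # no amount of pre-T evidence moves the ratio
```

All the work is done by the priors, which is exactly where the riddle reappears.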
It doesn't matter philosophically — we just evolved to use predicates like green because they worked. Natural selection, not logic, chose our concepts.
The grue paradox isn't just a philosophical puzzle; it has real implications for any system that generalizes from data.
"The problem is not merely to explain why we in fact project 'green' rather than 'grue.' The problem is to provide a justification for this practice."
— Nelson Goodman
Hume showed we can't prove that induction works. Goodman showed something arguably worse: even if induction works, we can't say which inductions are correct.
The evidence doesn't uniquely determine the hypothesis. Infinitely many predicates are consistent with any finite set of observations. Choosing between them requires something beyond the data itself — prior assumptions, simplicity metrics, or brute convention.
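The underdetermination is easy to exhibit: every choice of cutoff year later than our last observation yields a distinct grue-style hypothesis, and all of them fit the data perfectly. A sketch (the years and counts are illustrative):

```python
def make_grue_at(t):
    """A family of predicates: green if first examined before year t, else blue."""
    return lambda color, year: color == ("green" if year < t else "blue")

# All emeralds examined so far: green, observed between 1900 and 2024.
observations = [("green", year) for year in range(1900, 2025)]

# A hundred rival predicates, one per future cutoff -- in principle, infinitely many.
rivals = [make_grue_at(t) for t in range(2026, 2126)]

# Every rival is satisfied by every observation; the data cannot choose among them.
assert all(p(color, year) for p in rivals for color, year in observations)

# Yet they disagree about the future:
assert make_grue_at(2030)("blue", 2040) and make_grue_at(2050)("green", 2040)
```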
In a sense, all learning is theory-laden. We never observe the world neutrally; we always bring concepts that shape what we see and what we expect.
"All emeralds are green. All emeralds are grue. Both statements are supported by identical evidence."
— The uncomfortable truth of induction