A field guide

Why adults can’t sight-read — and how to fix it

You can play a Chopin nocturne. So why does reading a new piece feel like decoding a cipher, one note at a time? The problem isn’t your eyes, your talent, or your age. It’s what you’re reading.

You’re spelling, not reading

When you read this sentence, you don’t sound out s-e-n-t-e-n-c-e. You recognise whole words — and you’re already looking at the next ones. A fluent music reader does the same: they don’t see “F, A, C, E, counting up from middle C,” they see a shape — a third, a triad, a familiar cadence. The page isn’t two hundred symbols; it’s a few dozen patterns they’ve seen ten thousand times.

Most adult learners never make that jump. You read note by note — spelling — and spelling is slow no matter how good you get at it. Getting faster at spelling is not the same as learning to read.

The adult’s curse: you’re too good at decoding

Here’s the uncomfortable part. The reason adults get stuck is that they’re competent enough to limp. You know enough theory to work out any note — count up from a landmark, reason out the accidental — so you always can decode your way through. That escape hatch is exactly what traps you. Because you can always fall back on spelling, you never force your brain to do the harder, more valuable thing: recognise the pattern instantly, without thinking.

A beginner child can’t reason their way out, so they’re forced to recognise. You can — so you don’t.

What fluency actually is

Fluency is pattern recognition at speed. It has three properties, and you have to train all three:

Recognition, not calculation. A third should register as “a third” the way the letter A registers as “A” — instantly, before you think.
At speed. Recognition you can only do slowly isn’t fluency. The clock isn’t a stress test bolted on top — it’s the point. Time pressure is what forces recognition to replace decoding.
In context. Real music isn’t flashcards. The patterns have to survive on a real staff, in a real key, surrounded by other notes.

The four layers of the language

Music is built in layers, and you read it the way you read language — letters, then words, then sentences:

Notes — the letters. Every pitch on the grand staff, on sight, without counting from a landmark.
Intervals — the first words. The distance between two notes, seen as one shape: a third, a sixth, an octave. Stop reading two notes; read the gap.
Chords — whole words. A triad is one object, not three stacked notes. Learn the shape and its inversions and a chord lands in a single glance.
Harmony — sentences. How chords pull toward each other — a ii–V–I, a deceptive cadence. When you hear the grammar, a progression reads as logic instead of a string of accidents — and you start reading ahead, because you can predict what’s coming.

Why understanding makes you faster

Fluent readers anticipate. They’re not reacting to each symbol as it arrives; they’re a measure ahead, because the music makes sense to them. That’s the hidden payoff of theory: it isn’t academic — it’s compression. Knowing “this is a ii–V–I” collapses four chords into one idea you’ve read a thousand times. Understanding is speed.

It’s also memory. Pieces stop being strings of notes to memorise and become structures you understand — which is why you’ll learn repertoire faster, too.

How to actually train it

Drill recognition to automaticity — one layer at a time, until a third or a triad registers with no thought.
Then put it under the clock — so recognition replaces decoding. Short daily sessions beat long rare ones.
Then transfer — take the patterns back to real music, where they have to hold up in context.

You don’t need more hours at the piano. You need to stop spelling and start reading the patterns.

That’s exactly what Ostinato trains.Notes → intervals → chords → harmony, under the clock, with the why on every screen. Read music. Don’t decode it. — coming to iPhone, 2026.