Musical intervals explained: what they are, how to recognize them, and what most apps get wrong

A musical interval is the distance between two pitches. Learning to recognize them by ear is the first thing most ear-training apps drill — and there is a strong empirical case that drilling them in isolation is not how you build a working musical ear. This article explains what intervals are, what each one sounds like, and the right way to train them.

A musical interval is the distance between two pitches. The major third from C to E is an interval. The perfect fifth from C to G is an interval. The “Here Comes the Bride” leap is an interval (specifically, an ascending perfect fourth). Every two notes in any piece of music form an interval.

Learning to recognize intervals by ear is the textbook starting point of ear training, and almost every ear-training app on the market drills it as the headline exercise. The pedagogical research, however, suggests that interval drilling in isolation is necessary but not sufficient — and that learners who treat it as the whole task often plateau without ever developing the working musical ear they wanted.

This article covers what intervals are, the thirteen common ones with their sound-quality, the song-mnemonic technique that everyone teaches and its hidden trap, and the broader functional approach that the research supports.

The thirteen common intervals

In Western music, intervals are classified by quantity (how many letter names they span) and quality (perfect, major, minor, augmented, or diminished). The thirteen most common are:

Name Semitones Example (from C) Common feel
Unison 0 C–C Same pitch, no movement
Minor 2nd (m2) 1 C–D♭ Tense, leading
Major 2nd (M2) 2 C–D Step, scale-like
Minor 3rd (m3) 3 C–E♭ Sad, melancholy
Major 3rd (M3) 4 C–E Bright, stable
Perfect 4th (P4) 5 C–F Open, suspended
Tritone (TT) 6 C–F♯ Tense, unresolved
Perfect 5th (P5) 7 C–G Open, strong
Minor 6th (m6) 8 C–A♭ Yearning, plaintive
Major 6th (M6) 9 C–A Warm, expansive
Minor 7th (m7) 10 C–B♭ Bluesy, suspended
Major 7th (M7) 11 C–B Tense, leading-tone
Perfect 8th / Octave (P8) 12 C–C Same note, higher/lower

The “common feel” column is approximate — the same interval will sound different in different harmonic contexts (more on that in a moment).

Quality: perfect, major, minor, augmented, diminished

Intervals come in qualities. The unison, fourth, fifth, and octave are called perfect because they are acoustically simple (small whole-number frequency ratios) and historically considered consonant. The second, third, sixth, and seventh come in major and minor versions, separated by one half-step.

An interval can also be augmented (one half-step larger than perfect or major) or diminished (one half-step smaller than perfect or minor). A “tritone” — the interval six half-steps wide — can be spelled either as an augmented fourth or a diminished fifth depending on context, and the spelling matters for how you write it down even if it sounds the same.

You don’t need to memorize all of this to train your ear. The thirteen common intervals above are sufficient for almost all real music; the exotic spellings show up mostly in formal music-theory analysis.

Ascending vs descending

Intervals sound subtly different when they go up versus down. An ascending major third sounds bright and stable; a descending major third sounds resolving, settling. The minor sixth ascending feels yearning; descending, it feels like a deep sigh. Most learners need to train ascending and descending intervals separately — the descending versions are often harder, partly because most people’s mental song-mnemonic library leans ascending.

The song-mnemonic technique (and why it has limits)

The most widely taught technique for learning intervals is to associate each one with the opening leap of a familiar song:

  • Ascending m2Jaws theme (the two-note shark motif)
  • Ascending M2Happy Birthday (first two syllables of “Happy”)
  • Ascending m3Greensleeves; or The Star-Spangled Banner (“Oh-say”)
  • Ascending M3When the Saints Go Marching In (“Oh when the”)
  • Ascending P4Here Comes the Bride; or Amazing Grace
  • Ascending tritoneThe Simpsons theme; or Maria (West Side Story)
  • Ascending P5Twinkle Twinkle Little Star; or the Star Wars theme
  • Ascending m6The Entertainer
  • Ascending M6NBC chimes; or My Bonnie Lies Over the Ocean
  • Ascending m7Somewhere (West Side Story); or the original Star Trek theme
  • Ascending M7Take On Me (a-ha)
  • Ascending P8Somewhere Over the Rainbow (“Some-where”)

This technique works for clean test conditions: someone plays two notes, you sing through your mental rolodex, find the matching song, and identify the interval. Most learners can reach 95% accuracy on isolated interval ID this way within a few months.

The technique then breaks down at exactly the moment it should be useful: in the middle of a real piece of music. When the same major third appears as part of a melody — say, the second note of a tune you’ve never heard — your Saints mnemonic isn’t fired, because the surrounding context is the actual music, not an isolated test. The mnemonic is a stand-in for context; once context is provided by the music itself, the mnemonic becomes interference rather than help [1].

This is one form of the broader mnemonic ceiling: the technique that gets you to the test gate doesn’t transfer to musical hearing.

What intervals sound like in context

The more important truth: the same interval has different functional sensations depending on its surrounding key.

A major third from C to E sounds:

  • Stable and resolved when E is the third of C major (you hear do → mi, both members of the I chord).
  • Suspended and unresolved when E is the fifth of A minor (you hear 6̂ → 1̂, an arrival on a non-tonic).
  • Chromatic and tense when both notes are passing tones in B♭ major (you hear neither as a structural pitch).

It is the same physical interval; it has three different musical identities. The trained listener doesn’t hear “a major third” — they hear the third of the key, the leading tone resolving, a chromatic neighbor. Interval recognition is the raw sensory layer; functional hearing is the layer the listener actually uses.

This is why interval drilling alone tends not to produce learners who can transcribe melodies, hear chord progressions, or sing back what they hear with accurate pitch. The skill it trains is real but partial.

The right order to train intervals

The order the pedagogical literature converges on:

  1. Same / different and higher / lower discrimination. The most basic perceptual layer.
  2. Tonality. Establish a key (cadence or scale primer); train find-the-tonic and stable-vs-unstable.
  3. Scale degrees. Train each of the seven diatonic notes against an established key. This is the functional ear training stage; it is the single highest-leverage step.
  4. Intervals as movements between scale degrees. Train ascending major thirds as 1̂→3̂, ascending fourths as 5̂→1̂, etc. The interval becomes a labelled motion within a key, not an abstract distance.
  5. Out-of-key intervals. Random pitches without tonal context — the “isolated interval” task most apps lead with — comes last, as a separate skill.
  6. Chords, progressions, and dictation. The synthesis layer.

This ordering — get tonal context first, drop intervals into it, do isolated interval training only as a separate later stage — matches the way trained ears actually parse music. Interval drilling in isolation is not useless; it is just the wrong starting point.

FAQ

What’s the difference between a major and minor third? A major third spans 4 half-steps (C to E); a minor third spans 3 (C to E♭). The major third sounds bright; the minor third sounds darker. Most beginners struggle to reliably tell them apart in isolation; this is the most common ear-training plateau and there is a specific 2-week fix for it.

What is the tritone? The interval of six half-steps — exactly half an octave. It’s called the tritone because it spans three whole tones. In tonal music it is dissonant and demands resolution; in jazz harmony it is the structural feature of dominant chords. Famously called diabolus in musica in medieval pedagogy, though the prohibition story is more legend than documented practice.

What is a compound interval? An interval larger than an octave. A 9th = an octave + a 2nd; an 11th = an octave + a 4th; a 13th = an octave + a 6th. In jazz harmony, the 9th, 11th, and 13th are the standard “extension” tones added to chords for color.

Should I memorize all the song mnemonics? For initial interval recognition, picking 1–2 mnemonics for each interval and learning them well is faster than not having a fallback. But treat the mnemonic as the training-wheels stage, not the goal. The real ear-training work is hearing intervals in tonal context, not in isolation.


References


  1. The argument that song-mnemonic interval training fails to transfer to music in context is laid out in the practitioner literature — see Bruce Arnold’s “Use Your Ear” pedagogy and the Berklee functional aural-skills tradition — and is supported by the broader research on context-dependent perception. The empirical foundation is Carol Krumhansl’s tonal-hierarchy work, which established that listeners hear pitches against the established key rather than as raw distances; see Cognitive Foundations of Musical Pitch (Oxford, 1990) for the comprehensive treatment. ↩︎

← Back to all articles