The four lineages of jazz ear training (and why they don't agree)
Jazz ear training has no Vaccai. It has four parallel pedagogies — Berklee functional, Banacos cadence-based, Stoloff scat, and the Sheila Jordan bass-up modal approach — that disagree about ordering and emphasis. Each gets something right.
Classical ear training, for all its branches, has a recognizable mainstream. Open Karpinski, open the Royal Conservatory syllabus, open any North American university aural-skills program, and you find the same broad order: intervals, scale degrees, melodic dictation, harmonic dictation, sight-singing. Disagreements exist, but the spine is shared.
Jazz ear training is not like this. It has at least four parallel pedagogies, each developed largely independently, each producing musicians who can clearly hear at a high level — and the four traditions disagree about what to teach first, what syllables (if any) to use, what role the voice plays, and even what counts as the unit of perception. Looking at the four side by side reveals what they each get right.
Lineage 1: Berklee functional ear training
The dominant institutional jazz ear-training tradition in North America, codified across the four-semester Berklee Ear Training sequence (Ear Training 1–4) and the related Harmonic Ear Training materials by Roberta Radley and the late Steve Prosser. Radley served as assistant chair of Berklee’s Ear Training department from 1997 until 2021, when she was named Chair Emeritus [1].
The Berklee approach treats functional hearing as a layered cognitive task that can be decomposed and trained piece by piece. The student is taught to listen for one element at a time — first the bassline, then the chord’s third, then whether the chord sounds in or out of the key — and then to combine these layered judgments with theoretical knowledge to determine the chord progression [2]. The vertical axis (root-up arpeggios of seventh chords) and the horizontal axis (chord function relative to the song’s key) are taught as two complementary skills that converge on the same answer.
What this lineage gets right: explicit decomposition. A student who is failing at “hearing the changes” is given a specific sub-skill to work on, not just told to listen harder. The pedagogy is unusually transparent about what is happening cognitively when an experienced player follows a chord chart by ear.
Lineage 2: Charlie Banacos and the cadence-based correspondence method
Charlie Banacos (1946–2009) was the legendary Boston-area teacher whose private students included Mike Stern, Jerry Bergonzi, Mick Goodrick, and a generation of working jazz professionals. Banacos created over 100 courses of study, run as mail-order correspondence lessons; his pedagogy is now best documented in Lefteris Kordis’s “Top Speed and In All Keys: Charlie Banacos’s Pedagogy of Jazz Improvisation” [3].
The Banacos ear-training method is built around a single recurring drill: establish a tonality with a chord cadence, then sound a single note (or short note sequence) and identify it relative to the established key. If the student misses, replay the cadence and re-orient before trying again [4]. The set of tones being identified gradually expands — first one tone, then clusters of tones, eventually all twelve simultaneously — and then progresses to atonal contexts where the student must identify pitches without any key reference at all.
Two principles distinguish Banacos’s method from the Berklee tradition. First, the cadence is the absolute reference, repeatedly re-asserted. The student is never allowed to drift away from a freshly-established tonality. Second, everything must be done at top speed and in all twelve keys before it counts as learned [3:1] — partial mastery is not mastery.
What this lineage gets right: the cadence-as-anchor discipline. Banacos understood that functional hearing collapses the moment the tonal reference fades from working memory. Re-asserting it constantly, even at the cost of feeling repetitive, is the single biggest determinant of long-term retention.
Lineage 3: Bob Stoloff and the scat tradition
Bob Stoloff spent 28 years as Professor and eventually Assistant Chair in the Voice Department at Berklee, and his Scat! Vocal Improvisation Techniques (1996) is the closest thing the jazz vocal world has to a Vaccai — a graduated method book built around progressively harder vocal exercises [5]. Where Vaccai’s exercises are pure-vowel pentachord vocalises, Stoloff’s are syllable-articulated bebop arpeggios, rhythmic displacement etudes, and call-and-response patterns over ii-V-I.
The Stoloff method makes a pedagogical claim that the other three lineages largely miss: the syllables matter, and they are not arbitrary. Doo-bah-dah-bee-doo articulates swing eighth-note accents that pure vowels cannot. The scat tradition treats vocal articulation as a rhythmic skill, not just a melodic one — the consonant attacks line up with where the swing accent falls, and learning to scat fluently means learning to put the consonants in the right places relative to the beat.
What this lineage gets right: vocal production and rhythmic articulation are entangled. The Berklee functional tradition and the Banacos cadence tradition treat the voice as a tool for verifying pitch identification. The Stoloff tradition treats the voice as the site where rhythm is internalized too — and a long line of jazz singers (Lambert/Hendricks/Ross, Ella Fitzgerald, Bobby McFerrin) bear out the claim.
Lineage 4: The Sheila Jordan / Jay Clayton bass-up modal approach
The fourth lineage is harder to pin to a single book, but it is the dominant approach in modern jazz vocal programs at Manhattan School, NEC, and the New School. The pedagogy associated with Sheila Jordan and Jay Clayton inverts the usual ordering: instead of starting with melody and adding harmony, start with the bass note of every chord and build upward.
The student first sings only the root of each chord in a progression — for one beat each, then for one bar each, eventually internalizing the bassline as the harmonic skeleton. Then thirds are added, then sevenths, then upper extensions. The exercises are typically slow, often performed against a drone or a held bass note, and the chromatic vocabulary of bebop is deliberately avoided in the early stages.
What this lineage gets right: the bass is the cognitive anchor for harmony, not the melody. This claim has surprising support from the Berklee pedagogy itself — Radley’s decomposition lists bassline-listening as the first sub-skill before chord quality [2:1]. The bass-up tradition just makes that observation the centerpiece rather than one step in a longer sequence.
Where the four traditions diverge
| Question | Berklee | Banacos | Stoloff | Bass-up |
|---|---|---|---|---|
| Reference for hearing a chord | Key center + chord function | Repeatedly re-cued cadence | Vocalized chord tones | The bass note |
| First skill to drill | Decomposed sub-skills (bass, 3rd, function) | Single tone vs cadence | Vocal arpeggios with syllables | Roots only, then guide tones |
| Voice’s role | Verify pitch | Verify pitch | Embody rhythm and pitch together | Build harmony from the ground up |
| Speed expectation | Build gradually | Top speed, all keys, immediately | Tempo-locked from start | Slow, contemplative |
| Vocabulary trained | Diatonic + light chromatic | Tonal then atonal | Bebop, chromatic | Modal, diatonic |
These are not minor stylistic preferences. They reflect different theories of how the jazz ear is actually built.
What the convergence is
For all the disagreement, three things appear in every one of the four lineages — and they are probably the load-bearing claims of jazz ear training generally.
Singing is part of the perceptual task, not separate from it. All four traditions ask the student to vocalize what they hear. None of the four treat ear training as a silent listening exercise. This converges with the broader sensorimotor coupling literature [6].
Everything must eventually be done in all twelve keys. The Banacos formulation is the most extreme, but the same principle is implicit in the Berklee curriculum (which transposes drills around the cycle of fourths) and explicit in the Stoloff exercises (each etude is written once and transposed by the student). The bass-up tradition is slightly more relaxed but still expects modal facility across keys eventually.
The chord’s third and seventh are the harmonic fingerprint. The 3rd-7th guide-tone tradition appears in Berklee, in Banacos’s intervallic work, in Stoloff’s bebop arpeggio patterns, and in Jay Clayton’s approach to building harmony bass-up. These two pitches are what determine the difference between major-7, minor-7, and dominant-7 — the three chord qualities that cover most of the jazz harmonic vocabulary [7]. (See Singing the changes: how Berklee’s functional jazz ear training works for the long version.)
What this means for self-directed practice
The four traditions are different roads up the same mountain. A learner who picks one and works it hard will get there. A learner who hops between them weekly will probably stall, because the cognitive frames don’t perfectly compose.
Our recommendation, drawn from the convergence:
- Use a cadence-based reference, in the Banacos style, even if you’re not running the full Banacos exercise set. Don’t let the key fade.
- Sing everything, in the Stoloff style, even if you can’t carry a tune. The voice is not optional.
- Drill the bass first, in the Jordan/Clayton style, when learning a new tune. The harmonic skeleton is the bassline.
- Decompose what you’re missing, in the Berklee style, when stuck. “I can’t hear the changes” is too vague to act on; “I can’t tell ii from IV” is a plan.
Related reading
- Why interval drills alone don’t make you hear music — and what does
- Cadence first, scale second: how to set up your ear before every drill
- Tonic, dominant, subdominant: hearing chord function before chord name
- Sing what you hear: the sensorimotor case for vocalizing while you ear-train
References
Berklee College of Music, faculty page: Roberta Radley. https://college.berklee.edu/people/roberta-radley. See also: Berklee Online, Ear Training 1. https://online.berklee.edu/courses/ear-training-1; Berklee Online, Harmonic Ear Training: Recognizing Chord Progressions. https://online.berklee.edu/courses/harmonic-ear-training-recognizing-chord-progressions. ↩︎
Radley, R. (2009). The Real Easy Ear Training Book. Sher Music Co. https://www.shermusic.com/188321761X.php. The decomposition approach — listen for one element at a time (bassline, third, in/out of key) — is the structural innovation of the Berklee functional tradition. ↩︎ ↩︎
Kordis, L. (2014). “Top Speed and In All Keys”: Charlie Banacos’s Pedagogy of Jazz Improvisation. Academia preprint. https://www.academia.edu/16755337/_Top_Speed_and_In_All_Keys_Charlie_Banacoss_Pedagogy_of_Jazz_Improvisation_by_Lefteris_Kordis ↩︎ ↩︎
Charlie Banacos online correspondence lessons archive (continued by his estate after his 2009 passing). https://www.charliebanacos.net/. See also: Wikipedia, Charlie Banacos. https://en.wikipedia.org/wiki/Charlie_Banacos. The original cadence-based ear-training exercise is documented at https://www.miles.be/articles/the-charlie-banacos-exercise/. ↩︎
Stoloff, B. (1996). Scat! Vocal Improvisation Techniques. Gerard and Sarzin Publishing. ISBN 9780962846755. Stoloff was Professor and Assistant Chair in the Voice Department at Berklee for 28 years. ↩︎
Pfordresher, P. Q., & Brown, S. (2014). Singing ability is rooted in vocal-motor control of pitch. Discussed at length in Sing what you hear: the sensorimotor case for vocalizing while you ear-train. ↩︎
The pedagogical centrality of the 3rd–7th guide-tone pair is articulated in nearly every jazz ear-training resource. See: Learn Jazz Standards, Use Guide-Tones to Navigate Chord Changes. https://www.learnjazzstandards.com/blog/learning-jazz/jazz-theory/use-guide-tones-navigate-chord-changes/; The Jazz Resource, Guide Tones. https://www.thejazzresource.com/guide_tones.html. ↩︎