Sing what you hear: the sensorimotor case for vocalizing while you ear-train
A growing literature on vocal-motor coupling shows that singing what you hear is not separate from learning to hear — it is part of how perception is built.
Most people who download an ear-training app sit silently while they use it. They listen, they tap an answer, they move on. The voice is left out of the loop entirely, and on the surface this seems fine — ear training is listening, after all. The mouth is a separate department.
The research suggests this is the wrong model. Vocal pitch production and pitch perception share neural infrastructure to a degree most musicians underestimate. If you are not using your voice while you train your ear, you are likely leaving meaningful gains on the table.
The vocal-motor coupling literature
Peter Pfordresher’s lab at Buffalo has produced a long line of studies on the connection between singing accuracy and pitch perception. A 2014 paper, “Singing ability is rooted in vocal-motor control of pitch,” found that pitch and interval matching in singing were substantially more accurate when participants imitated sung references than when they imitated the same intervals played on a piano [1]. The same listener, given the same interval, was a better matcher when the source was a voice.
This is not a quirk of preference. It points to an integration in the brain between the systems that perceive pitch and the systems that produce pitch — what researchers call sensorimotor coupling. The brain that hears a sung note is partly simulating the production of that note in the singer’s voice; the brain that produces a sung note is partly hearing it before it emerges.
A more recent study found that singing ability — controlling for self-reported musical experience and for raw pitch-discrimination ability — uniquely predicted accuracy on a vocal emotion-recognition task [2]. Performers who could produce pitch accurately were better at understanding pitched vocal signals. The effect ran through processes specific to vocal sensorimotor integration.
Why this matters for ear training
Two implications follow.
1. Singing back is part of the perceptual task. When you hear a note, sing it, and check your match, you are training the perception–production loop directly. You are not just rehearsing perception — you are calibrating the internal model your ear uses to “place” the note inside your own vocal range. This is why traditional aural-skills programs at conservatories integrate sight-singing and ear training, often in the same course. They are not two separate skills hopeful to transfer to each other; they are the same skill viewed from two angles [3].
2. The ear builds faster when the voice is involved. Pomerleau-Turcotte and colleagues studied predictors of sight-singing performance among university music students and found that aural dictation skill was the strongest predictor of sight-singing performance — and conversely [4]. Pfordresher’s range-of-pitch studies suggest the causal arrow runs in both directions: training one improves the other [5].
There is also a more pragmatic argument. Singing forces a level of commitment to a heard pitch that silent quiz-taking does not. When you tap “major third” you have made a categorical decision. When you sing the third before tapping, you have committed to the actual pitch — you have placed it in your body. That commitment surfaces uncertainty earlier, which is exactly what good practice should do.
What you don’t need to be a “good singer” for any of this
A common objection: I’m not a singer. The research is not about singing performance. It is about using vocal production as a tool to train auditory perception. Pitch-matching accuracy improves with practice in nearly every population studied, including people who initially identify as poor singers. Pfordresher and Greenspon’s 2025 study on pitch-range training showed measurable accuracy improvements with relatively short interventions [5:1].
What the research does suggest is that you need some feedback on your match. Singing into a void teaches you to commit to a pitch. Singing while a tuner or pitch detector tells you whether you hit it — within, say, 30–50 cents — turns the feedback loop into something the brain can learn from.
How to incorporate it
A few practical suggestions, all supported by the literature:
- Sing the answer before you tap it. When you hear a scale-degree question, sing what you think the degree is, audibly enough that you commit. Then check.
- Use a sing-back exercise daily. A short routine — hear a note, match it, get visual feedback — reliably improves pitch-matching accuracy [5:2]. Fifths’ Foundations segment has Match the Pitch and Sing the Pivot exactly for this purpose; a tuner app on your phone with a played reference works just as well.
- Sing scale degrees with solfège or with numbers. The verbal label couples to the production. This is the core of the Kodály and Gordon traditions and aligns with the broader sensorimotor-integration finding that perceptual learning is enhanced by motor enactment [6].
- Hum if singing is not socially possible. Humming engages the same vocal-motor system at lower volume; it is not as accurate as singing but it is much better than silent listening.
The connection to mental imagery
There is one more layer worth knowing about. fMRI studies of musicians have shown that imagining a melody activates the auditory cortex in ways that overlap substantially with actually hearing it [7]. Imagined performance also activates motor and premotor areas [8]. In other words, the brain treats imagined music and produced music as related operations on shared circuitry.
This suggests a third practice mode worth experimenting with: hear a note or a short phrase, then imagine singing it back before you sing it for real. The additional cognitive work — actively summoning the auditory image rather than passively replaying it — appears to engage the same systems that get strengthened by overt singing. The literature on this is younger and more tentative than the singing-back literature, but the underlying mechanism is well-attested.
The takeaway
Your voice is not a separate department from your ear. They share neural infrastructure, and they train each other. The most under-used adjustment in self-directed ear training is, almost embarrassingly, also the cheapest: stop being silent while you do it.
If you can only change one thing about your practice, sing what you hear. The studies on vocal-motor coupling are convergent, the effects are robust, and the bar to start is exactly your willingness to make a sound.
References
Pfordresher, P. Q., Halpern, A. R., & Greenspon, E. B. (2014). A mechanism for sensorimotor translation in singing: The Multi-Modal Imagery Association (MMIA) model. Music Perception, 32(3), 242–253. See also: Pfordresher, P. Q., & Brown, S. (2007). Poor-pitch singing in the absence of “tone deafness.” Music Perception, 25(2), 95–115. Springer-indexed: https://link.springer.com/article/10.3758/s13414-014-0732-1 ↩︎
Mazzocconi, C., et al. (2022). Singing ability is related to vocal emotion recognition: Evidence for shared sensorimotor processing across speech and music. Attention, Perception, & Psychophysics. https://doi.org/10.3758/s13414-022-02613-0. PubMed: https://pubmed.ncbi.nlm.nih.gov/36380148/ ↩︎
Cleland, K. D., & Dobrea-Grindahl, M. (2021). Developing Musicianship through Aural Skills: A Holistic Approach to Sight Singing and Ear Training (2nd ed.). Routledge. ↩︎
Pomerleau-Turcotte, J., Moreno Sala, M. T., Dubé, F., & Vachon, F. (2022). Experiential and Cognitive Predictors of Sight-Singing Performance in Music Higher Education. Journal of Research in Music Education, 70(3). https://doi.org/10.1177/00224294211049425 ↩︎
Pfordresher, P. Q., & Greenspon, E. B. (2025). Effects of pitch range on singing accuracy training. Quarterly Journal of Experimental Psychology. https://doi.org/10.1177/10298649241289542 ↩︎ ↩︎ ↩︎
Hyde, K. L., et al. (2009). Musical training shapes structural brain development. Journal of Neuroscience, 29(10), 3019–3025. For broader review: Herholz, S. C., & Zatorre, R. J. (2012). Musical training as a framework for brain plasticity: behavior, function, and structure. Neuron, 76(3), 486–502. ↩︎
Halpern, A. R., & Zatorre, R. J. (1999). When that tune runs through your head: A PET investigation of auditory imagery for familiar melodies. Cerebral Cortex, 9(7), 697–704. See also the recent overview: Voluntary musical imagery in music practice: contextual meaning, neuroscientific mechanisms and practical applications. Frontiers in Psychology (2024). https://doi.org/10.3389/fpsyg.2024.1452179 ↩︎
Meister, I. G., et al. (2004). Playing piano in the mind — an fMRI study on music imagery and performance in pianists. Cognitive Brain Research. https://doi.org/10.1016/j.cogbrainres.2004.01.005 ↩︎