Canned Treat: The Headphone Listening Experience

From listening & recording as a teenager
to the canned treats of smartphones

Mirror Mirror On the Wall
Where are the worst reflections of all?

[Italian Version Here]

Phenomenon: Headphone Listening
Manufacturer: Many
Price: From peanuts to thousands
Author: Mark Wheeler - TNT UK
Published: February, 2026

Headphones and Mental Health

This article is aimed at anyone who uses headphones at work or at home or traveling, as well as those audiophiles hoping for the best sound quality. These headphones might be for shutting out the overwhelming world on your daily commute, or for counselling a person in the midst of a crisis, as much as for an absorbing symphony. Headphones now have a significant role in our lives well beyond the private enjoyment of music.

There has been a recent behavioural change from experiencing music, and other broadcast material via one or two loudspeakers, to listening through tiny in-ear headphones popularly known as earbuds. Such tiny earpieces might be familiar to a 1920s crystal set user. Headphones are also an essential tool of the call centre operator. The growth in delivery of online counselling, often using headphones, has run parallel to these social changes. That all these roles mostly seem to be facilitated by similar headphone products, worn similarly, seems distinctly odd, given the differences between loudspeakers in a railway station, stadium concert, kitchen radio or domestic Hi-Fi system. Only the one-ear call centre headset seems to be tailored specifically to its intended job.

To get the best use from any tool means knowing how best to use it in each context. The end user must know too how to choose the correct tool for the job. Those offering online counselling, for example, would be better buying something fit for that purpose rather than relying on the ear buds and lanyard mic that was bundled with their cellphone. Purpose built USB plug-in desktop speaker/mic units are available. Online counsellors/psychotherapists also need to consider what their own voice sounds like to the client and whether their clients should not use headphones at all. Do counsellors really want the earbud effect of their client's voice apparently emerging from a point source in the back of their heads?

The Recording Studio

Those singers using headphones in the studio will not want spillage from their cans to the expensive microphone. Your Old Scribe loathed the effect of the Koss Pro4AA that were ubiquitous in studios back in the day, but they were there for good reason. My then preferred Koss HV1a, being both open backed and equipped with open cell foam earpads, would be completely unsuitable on that side of the glass. Your Old Scribe also disliked the fit and feeble headband of the equally ubiquitous contemporary Sennheiser HD414/HD424 despite any sound quality merits. However, your Old Scribe often uses the latest closed back circumaural HD280 PRO in comfort when total 2-way isolation is needed. Already we must realise that there is no one ‘correct' type of headphone.

Safety First

For safety reasons, many headphone users must have the sounds of the outside world clearly audible as well as the headphone output. Any proximity to traffic should be a clear contraindication to wearing isolating headphones outside.

Listening context, listener head shape and listener ear pinna shape are as important as capsule frequency response in choosing and using headphones. Audiophiles will want headphones that most emulate a real space experience. For some this might mean headphones spaced out from the ear pinnae. In some circumstances closed back headphones will be necessary. In some cases, for example under protective headgear, in-ear headphones are the only possible option.

The Pyschological Aesthetics of Headphone Limitations

Our ability to identify and locate any a sound is a key evolutionary achievement. In our thousands of years evolving as successful hunter-gatherers, those who were good at identifying and locating sounds were more likely to grow old enough to breed. Therefore our ancestors were a selective breeding programme for hearing acuity and sound source location ability. If they weren't good at identifying a sound source's identity and location very quickly they either became food for a predator or failed to find food for themselves. Either way that group did not become our forebears.

We identify the identity of sounds from the content of the waveform and the shape of the waveform envelope. We soon learn that a certain mix of frequencies delivered a certain way means that a small branch has just been snapped nearby. We can even tell how big a branch has snapped and therefore how big a force snapped that branch. Is it something to eat or something to eat us?

We also learn as infants where that event occurred in play and everyday experience, much as we learn how long are our arms and how far to reach things. We have two ears to help with this plus the ear extensions on the sides of our heads to improve that directivity discrimination. The visible ear part, the pinna, is like the horn of an old fashioned gramophone but compressed flatter to fit under hats.

"Ears evolved to fit under hats?" Query plebs, stage left. "Only if they're listening to Homburg by Procul Harum" add those plebs determined to out-pun the Old Scribe.

Evolutionary Hearing

These two ears are either side of our heads to improve this location perception accuracy. Prey tend to have their eyes on each side of the head for similar defensive perceptual reasons whereas humans are predators so our eyes are arranged like owls and hawks, on the front of our heads..

We position a sound source very quickly by a succession of auditory processes. Our head is the essential tool. It casts a sound shadow over one ear if the sound is from anywhere other than exactly central, whether that is centrally forward, backward, above or below. The masking effect of the head creates a slight difference in level between left and right ears. It is this level difference that the stereo mixing desk pan-pot control exploits, to create a level difference between two loudspeakers or two headphone capsules. Primarily this helps us to locate a sound approximately on an imaginary horizontal plane.

The next process our ear-brain system employs is more sophisticated. Despite that, it is processed at the same time as the difference in sound level experienced by each ear. We perceive the sound arriving at different times. The more complex the sound (a cracking twig is such a complex sound) the better and faster we can discriminate its positional clues. A constant sine wave is very difficult to source locate, regardless of its pitch.

Our pinnae, those earflaps used as objects of ridicule if they are prominent, provide the next information. Over the early years of our lives we learned how sound is altered by its relative position to our pinnae. Thus we can often in a moment, not only identify roughly where a sound is relative to our left and right, but also sometimes whether it is in front of or behind us. Our outer ears even help us identify the height of a sound source relative to use as the frequency response is changed at different heights relative to our heads. The LEDR test depends on our developed height discrimination and is therefore different for each individual.

The actual shape and texture of our pinnae alters, or distorts, the frequency response of what arrives down our ear canals. Each pinna imprints a unique character, or frequency shaping on the sounds, depending on their direction of arrival at our head. This creates additional data for front-back and up-down discrimination. We can call the sum of these parts so far, our audio propriaception for want of a better term. Early lessons in how to perceive and operate in our near environment kindle the neural pathways to enable this individual perceptual capacity.

Stationary Binaural Viewpoint

The above conditions assume that for some obscure reason our head is held by a clamp. Perhaps we are in a nineteenth century photographer's studio having our portrait made for our new cartes de visite. In most circumstances we are constantly moving our heads, much as our eyes are constantly making tiny movements to keep a constant visual data stream.

When we listen to headphones or a conventional stereo signal coming from two loudspeakers, the best we can hope for is that the source recording has more than one of these processes encoded. However, it is most likely (except in the case of dummy head or soundfield recordings) that only the level difference is present. With headphones of all types we are deprived of most of the spatial data that we are evolved to utilise.

When we hear an unexpected sound we often tilt our head or turn slightly to try to add a new data set to this, whether consciously or unconsciously. The sighted among us will also use our eyes to try to add more information about the possible sabre toothed tiger or unusually aggressive piano lurking nearby.

When we use headphones we alter all the parameters. The sound source is now not one single source but two sources in an unusually fixed relationship really close to our heads.

The Uncan is an April Fools' Joke

Our unique pinnae shape changes the sound characteristics poured into our ear canals. From birth each pinna imprints a unique character that we learn to decode. As we grow (and our pinnae continue growing throughout our lives) our ear-brain kindles and thickens neural pathways to help us decode auditory data. These processes never cease and may be a contributory factor to dementia being accelerated by hearing loss. The problem underlying this spatial clue system is its complexity to understand and model and it is very specific to each individual's pinnae. This may be a reason why it is seldom used in stereophonic recordings, because it comes from the fact that these frequency changes are incredibly subtle, complex and ever changing.

Some people have their ear pinnae pinned back for aesthetic reasons. Their ability to enjoy a full range of sounds and spatial clues will be compromised. Audiophiles might wish to consider the opposite form of cosmetic surgery: pinna extensions.

Sometimes it is helpful to be deprived of spatial complexity in order to improve intelligibility. This is handy when trying to hear exactly what someone is saying, hence headphone use by audio typists and espionage agents. Sometimes excluding external noise is really useful. Sometimes what we are hearing would be an irritation to others. These are essential headphone occasions.

Conclusion

Utility is the main driver of headphone listening. We inevitably all have a different experience with headphones. Recorded music has become less important to the earbud generation than previous generations, while live music (especially festivals) has become more important to them (based on available consumer spending data as an indicator). The essential early consumer purchase of a Hi-Fi system, as a rite of passage, has been replaced by the games console and smart-phone, both of which are more likely to be heard via headphones. Perhaps this drives millennials and GenZ to venues where they can experience live music directly.

There will always be something lacking in the headphone experience. Knowing why and how these mechanisms work may eventually result in circuits like our April Fool's Uncan becoming truly effective and built into the circuitry of smart-phones and headphone amplifiers. Until then, your Old Scribe will stick to big loudspeakers wherever possible.

Did you enjoy this article? You can support our free, advertising-free magazine with a small Paypal donation using the button below. Thank you! [Love]

[Donate with Paypal!]

DISCLAIMER. TNT-Audio is neither a shop, nor a HiFi company or a repair laboratory for HiFi components. We don't sell anything. It is a 100% independent magazine that neither accepts advertising from companies nor requires readers to register or pay for subscriptions. If you wish, you can support our independent reviews via a PayPal donation. After publication of reviews, the authors do not retain samples other than on long-term loan for further evaluation or comparison with later-received gear. Hence, all contents are written free of any “editorial” or “advertising” influence, and all reviews in this publication, positive or negative, reflect the independent opinions of their respective authors. TNT-Audio will publish all manufacturer responses, subject to the reviewer's right to reply in turn.

[Follow us on Facebook]

© 2026 Mark Wheeler - mark@tnt-audio.com - www.tnt-audio.com