DVD grabs (click)















































































|
|
Unwanted Frequencies
by Trond S. Trondsen
New technology demands exacting audio standards
A closer listen to some recent DVD releases

The main purpose of this article is to equip the reader with the basic
knowledge required in order to qualify and quantify a suspected authoring error in a DVD's audio
track. Identification of such problems may provide the impetus towards returning the disc to the
store and to seek out an alt.-region release not subject to the same defect, or, in more egregious cases
it may provide the basis for a call for the manufacturer to recall the defective product.
It is my (perhaps deluded) conviction that better equipped and informed consumers will eventually lead
to better products, as more DVD companies begin to realize that they can no longer get away
with their reprehensible quality assurance & control practices. Some very elementary principles of
digital signal analysis will be reviewed, and some related techniques are directly applied to the analysis of
the audio track of some recent DVD releases.
Figure 1 shows a sine wave with a frequency of 440 Hertz.
In music, this is referred to as the A above middle C.
440 Hz simply means that the wave goes through 440 full cycles each second, resulting in a wave period of
1/440 seconds, i.e., about 0.002 seconds, which can be seen in Figure 1 as the time needed for the
sine wave to complete one cycle. Click here to listen to the tone (mp3).
Figure 1 represents the wave in the so-called "time domain," where the horizontal axis represents time, and
the vertical axis represents amplitude. It is however often more constructive to look at a
signal in the "frequency domain." Figure 2 shows the exact same signal,
but this time in the frequency domain. In this representation, we have the frequency (Hz) along the horizontal axis,
and the "intensity" of each frequency contained within the signal along the vertical axis. In this case, we see a single
peak at the frequency 440 Hz (= 0.440 kHz). This informs us that our signal contains, of course,
a strong component of 440 Hz and nothing much else. The sinusoidal waveform is the most basic repetitive
pattern in existence. A sine wave consists of only one single frequency, without any harmonics, or
(in musicians' terms) "overtones."
But what if the frequency content of a signal changes with time?
To study the evolution of a signal's frequency content, we need a sonogram. This is essentially
a series of frequency domain plots (as shown in Figure 2) stacked side by side, one for each instant in time. Figure 3
shows a sonogram for the above 440 Hz tone. Time runs along the horizontal axis, frequency along the vertical axis, and "intensity" (the 3rd dimension, coming out
of the plane of your computer monitor) is now represented by the use of colors.
We notice, quite as expected, the presence of a pure 440 Hz (0.440 kHz) tone for the duration of the 2-second audio clip.
Let's look at a more complex signal, such as Arvo Pärt's beautiful Ode I (mp3, excerpt).
Figure 4 shows, along the top, the corresponding sonogram, and below it
we see a zoomed-in view of the portion selected by the light blue box at the top. The horizontal traces in the
sonogram indicate sustained voices at various frequencies — a signature we indeed expected to see, based on
what we just heard in the audio clip. Now let's do the unthinkable and pollute the audio clip using this
annoying sound — a 10 kHz sine wave. The modified music piece sounds like this,
with Figure 5 showing the resulting sonogram. The sonogram is identical to
that of Figure 4, except for the addition of an unwelcome 10 kHz frequency component which fades in about 2 seconds into the clip.
Such a 10 kHz noise is easily spotted and eliminated by audio engineers, and would thus
not be present in the final product: the DVD or CD we buy in the store.
But what if a somewhat higher frequency noise accidentally found its way into the system somewhere along the way? If the audio engineer is
not aware of its presence, it may very well end up on the final digital master. The average range of hearing for children is from a low of 20 Hz to a high of 20 kHz.
By the time a person hits the age of 20, years of attending loud concerts may already have killed off the 'high tones'.
Studies show that the high range for an average young adult is about 16,000 Hertz (16 kHz). In another 10 years, a 30 year
old is often down to a mere 12,000 Hertz (12 kHz). Such a person would simply not be able to detect the presence of, say,
a 14 kHz squeal in an audio track. Unfortunately for those of us who have somewhat above-average hearing, many CD and DVD studios appear to
be hiring such average (in terms of hearing) personnel. Yours truly, age 40 and perfectly able to hear well past 17,000 Hz,
has, along with a not insignificant fraction of the population, for decades had to put up with such high-pitched squeals —
and there is generally no sympathy to be received from family, friends and DVD/CD store managers. They are simply unable to hear what is
emanating from the loudspeakers and promptly attribute your complaint to a bad case of tinnitus. Classical music CDs have traditionally been
the worst offender, but we have noticed a worrisome trend in that similar problems are increasingly starting to crop up on DVD audio tracks.
Let us use some very simple digital signal processing techniques to study this phenomenon and some other noise-related artifacts.
By way of example, we will take a closer look at some recent popular DVD releases.
ONCE UPON A TIME IN THE WEST (Leone, 1967) Paramount, R1/NTSC [HUB_ID: AB06830AR0SP0 J03K0214A]
This is a brilliant piece of cinema, and it is presented here in a truly excellent video transfer.
However, what the numerous rave reviews scattered across the Internet neglect to point out is that the original English soundtrack suffers from a serious defect.
Listen, for example, to this audio sample, taken from 01:47:27 into the movie:
[ mp3 | aiff ].
Those of you with an acute sense of hearing will most likely get an instant headache from the high-pitched squeal ("ringing"), even
on fairly low-end loudspeakers and headphones.
The corresponding plot of amplitude as a function of time is shown in Figure 6.
It looks pretty normal. But let us take a closer look at, for example, the particular moment in time we have marked with a vertical yellow cursor.
Looking at this moment in the frequency
domain, we see what is shown in Figure 7. Notice especially the prominent peak just below
15 kHz. The exact frequency is 13800 Hz. A sonogram of the entire clip is shown in Figure 8.
Note the 13.8 kHz trace, more or less present throughout the clip. This incredibly high-pitched squeal reappears at frequent
intervals throughout the movie, regrettably making watching
the DVD a very unpleasant experience. Here is yet another example, from 01:04:50 into the film:
[ mp3 | aiff ].
The amplitude plot is shown in Figure 9, and the frequency domain plot
from the window in time marked in red is shown in Figure 10. Again, note the presence of
an unwanted, highly distracting prominence just below 15 kHz. The sonogram is shown in Figure 11.
You may find it instructive to play the presented aiff files in a piece of free software called
Frequency.
RUSSIAN ARK (Sokurov, 2002) Seville, R1/NTSC [HUB_ID: LT36 23053DVL0 #30802Z13]
Another highly anticipated DVD release, another immense disappointment. This Seville release, a direct port of the Wellspring release, suffers from a very similar problem to
that of the above discussed disc.
A high pitched ringing starts only a few minutes into the film, riding on top of the voice of the "invisible filmmaker" character.
Starting at 00:46:11 into the film there is an otherwise quiet moment, allowing us to have a closer listen at the interfering component:
[ mp3 | aiff ].
The ringing gradually starts to become noticeable about halfway into the clip, and by the end of the clip it is almost unbearable, threatening to
shatter one's wine glasses in the cupboard.
Figure 12 shows the clip's amplitude plot, wherein nothing abnormal is evident.
The frequency domain representation of the signal within the portion marked with a red band is shown in Figure 13.
Note the prominence at 15000 Hz.
Figure 14 shows the frequency content a few seconds later, within the time slot marked in
Figure 15. Notice how the interfering 15000 Hz component has increased in power from time 00:46:17 to 00:46:22.
The sonogram shows this increase quite clearly, in Figure 16. Again, this kind of plot is proof positive that the disc is
suffering from a serious defect.
U2 GO HOME (Hamish Hamilton, 2003)
Universal, R1/NTSC [HUB_ID: LT36 0249811969L0 #31004Z04]
This disc, while not exactly a masterpiece of Cinema, serves to illustrate yet another kind of audio problem which can be studied,
quantified, and brought to the attention of the manufacturer.
In terms of musical performance, pure inspiration, raw energy, and unpolished, intimate, footage, U2 Go Home: Live from Slane Castle is
the by far best U2 concert DVD to date, and perhaps one of the best live concert DVDs out there.
But the PCM Stereo track — which is what this author likes to listen to on headphones while traveling — is hopelessly messed up.
Listen to this audio clip (this is the left channel), starting at 00:49:25 into the film:
[ mp3 | aiff ].
You will immediately become aware of periodic injections of white/pink noise into the audio channel. The clip's sonogram shows this
very clearly, in Figure 17. This defect appears in the left channel at frequent intervals throughout the film,
and often during exceptionally intense and inspired musical moments. DVD reviews found on the Internet praise the PCM Stereo track as "excellent."
We beg to differ.
LES DAMES DU BOIS DE BOULOGNE (Bresson, '45)
Criterion, R0/NTSC [HUB_ID: DVSS-073590A1 1]
Several people have called to our attention a "cyclic noise" on the audio track of this Criterion release of Robert Bresson's 1945 film.
The following is an audio clip from early on in the film, where the protagonist is riding
the elevator up to her apartment: [ mp3 | aiff ].
The "whirring" sound is normal, it is the sound of the elevator, but the larger-scale "pulsations" are not — they continue after the elevator stops.
The sonogram for this clip is shown in Figure 18, wherein the "pulsations" are clearly
evident. The pulse width is 0.6 seconds, and the pulsation period is 0.7 seconds. The noise clearly originates with the inferior source materials
used by Criterion. Any attempt at digitally cleaning up the audio track
would presumably have done more damage than good in this case. While it is very easy to remove a
discrete-frequency signal, such as the 15 kHz ringing described above, by applying clever retouching in the frequency domain before transforming back into the time domain,
it is very hard to remove the kind of broadband noise exhibited by this disc.
STRANGER THAN PARADISE (Jarmusch, 1984)
MGM, R1/NTSC [HUB_ID: LV 000981.1A VA01 Mastered by wamo]
Les Dames du Bois de Boulogne was made in 1945, and Stranger Than Paradise was made almost 40 years later.
Yet, the latter exhibits a pulsating noise remarkably similar to that of the former (discussed above).
Here is a sample audio clip, starting at 00:07:25 into the film: [ mp3 | aiff ].
The corresponding sonogram is shown in Figure 19 (contrast enhanced for clarity).
The top shows the entire 80-second audio clip.
The three bursts towards the end is the phone ringing three times,
each time followed by Eva telling Willy that the phone is ringing. At
22 s we see horizontal structure corresponding to car tires screeching outside the window. Other well-defined
vertical (broadband) features are environmental noises generated by Eva herself. The
bottom view shows the first (selected) portion in close-up. We call your
attention to two artifacts. Firstly, the lack of all frequencies in the important
1.4–1.6 kHz regime, contributing to a rather "hollow" sound.
It should not surprise us in the least if it turns out that the observed
frequency dropout is due to an attempt by MGM to filter away some interfering signal
component in that range, in the process throwing the baby out with the bath water.
Secondly, a low-frequency
pulsating noise (similar to that exhibited by the Les Dames disc), which is very audible and distracting, manifesting
itself as periodic excursions as indicated. To
the viewer this sounds like a constant "whoosh-whoosh-whoosh" at a
1-second periodicity. This is heard during large portions of the
first half of the film. This author does not recall hearing this
effect in the theatre. The pulsating noise heard on this release, as well as on the Les Dames disc, is presumably
due to actual physical damage to the optical soundtrack running up the side of the film. Damage
occurring to a wound reel might explain the cyclical nature of the problem.
In both cases the DVD companies may be faulted for choosing to base their release
upon obviously damaged source elements.
CONCLUDING REMARKS
New technology brings with itself a demand for less
subjective, more exacting audio standards. The desired
audio signal is today preserved at higher fidelity than
ever, throughout the entire production chain, but the same
can be said of any unwanted signals. This is clearly the
case in our first two examples above. A quick history
lesson: in the not too distant past, certain audio recording
devices would employ a 50 Hz "pilot" tone, modulated at
around 14 kHz. The tone was recorded onto quarter-inch tape
along with the dialog. It was later filtered away, during
transfer to the film's magnetic track. Any residual
high-frequency crud would "automatically" disappear upon
later transfer to the (limited bandwidth) optical track. We do
not know if this particular process is to be blamed for the
observed artifacts. We mention it here merely as an
illustration of the fact that the "old way" of doing things (with
its inherent assumptions) combined with new
technology could easily cause headache (literally) to the
fraction of the population with unusually acute hearing. In
the case of Once Upon a Time in the West and
Russian Ark, a simple spectral analysis on the final
product would immediately have revealed that all was not
well. The faulty discs should never have reached the market.
In the case of the white noise injections on U2 Go
Home, even our semi-audiophile neighbor's kid, brought up
on Playstation and mp3's, would immediately have spotted
the problem. If this noise injection happened during the
recording stage itself, and Universal was aware of the issue, why
not do as some Japanese studios do: slap a
sticker on the DVD warning the potential customer
of a less-than-subtle problem with the soundtrack "due to sub-optimal source material."
Which is exactly what MGM and (the otherwise so excellent) Criterion
should have done on their Stranger than Paradise and
Les Dames disc, respectively.
RESOURCES
It has not been our intention here to teach on the topic of
digital signal analysis — we have merely reviewed some very basic principles to help
you perform your own simple experiments at home (something DVD companies may not wish you to do).
This is a wide and fascinating field, and further in-depth study is highly recommended.
Some helpful resources are listed here.
The Fourier transform is among the most widely used tools for transforming data sequences between the time domain and the frequency domain.
A problem with the Fourier Transform is that there is a tradeoff
between precision in the time domain and
frequency domain. This tradeoff is a manifestation of the
Heisenberg Uncertainty Principle. The Fourier Transform provides
very good frequency resolution but very poor time resolution. The
Wavelet Transform, conceptually a superset of the Fourier
Transform, addresses this problem by providing more control over
the tradeoff.
The audio analysis for this article was performed simply by feeding the line-level audio output
of a Toshiba DVD player to the audio input of our Apple G4 Titanium 17" PowerBook. The audio stream
was then sampled with a free program called Audio In.
Subsequent Fourier analysis was done using Frequency.
The (free) demo-version of AudioXplorer was also of some help.
There is presumably an abundance of equivalent software available for Intel-based architecture and associated soundcards.
ACKNOWLEDGMENTS
Trond wishes to thank Nick Wrigley for putting his Ableton Live skills to work to help improve this article, as well as
Lasse Ulander, Kyle Armstrong, and Titus Mathews Jr. for constant friendship and helpful advice.
-------------
by Trond Trondsen, March 2004
Copyright © 2004 mastersofcinema.org
|
|
DVD grabs (click)
















































































|
|