Logjam 5: I hear you
At first sight it may appear strange that “an octave” describes a factor of two, since “octo” in Latin and similar sounds in derived languages (e.g. Italian, “otto”) means “eight”. (One can have similar thoughts about October being the tenth month, but that is another story.) The reason is somewhat obscure and historical, as usual, but relates to a series of musical “intervals” — factors/ratios — between notes: 1:1, 9:8, 5:4, 4:3, 3:2, 5:3, 15:8, 2:1, giving the scale from C: C, D, E, F, G, A, B, C. [In the key of C major.] The last C is the eighth in the sequence. In musical terms the intervals with smaller integers in the ratio are more “perfect”. So 2:1 is a “perfect eighth”. The “fourth” (4:3) and “fifth” (3:2) are also considered “perfect”, along with “unison” (1:1). There are the less perfect “major” intervals of 5:4 (“third”) and 5:3 (“sixth”), or even 9:8 (“second”) and 15:8 (“seventh”).
There are three ratios between steps in the given sequence 10:9 (e.g. E:D), 9:8 (e.g. D:C), and 16:15 (F:E). The last ratio is significantly smaller (1.066 . . .) than the first two (1.11 . . . and 1.125). The gaps between the larger steps are filled with “minor” intervals, giving a 12-tone scale. There are a large number of methods to do this and a bewildering array of designations and meanings. The note inserted between F and G can be referred to as F-sharp (F♯) or G-flat (G♭), and depending on how it is done, may refer to a slightly different frequency.
This “just” or “pure” intonation tuning is fine for ensembles with a few players and instruments. However, in large orchestras there is a wide range of sounds that can be produced. Generally tuning is done according to the “chromatic” scale with the octave divided into twelve “semi-tone” steps with constant ratios. We need to find the 12-th root of 2. This can be done with logarithms, or with a calculator/computer. The answer comes out at 1.059. . . .
Putting it all together we get the comparison:
Note | m:n | m/n | delta | Semi-tone | 12-tone ratio |
---|---|---|---|---|---|
C | 1:1 | 1 | 1 | ||
♯/♭ | 1.059 | 1.059 | |||
D | 9:8 | 1.125 | 1.125 | 1.059 | 1.122 |
♯/♭ | 1.059 | 1.189 | |||
E | 5:4 | 1.25 | 1.111 | 1.059 | 1.260 |
F | 4:3 | 1.333 | 1.067 | 1.059 | 1.335 |
♯/♭ | 1.059 | 1.414 | |||
G | 3:2 | 1.5 | 1.125 | 1.059 | 1.498 |
♯/♭ | 1.059 | 1.587 | |||
A | 5:3 | 1.667 | 1.111 | 1.059 | 1.682 |
♯/♭ | 1.059 | 1.782 | |||
B | 15:8 | 1.875 | 1.125 | 1.059 | 1.888 |
C | 2:1 | 2 | 1.067 | 1.059 | 2 |
The reason for the perfect and major intervals being more “musical” is presumably related to how sound resonates in most instruments and the inner ear canal. There is a fundamental tone and then a series of overtones, which makes instruments playing the same “note” sound different. The overtones are close to a multiple of the fundamental frequency, determined by the size and material of the resonating object. The strength of the overtones tends to be less than the fundamental and die out more rapidly. The perfect intervals create more possibilities for overlapping and reinforcement of overtones, and hence a more “pleasing” sound, unless you are an awkward bastard like me that enjoys a good bit of dischordant music (e.g. the end of Mozart’s “Ein musikalischer Spaß”/“A Musical Joke”, or perhaps “Trout Mask Replica”?).
Wikipedia informs me about why we relate octave intervals:
“Monkeys experience octave equivalence, and its biological basis apparently is an octave mapping of neurons in the auditory thalamus of the mammalian brain. Studies have also shown the perception of octave equivalence in rats (Blackwell & Schlosberg 1943), human infants (Demany & Armand 1984), and musicians (Allen 1967) but not starlings (Cynx 1993), 4–9 year old children (Sergeant 1983), or nonmusicians (Allen 1967).”
These statements seem somewhat at odds with each other. Maybe “human infants” refers to exclusively to babies? How does bird song fit into this? I am sure experts in music and biology would find many imperfections and errors in the account I have given of these matters. I leave it to readers to check the veracity or not of what I have written.
Moving on, sound intensity/loudness is measured using logarithmic concepts. Generally a “decade”/base-10 scale is used — the “bel” or more precisely the decibel (a ratio x in dB ), thus 1dB represents a ratio of 1.258 . . . , the 10-th root of 10. But ratio of what? To have meaning beyond ratios, dBs need some reference level for the 0dB value. For sound, a reference “sound intensity level” (SIL) of 1pW/m2 is often used, being close to what is barely detectable by a healthy (young?) ear.
In engineering, powers are also often described using dBs, but with an added letter: dBm. In this case, 0dBm represents 1mW. One might be misled by the fact that 10dBm = 10mW to guess that 20dBm is 20mW, but in fact 20dBm = 100mW (20dB is a factor of 102 = 100).
Further confusion can arise when energy is related to the square of some quantity — e.g. kinetic energy to square velocities or sound intensity to the square of pressure fluctuation. This can lead to 20, rather than 10 appearing in a definition.
In electronics, amplifiers usually have some cut-off frequency beyond which the gain falls off, often by a factor that goes as one over the square of the frequency ( ). This behavior is expressed in models that posit parisitic resistance and capacitance in the circuit, along with components put in to ensure stability and so on. Anyway an increase in frequency by a factor of 10 (a decade), leads to a reduction in gain by a factor of 100 (20dB), giving 20dB/decade “roll-off”. Sometimes, rather than a decade, an octave (2×) increase in frequency is used as the measure, giving a 4× reduction in power (6.02 . . . dB), and hence a 6dB/octave roll-off.