EP48: Spectral Orchestration — Grisey and the Roughness Curve
前置知识
Overview / 概述
A single trombone note contains twenty overtones. A spectral composer can extract those overtones with a spectrum analyzer, then redistribute each one to a different orchestral instrument — transforming the physics of a single sound into the architecture of an entire ensemble. This is spectral orchestration, the defining technique of the French spectral school (école spectrale) that emerged in the 1970s.
The mathematics behind this practice is not new. It is the harmonic series — the decomposition of a periodic sound into integer multiples of a fundamental frequency — which is identical to the Fourier series expansion encountered in EP02. What is new in the spectral school is the compositional attitude: frequency spectra are not analyzed about music, they are the musical material. The boundary between signal processing and composition disappears.
This episode traces three mathematical tools that spectral composers use explicitly: (1) the harmonic series as a spectrum template, (2) the Plomp-Levelt roughness curve as a harmonic design tool, and (3) spectral interpolation as a continuous morphing technique between consonance and dissonance.
中文: “一个大提琴低音,藏着二十个泛音。作曲家能把每个泛音,分配给不同的管弦乐器——‘管弦乐化一个频谱’。这是二十世纪下半叶,最激进的作曲方法之一。”
Prerequisites / 前置知识
- Wave Equation and Fourier Series (EP02) — the harmonic series is exactly the Fourier expansion of a periodic waveform; all spectral content in this episode is a direct application of that decomposition
- Overtone Series and Just Intonation (EP09) — the integer-ratio structure of the harmonic series, which Grisey uses as raw compositional material
- Psychoacoustic Frequency Scales (EP40) — the Bark scale and critical bandwidth are the perceptual basis for the Plomp-Levelt roughness curve
Definitions / 定义
Let be a fundamental frequency. The harmonic series generated by is the sequence of frequencies
The -th partial (overtone) has amplitude determined by the vibrating body. The resulting sound pressure waveform is
which is a finite Fourier series with fundamental period .
Example (Trombone E2): The trombone open-slide position near E2 produces a fundamental of approximately Hz. The first fifteen partials are Hz. Grisey extracted precisely these fifteen frequencies as the pitch material for Partiels (1975/76), assigning each to a different orchestral section.
The critical bandwidth at center frequency is the frequency width of one auditory filter (one Bark band) in the basilar membrane. Two tones separated by less than one critical bandwidth interact within the same auditory channel and produce beating or roughness.
Empirical approximations (Zwicker 1961):
At low frequencies (below 500 Hz), Hz is roughly constant; at higher frequencies it scales as approximately one-third of an octave. The Bark scale from EP40 partitions the audible spectrum into 24 critical bands, each of width .
Let two pure tones have frequencies with difference . Define the normalized frequency separation
The Plomp-Levelt roughness is the perceived dissonance between the two tones. Plomp and Levelt (1965) measured experimentally and found:
- (unison — no roughness)
- rises steeply, reaching a maximum near (one-quarter critical bandwidth)
- then falls, returning to near zero at (one full critical bandwidth)
- for (tones perceptually separated)
A common analytic approximation is
Usage: Spectral composers treat as a design parameter. Placing two partials so that their falls near 0.25 creates deliberate tension (roughness); placing them so creates smooth fusion.
Let and be amplitude vectors of two spectra sharing the same frequency bins. The linear spectral interpolation at parameter is
so for each partial .
As increases from 0 to 1, the composite sound morphs continuously from spectrum to spectrum . Spectral composers apply this across time: successive musical sections realize for a time sequence .
Main Theorems / 主要定理
Any periodic sound waveform with fundamental period and finite energy admits the Fourier series representation
where the coefficients and phases are uniquely determined by the waveform. The frequencies are precisely the harmonic series of .
For the Plomp-Levelt model with the approximation on , the roughness is maximized at
and empirically at for the measured Plomp-Levelt data. The latter corresponds to a frequency difference equal to one-quarter of the critical bandwidth at the mean frequency of the two tones.
For the analytic approximation , differentiate and set to zero:
Since for all finite , we require , giving for this particular smooth approximation. The discrepancy with the experimentally measured peak at reflects the asymmetry of the true roughness function: the rise to the peak is much steeper than the analytic model captures near . Plomp and Levelt’s original 1965 data, obtained from forced-choice listening experiments with sinusoidal pairs, place the maximum roughness consistently near one-quarter of the critical bandwidth. The decisive step is that roughness is a psychoacoustic (not purely physical) quantity — it is determined by the width of the auditory filter, not by the arithmetic of the frequencies alone.
Let be non-negative amplitude vectors. Then for all , the interpolated spectrum satisfies
for each partial index . In particular, (no negative amplitudes arise).
For each component, is a convex combination of and with weights and that sum to 1 and are both non-negative (since ). By the definition of convexity, any convex combination of two real numbers lies between them. Since , the minimum , so .
Compositional implication: Spectral interpolation never introduces spurious partials with negative amplitude; the morphed spectrum is always a physically valid (non-negative) amplitude envelope.
For a harmonic spectrum , adjacent partials are separated by . At low frequencies (), adjacent partials can fall within the roughness zone. For an inharmonic spectrum whose partial spacing is irregular, the aggregate roughness
is generally higher than for a harmonic spectrum with the same number of partials and similar frequency range, because inharmonic partials are more likely to land near the roughness peak .
Numerical Examples / 数值示例
Example 1 — Trombone E2 partial list:
Taking Hz (trombone open slide near E2), the first fifteen partials and their approximate frequency values are:
Grisey assigned partials 1–4 (41–164 Hz) to the string section, partials 5–9 (205–369 Hz) to woodwinds, and partials 10–15 (410–615 Hz) to brass. When played together, the ensemble reconstitutes the original spectral envelope of a single amplified trombone tone.
Example 2 — Roughness between partials 2 and 3:
The second partial is at Hz, the third at Hz. Their difference is Hz. The mean frequency is Hz, where Hz. The normalized separation is
At , the roughness curve is still well above zero (descending from the peak near 0.25), indicating moderate roughness. Grisey’s orchestration assigns these to bass strings and contrabass clarinet — instruments with rich individual timbres that blur the beating effect in performance.
Example 3 — Spectral interpolation at midpoint:
Suppose a harmonic spectrum has partial amplitudes and an inharmonic percussion spectrum has . At the midpoint :
The resulting spectrum is neither clearly harmonic nor clearly inharmonic — it sits between the two acoustic identities, producing a perceptually ambiguous, transitional timbre.
Musical Connection / 音乐联系
Grisey’s Partiels (1975/76) — Signal Processing Enters the Concert Hall
Gérard Grisey composed Partiels for eighteen musicians. The compositional process began not at a piano but at a spectrum analyzer: Grisey recorded a trombone playing low E, printed the spectrogram, read off the frequencies and relative amplitudes of the first fifteen partials, and then notated those as pitches for the orchestra — adjusting to the nearest quarter-tone where necessary.
The opening sixty seconds of Partiels are a direct sonic replica of that trombone spectrum: the double bass plays the fundamental, the cello the second partial, the viola the third, and so on up to the high brass. The composite sound is not a chord in the traditional sense — it has no root-position tonal identity, no voice-leading function. It is a spectrum made audible at orchestral scale.
The progression from the opening to the climax of the piece traverses the Plomp-Levelt roughness curve: Grisey deliberately introduces partial pairs whose separation drifts toward the roughness peak, creating waves of tension and release through spectral means rather than harmonic motion in the tonal sense. This is the precise compositional use of Definition 48.3 and Theorem 48.2.
Spectral Interpolation in Murail’s Gondwana (1980)
Tristan Murail, Grisey’s colleague in the Ensemble L’Itinéraire, developed spectral interpolation as the structural backbone of Gondwana for orchestra. The piece moves from a bell spectrum (inharmonic, high roughness) to a brass-like harmonic spectrum (low roughness) over twenty minutes. Each formal section realizes a different value in the interpolation , giving the large-scale form a directional logic grounded entirely in acoustic physics.
The Critical Bandwidth as Harmonic Grammar
In tonal music, consonance and dissonance are defined by interval ratios (just intonation) or by stylistic convention (common practice). Spectral composers replace this with a psychoacoustic grammar: an interval is consonant if its partials fall outside each other’s critical bands (), and dissonant if they overlap (). This regrounds harmony in auditory physiology rather than cultural convention — a move that connects EP40’s Bark scale directly to compositional practice.
中文: “频谱作曲家用这条曲线作为和声设计的工具:两个泛音的频率差落在粗糙区间,就产生张力;落在平滑区间,就产生融合。”
Limits and Open Questions / 局限性与开放问题
-
Quarter-tone notation vs. continuous frequency: Grisey’s score approximates spectral pitches to the nearest quarter-tone (50 cents). The trombone’s 15th partial at 615 Hz lies 16 cents flat of a notated D5. This rounding introduces systematic spectral errors — the actual roughness relationships in performance differ from those Grisey designed on paper. A complete model of spectral composition must account for the discrete pitch lattice imposed by Western notation.
-
Amplitude decay during performance: Grisey specifies dynamic markings that approximate the amplitude envelope of the trombone spectrum. But orchestral instruments have different dynamic characteristics: a pianissimo brass note does not scale linearly with a forte string note. The “reconstructed spectrum” in performance is thus a coarse approximation to the ideal Fourier decomposition.
-
Psychoacoustic individuality: The Plomp-Levelt curve is an average over listeners. Individual variation in critical bandwidth (particularly among trained vs. untrained listeners) means that a roughness peak Grisey targets at is perceived differently by different audience members. A universal psychoacoustic grammar for spectral composition remains elusive.
-
Nonlinear roughness superposition: Definition 48.3 sums pairwise roughness values independently. In practice, simultaneous presentation of many rough pairs does not produce a roughness equal to the sum of pairwise values — there is auditory masking, attention, and nonlinear combination. The additive roughness model is a first-order approximation that breaks down for dense spectra (ten or more simultaneous partials).
-
Extension to inharmonic timbres: Spectral techniques were developed primarily for instruments with near-harmonic spectra (strings, brass). Extension to percussion (bells, tam-tams) requires handling inharmonic partials whose frequencies are not integer multiples of any single fundamental — the Fourier decomposition framework of Theorem 48.1 no longer applies directly, and a more general Wigner-Ville or wavelet representation may be needed.
The Plomp-Levelt roughness function is a universal psychoacoustic invariant: the position of its maximum at is the same across cultures, regardless of musical training or familiarity with Western harmonic conventions. If confirmed, this would ground spectral composition’s claim to a culture-independent harmonic grammar entirely in basilar membrane mechanics.
Falsification criterion: A cross-cultural study presenting Plomp-Levelt tone pairs to musically naive listeners from non-Western traditions (e.g., listeners with no exposure to harmonic music) should find the same roughness peak location (within one-tenth of a critical bandwidth) as Western trained listeners. If the peak location shifts by more than 0.05 critical bandwidths across groups, the universality claim fails and the “roughness as grammar” approach is culturally conditioned.
Academic References / 参考文献
-
Plomp, R., & Levelt, W. J. M. (1965). Tonal consonance and critical bandwidth. Journal of the Acoustical Society of America, 38(4), 548–560.
-
Grisey, G. (1987). Tempus ex machina: A composer’s reflections on musical time. Contemporary Music Review, 2(1), 239–275.
-
Murail, T. (2005). The revolution of complex sounds. Contemporary Music Review, 24(2–3), 121–135.
-
Zwicker, E., & Fastl, H. (1999). Psychoacoustics: Facts and Models (2nd ed.). Springer.
-
Fineberg, J. (2000). Spectral music. Contemporary Music Review, 19(2), 1–5.
-
Rose, F. (1996). Introduction to the pitch organization of French spectral music. Perspectives of New Music, 34(2), 6–39.
-
Sethares, W. A. (2005). Tuning, Timbre, Spectrum, Scale (2nd ed.). Springer. Ch. 3 (Consonance and Dissonance).
-
Hasegawa, R. (2009). Gérard Grisey and the “nature” of harmony. Music Analysis, 28(2–3), 349–371.
-
Terhardt, E. (1974). Pitch, consonance, and harmony. Journal of the Acoustical Society of America, 55(5), 1061–1069.
-
Anderson, J. (2000). A provisional history of spectral music. Contemporary Music Review, 19(2), 7–22.
-
Murail, T. (1984). Spectra and pixies. Contemporary Music Review, 1(1), 157–170.
-
Bregman, A. S. (1990). Auditory Scene Analysis: The Perceptual Organization of Sound. MIT Press. Ch. 2 (Spectral fusion).
-
Huron, D. (2001). Tone and voice: A derivation of the rules of voice-leading from perceptual principles. Music Perception, 19(1), 1–64.