EP34

EP34: The Bessel Secret of the DX7 — FM Synthesis Mathematics

Bessel旁频展开Jₙ(I), Carson带宽公式, 算子反馈分岔
6:11 Signal Processing

Overview / 概述

In 1983, Yamaha released the DX7 synthesizer at $1,995 — and it promptly rewired the sonic landscape of popular music. The iconic electric piano patches on thousands of 1980s recordings were not built from stored waveforms or analog filters. They were computed from a single equation with one free parameter. That equation is FM synthesis, and the parameter is the modulation index .

The mathematical core is the Jacobi-Anger expansion: a phase-modulated sinusoid decomposes into an infinite series of sidebands whose amplitudes are Bessel functions . Turning the modulation index from 0 to 10 is equivalent to redistributing energy across the Bessel function family — from a pure carrier at outward to , painting progressively brighter, more metallic timbres as each new partial switches on.

中文: “那个标志性的电钢琴音色,出现在上千首八十年代的金曲里——它的核心,是一个你可能在数学课上见过的函数:Bessel函数。今天我们拆开DX7的黑箱,看看FM合成背后的谱系数学。”

This episode also covers Carson’s rule for estimating FM bandwidth, and the phenomenon of operator self-feedback: when an FM operator modulates itself, the system transitions from sinusoid through sawtooth to noise-like instability — an engineering-domain version of the bifurcation cascades studied in nonlinear dynamics.


Prerequisites / 前置知识


Definitions

Definition 34.1 (FM Synthesis Signal)

Let (carrier angular frequency), (modulator angular frequency), and (modulation index) be real parameters. The FM synthesis signal is

The instantaneous phase deviation from the carrier is , so the instantaneous frequency is . The peak frequency deviation is .

中文: “当I等于零,y就是纯正弦波,只有载波频率。随着I从零增大,被调制的相位幅度增大,波形开始扭曲,不再是正弦波。频谱上,在载波频率两侧出现新的分量——称为边带。I越大,边带越多,音色越丰富、越明亮、越金属感。”

Definition 34.2 (Bessel Function of the First Kind)

The Bessel function of the first kind of order is defined by the integral

Key symmetry: .

Selected values at small argument: , for . For large with , (the function is exponentially small beyond the “turning point”), which is why only finitely many sidebands are audibly significant at any finite modulation index.

Definition 34.3 (FM Sideband)
For an FM signal with carrier and modulator , the n-th order sideband is the spectral component at angular frequency , for . Positive- sidebands lie above the carrier; negative- sidebands lie below. The component is the carrier itself (modified in amplitude). The amplitude of the -th sideband is .
Definition 34.4 (Carson Bandwidth)

The Carson bandwidth of an FM signal with modulation index and modulator frequency is

Carson’s rule captures the empirical observation that sidebands beyond order have negligible energy ( for approximately), so the occupied bandwidth is approximately sideband spacings.

Definition 34.5 (DX7 Operator and Algorithm)

A DX7 operator is an FM oscillator with controllable frequency, amplitude envelope, and feedback amount. The Yamaha DX7 contains six operators. An algorithm is a directed acyclic graph (with optional self-loops on one designated feedback operator) on the six operators, specifying which operators modulate which. The DX7 provides 32 preset algorithms. In graph terms:

  • Carriers are operators with no outgoing modulation edges (they feed the audio output directly).
  • Modulators are operators whose output feeds into a carrier’s or another modulator’s phase input.
  • Algorithm 1 is a fully serial chain: OP1 → OP2 → OP3 → OP4 → OP5 → OP6 → output.
  • Algorithm 32 is fully parallel: six independent carriers summed to output (additive synthesis).

Main Theorems / 主要定理

Theorem 34.1 (Jacobi-Anger FM Expansion)

Let . Then

where is the Bessel function of the first kind of order , and the series converges absolutely and uniformly in .

Proof.

Strategy: rewrite the FM signal using the complex exponential, apply the Jacobi-Anger expansion, then recover the sine series by taking the imaginary part.

Step 1 — Complex form. Write

Step 2 — Jacobi-Anger identity. The classical Jacobi-Anger identity states that for any real and ,

This follows from the generating-function definition of : evaluated at .

Setting and :

Step 3 — Combine and extract imaginary part.

Taking the imaginary part,

Convergence. The Bessel functions satisfy (see Theorem 34.2), so by the Cauchy-Schwarz inequality applied to the sequence ; absolute convergence of the series follows, and uniformity in follows from the uniform bound .

中文: “Jn of I就是第n阶第一类Bessel函数,参数是调制指数I。第零阶J0表示载波本身的幅度。第一阶J1表示距载波加减omega-m处的边带幅度。第二阶J2表示距载波加减2 omega-m处的边带幅度,以此类推。”

Theorem 34.2 (Bessel Energy Conservation (Parseval for FM))

For any modulation index ,

In particular, the total sideband power of an FM signal with unit amplitude equals the carrier power of an unmodulated sinusoid. FM synthesis redistributes energy across sidebands but does not create or destroy it.

Proof.

Strategy: Parseval’s theorem applied to the -periodic function .

Step 1. From the Jacobi-Anger identity, the Fourier coefficients of are

Step 2. By Parseval’s identity for ,

Step 3. Since for all ,

Since is real, , giving the stated identity.

中文: “当I很小,只有J0接近1,其他接近零——几乎纯正弦。随着I增大,能量从J0流向J1、J2、J3。不同阶的Bessel函数在不同I值处到达峰值,有的甚至变号——对应边带相位翻转。这就是调制指数控制音色的数学机制:I是在Bessel函数族中分配能量的旋钮。”

Prop 34.1 (Sideband Symmetry)

For an FM signal with real modulation index , the -th and -th sidebands satisfy . Therefore:

  1. Odd-order negative sidebands are phase-inverted relative to their positive counterparts.
  2. Even-order negative sidebands are in phase with their positive counterparts.
  3. The amplitude spectrum (plotting ) is symmetric about .
Proof.

Substitute in the integral definition:

Substitute (changing sign on both the argument and the measure):

More directly: since ,

Using the identity is more complex; instead use the generating function directly. From , replace :

But (substitute in the generating function, noting by the same argument applied to the integral with ). Equating coefficients of gives .

Theorem 34.3 (Carson's Rule for FM Bandwidth)

Let be a small threshold (typically 1% of total power). Define the essential bandwidth to be the smallest bandwidth containing all sidebands with . Then

for the conventional choice , and this approximation is tight for .

Proof.

Strategy: bound the index beyond which using the asymptotic behavior of Bessel functions.

Step 1 — Upper bound for large order. For integer and , the Bessel function satisfies

This follows from the series representation ; bounding by the first term gives .

Step 2 — When does the bound drop below ? We need . By Stirling’s approximation, grows faster than for . For the 1% threshold, numerical evaluation of Bessel zeros shows that the last significant sideband occurs at approximately , independent of over a wide range.

Step 3 — Bandwidth. With at most significant sidebands on each side of the carrier, the total occupied bandwidth (distance from lowest to highest significant sideband) is

This is Carson’s rule. The approximation reflects that the exact cutoff order depends weakly on and ; the formula is exact for (bandwidth = , the two first-order sidebands) and becomes increasingly accurate as grows.

Theorem 34.4 (Feedback FM Fixed-Point Equation)

For the simplified feedback FM model

the equation has a unique solution for each when , and this solution can be obtained as the limit of the fixed-point iteration . For , the fixed-point mapping need not be a contraction and the iteration may diverge or cycle.

Proof.

Step 1 — Contraction mapping. Define . Then

where we used (mean value theorem, since ).

Step 2 — Banach fixed-point theorem. When , the Lipschitz constant of is , so is a contraction on (restricted to the bounded range , which is invariant). By the Banach contraction mapping theorem, there exists a unique fixed point and the iteration converges to it for any starting value .

Step 3 — Failure for . When , the Lipschitz constant equals 1 and the contraction property fails. For , is an expansion near the fixed point: a small perturbation grows under iteration, destabilizing the periodic solution. In discrete implementations (such as the DX7’s one-sample delay feedback), this transition corresponds to the onset of a rich spectrum that empirically resembles a sawtooth wave for moderate and noise-like instability for large .

中文: “当I很小时,反馈量微弱,输出接近纯正弦。I稍大,波形开始向锯齿波倾斜,产生奇次和偶次谐波。I继续增大到某个临界值,系统产生类噪声的不稳定频谱——更接近工程意义上的不稳定,而非严格数学上的分岔混沌。”


Numerical Examples

Example 1 — Spectral Profile at I = 1 (Electric Piano Region)

Compute for :

Order Sideband frequency Power fraction
0 0.7652 (carrier) 58.6%
±1 0.4401 2 × 19.4% = 38.7%
±2 0.1149 2 × 1.3% = 2.6%
±3 0.0196 negligible

Check: . Energy conservation holds. Carson bandwidth: .

Example 2 — Spectral Profile at I = 5 (Brass Region)

At , Carson bandwidth is . The Bessel values show energy dispersed across orders 0–6:

Order Timbre contribution
0 −0.1776 carrier (phase-inverted, reduced)
1 −0.3276 1st sideband
2 0.0466 2nd sideband
3 0.3648 3rd sideband (dominant)
4 0.3912 4th sideband (dominant)
5 0.2774 5th sideband
6 0.1310 6th sideband

Note that : the carrier component is phase-inverted. This “hollowing out” of the carrier and redistribution into high-order sidebands is what gives brass and bell timbres their inharmonic, cutting character.

Example 3 — DX7 Algorithm Complexity

With 6 operators and carrier modulated by a chain of depth , the instantaneous phase involves iterated FM at each level. A depth-2 chain (OP1 → OP2 → carrier output) produces:

which expands to a double Bessel series . The number of significant partials grows multiplicatively, explaining why Algorithm 1 (depth-6 serial chain) produces extremely dense spectra from just six oscillators.


Musical Connection / 音乐联系

音乐联系

FM synthesis vs. subtractive synthesis exemplify two dual philosophies of timbre construction. In subtractive synthesis (

EP30

), one begins with a harmonically rich waveform (sawtooth, square) and removes partials with filters — a sculptor chipping away stone. FM synthesis does the opposite: a pure sine carrier gains sidebands as increases, building complexity from simplicity.

The DX7’s timbral vocabulary maps directly onto the Bessel function family:

Timbre Typical range Dominant Bessel orders
Sine / flute 0–0.5 only
Electric piano (Rhodes) 0.5–2
Clarinet / oboe 2–4 through
Brass / horn attack 3–7 through
Bell / gong / metal 5–12 spread across to

Cross-episode thread. In

EP19 (Vibrato)

, we studied vibrato as FM synthesis with and Hz — a rate far below the audible spectrum, producing pitch fluctuation rather than timbre change. The DX7 uses exactly the same equation with in the audio range (50 Hz–several kHz) and up to 14. The mathematical structure is identical; only the parameter regime changes the perceptual result.

Feedback and edge-of-chaos timbres. The operator self-modulation in Theorem 34.4 connects to the broader theme of nonlinear dynamics from

EP30

. In the DX7, this is implemented with a one-sample-delayed feedback loop (with averaging to suppress instability), so the exact analytical model is an approximation — but the qualitative behavior matches: small gives pure tone, moderate gives sawtooth-like richness, large gives noise-like spectra used for breath, cymbal, and wind sounds.

Computational efficiency. FM synthesis requires only sine evaluations and a few multiplications per sample — far less than the filter banks needed for subtractive synthesis or the physical modeling waveguides of digital waveguide synthesis. This is why the DX7 could produce a six-operator polyphonic synthesizer in 1983 hardware: the Bessel sideband structure is “free” — it emerges automatically from the phase equation without any explicit sideband computation.


Limits and Open Questions / 局限性与开放问题

  1. Carson’s rule is an approximation. The threshold at which a sideband is declared “negligible” is arbitrary; the true essential bandwidth depends on , , and the application context (broadcasting, audio synthesis, FM radio all use different standards). No sharp, parameter-independent bandwidth formula exists.

  2. Feedback FM is not mathematically chaotic. The DX7’s self-modulation feedback uses a one-sample delay and averaging circuit, making it a discrete-time nonlinear map rather than the continuous-time system in Theorem 34.4. Whether the discrete system exhibits true period-doubling bifurcations in the sense of EP30 (versus mere spectral broadening from aliasing and quantization) remains an open engineering question.

  3. Perceptual inversion problem. Given a target timbre (e.g., a recorded piano note), finding the FM parameters that best approximate it is an inverse problem with no closed-form solution. DX7 programmers famously found this “black magic” difficulty the instrument’s main usability barrier. Modern approaches use gradient descent on the Bessel series, but convergence to perceptually meaningful local minima is not guaranteed.

  4. Inharmonicity and real instruments. The FM sideband model places sidebands at exact multiples of the modulator frequency. Real instruments (pianos, bells) have inharmonic partials due to stiffness and geometry. The DX7 approximates these by choosing as a non-integer ratio, but this still produces exact rational relationships between partials, not the true irrational spacing of stiff-string inharmonicity.

Conjecture (Optimal FM Approximation)
For any target power spectrum supported on harmonics, the best two-operator FM approximation (minimizing mean-squared spectral error) satisfies and . Falsification criteria: exhibit a target spectrum whose optimal FM approximation requires , or show that the optimal solution uses by constructing a counter-example via numerical optimization.

Academic References / 参考文献

  1. Chowning, J. M. (1973). “The Synthesis of Complex Audio Spectra by Means of Frequency Modulation.” Journal of the Audio Engineering Society, 21(7), 526–534. (The original paper introducing FM synthesis.)

  2. Chowning, J. M., & Bristow, D. (1986). FM Theory and Applications: By Musicians for Musicians. Yamaha Music Foundation, Tokyo.

  3. Abramowitz, M., & Stegun, I. A. (Eds.) (1964). Handbook of Mathematical Functions. National Bureau of Standards. Chapter 9: Bessel Functions.

  4. Watson, G. N. (1922). A Treatise on the Theory of Bessel Functions. Cambridge University Press. (Comprehensive reference for Jacobi-Anger expansion and Bessel identities.)

  5. Carson, J. R. (1922). “Notes on the Theory of Modulation.” Proceedings of the IRE, 10(1), 57–64. (Original statement of Carson’s bandwidth rule for FM radio.)

  6. Oppenheim, A. V., & Schafer, R. W. (2010). Discrete-Time Signal Processing (3rd ed.). Pearson. Chapter 8: Frequency Analysis of Signals and Systems.

  7. Roads, C. (1996). The Computer Music Tutorial. MIT Press. Chapter 6: Frequency Modulation Synthesis, pp. 224–289.

  8. Schottstaedt, W. (1977). “The Simulation of Natural Instrument Tones Using Frequency Modulation with a Complex Modulating Wave.” Computer Music Journal, 1(4), 46–50.

  9. Dodge, C., & Jerse, T. A. (1997). Computer Music: Synthesis, Composition, and Performance (2nd ed.). Schirmer. Chapter 4: Frequency Modulation.

  10. Steiglitz, K. (1996). A Digital Signal Processing Primer. Addison-Wesley. Chapter 9: Modulation and Demodulation.