EP13: Polyrhythm and Rhythm-Pitch Duality
前置知识
Overview
This is Episode 4 of 4 in the Jacob Collier series — the finale of the arc.
Jacob Collier’s mastery of both rhythm and harmony looks like two separate gifts. This episode argues it is one: the ability to perceive integer ratios across all time scales. A polyrhythm of and a perfect fifth are mathematically identical — the ratio — separated only by a perceptual threshold near 20 Hz.
中文: “节奏和和声,是同一种数学在不同时间尺度上的表达。”
The episode develops three interlocking results:
- LCM governs the period of any polyrhythm.
- GCD governs irreducibility and structural richness — the same theorem used in EP12 to generate the circle of fifths.
- A rhythm accelerated past ~20 Hz becomes a perfect fifth — the perceptual boundary is not a mathematical one.
Prerequisites
- Combination Tones and Nonlinearity (EP09) — frequency ratios and perceived intervals
- Tuning Systems and Irrational Numbers (EP11) — is irrational; the Pythagorean comma
- The Circle of Fifths as a Generator (EP12) — generates all 12 pitch classes
Definitions
A polyrhythm is the simultaneous sounding of beats against beats within the same time span . The beat durations are and respectively. The two sequences share the same start and end point but divide the interval differently.
Special cases:
- — hemiola: two against three; the most common polyrhythm in Western art music.
- , , — increasingly complex combinations.
Given a polyrhythm , the common grid is the finest uniform subdivision that contains all beat onsets of both rhythmic streams. The number of grid cells per span is .
More generally, for a -layer polyrhythm with layers , the common grid has subdivisions.
The rhythm-pitch threshold is the critical repetition rate, approximately 20 Hz (individual variation: 15–30 Hz), below which the auditory system perceives a periodic pulse as a rhythm (individual events are counted), and above which it perceives the pulse as a pitch (a continuous tone at that frequency).
This threshold is a property of auditory processing, not of the signal itself. The mathematical structure — a periodic sequence with period — is identical on both sides of the boundary.
Given two periodic signals with frequencies and in ratio (lowest terms), their combined waveform repeats with period . The Helmholtz consonance of the interval is defined as the repetition frequency of the combined waveform:
Higher (i.e., smaller , i.e., simpler integer ratio) corresponds to greater consonance.
Main Theorems
In a polyrhythm , the two rhythmic streams realign (both streams have a beat onset simultaneously) for the first time after exactly minimal grid subdivisions, and then periodically every subdivisions thereafter.
For Jacob Collier’s five-layer polyrhythm :
A complete cycle requires 60 minimal subdivisions.
Let the minimal subdivision have duration . Stream places beats at positions , and stream at — all integers, since and .
The first positive position where both streams align is the smallest positive integer with and , which by definition is . All subsequent alignments occur at integer multiples of .
For the five-layer case: , , , .
A polyrhythm is irreducible (cannot be simplified to a coarser polyrhythm) if and only if .
If , then , a simpler polyrhythm with the same structure repeated times.
Furthermore, the complexity of the polyrhythm — measured by — is maximized when .
Write and with . Then . The polyrhythm consists of complete repetitions of the irreducible polyrhythm , each of duration .
When : , the maximum possible value. No simplification exists.
Connection to EP12: there, ensured that adding 7 semitones repeatedly visited all 12 pitch classes before returning to the start — the generator of . Here, ensures the polyrhythm visits all grid positions before repeating. Same theorem, different domain.
Let denote the rhythmic sequence produced by two streams at repetition rates and Hz. As increases:
- For : the auditory system perceives two independent rhythmic streams with ratio .
- For : the auditory system perceives two simultaneous pitches at frequencies and Hz, forming the musical interval with frequency ratio .
The mathematical structure is invariant under this perceptual transition: the ratio is preserved. The transition is a property of auditory processing, not of the acoustic signal.
Corollary: The polyrhythm at produces pitches at 200 Hz and 300 Hz — a perfect fifth ( frequency ratio, approximately 702 cents).
The two pulse trains have frequencies and . Their Fourier spectra have energy concentrated at integer multiples of and respectively.
When Hz: the inter-onset interval ms. The auditory system’s temporal resolution (~1–5 ms) can track individual onsets, and the brain counts discrete events as rhythm.
When Hz: the inter-onset interval ms. The auditory system integrates the pulse train as a continuous periodic signal (pitch fusion). The perceived pitch corresponds to the repetition frequency .
The frequency ratio is identical in both regimes. The perceptual change is a consequence of the auditory system’s temporal integration window, not a mathematical discontinuity.
Let be a fundamental frequency. The integer multiples (the second through sixth harmonics) form the frequency ratios .
These ratios correspond to:
- — major third (386 cents just intonation)
- — perfect fifth (702 cents)
- — just major triad
- — the same integer sequence that defines Jacob Collier’s five-layer polyrhythm.
Consequently, accelerating a polyrhythm past 20 Hz yields a just major chord above .
The Collier Limit: 30:31
Jacob Collier performs a polyrhythm. This is an extreme edge case:
A full cycle requires 930 minimal subdivisions — near the limit of human temporal tracking. The perceptual effect resembles irrational ratios without actually being irrational.
Compare to the result from EP11: is irrational, so the sequence — powers of 2 and powers of 3 never coincide (except at 1). A true polyrhythm would never realign.
The ratio is rational (realigns after 930 units) but approximates the perceptual experience of irrationality. This is an aesthetic strategy: maintain mathematical controllability while approaching the boundary of perceptual chaos.
中文: “他在整数比的世界里,寻找最接近无理数感觉的配置。这就是他的策略:保持理性结构的可控性,同时推向复杂性的极限。”
Musical Connection
The Arc in Four Perspectives
This episode closes a four-part arc on the mathematics of :
- EP10: Symmetry — the reflection and rotation group of the twelve-tone system.
- EP11: Incommensurability — the Pythagorean comma; irrational.
- EP12: Generation — generates all of from a single element.
- EP13: Scale invariance — the same integer ratios appear in rhythm (below 20 Hz) and pitch (above 20 Hz).
Jacob Collier does not use two separate skills — harmonic fluency and rhythmic complexity. He uses one skill: perceiving integer ratios as structures invariant across time scales. The LCM that governs when two rhythmic streams align is the inverse of the Helmholtz consonance that governs how smooth a musical interval sounds. The GCD that makes the generator of the circle of fifths is the same GCD that makes the signature of hemiola’s maximal complexity.
Euclidean Rhythms (connection to EP12): Distribute beats as evenly as possible among subdivisions. The Bjorklund algorithm for this distribution is mathematically equivalent to the Euclidean algorithm for computing — the same algorithm that establishes the generator property in EP12. African bell patterns, Cuban clave, and the bossa nova pattern are all Euclidean rhythms.
Rhythmic canons: Tiling with a pattern and its translates is a number-theoretic problem. A rhythm tiles if there exists a set such that every element of is covered by exactly one translate , . This is equivalent to a factorization of the cyclic group — directly related to EP04’s group structure.
Limits and Open Questions
-
The 20 Hz threshold is not sharp. Individual variation spans approximately 15–30 Hz. The “boundary” is a statistical description of population-level auditory processing, not a physical constant. Some listeners with absolute pitch or musical training may perceive rhythm-to-pitch transitions earlier.
-
Euclidean rhythms and the Toussaint conjecture. Godfried Toussaint conjectured (2005) that all “good” rhythms used in world music are Euclidean rhythms. While many examples are confirmed, a complete classification of “good rhythms” remains open.
-
Rhythmic canons and the Fuglede conjecture. Determining which subsets of can tile the group (rhythmic canons) is connected to spectral properties of the set via the discrete Fourier transform. Fuglede’s conjecture (1974) proposed an equivalence between tiling and spectrality; it was disproved in for but remains open in and for .
-
Cognitive limits of polyrhythm tracking. Jacob Collier’s has . The cognitive limit for simultaneous rhythmic tracking is not well established theoretically. Empirical studies suggest 3–4 independent rhythmic streams as a working memory limit, but virtuosic performance (Collier, Nancarrow) routinely exceeds this — suggesting different neural mechanisms than explicit beat counting.
-
Forward to EP26. The power spectral density of rhythmic patterns and the fractal structure of musical rhythm (Hurst exponent analysis) generalize these ideas. A fractal rhythm is self-similar across time scales — the rhythm-pitch duality of this episode is the limiting case of that self-similarity as scale goes to zero.
Academic References
-
Toussaint, G. T. (2013). The Geometry of Musical Rhythm. CRC Press. — Euclidean rhythms, Bjorklund algorithm, tiling problems.
-
Helmholtz, H. von (1863). On the Sensations of Tone (trans. A. J. Ellis, 1954). Dover. — Consonance as repetition frequency of the combined waveform; .
-
Risset, J.-C., & Wessel, D. L. (1999). Exploration of timbre by analysis and synthesis. In The Psychology of Music (2nd ed.), ed. Deutsch. Ch. 5. — Rhythm-to-pitch continuum; auditory streaming and temporal resolution.
-
Vuust, P., & Witek, M. A. G. (2014). Rhythmic complexity and predictive coding: A novel approach to modeling rhythm and meter perception in music. Frontiers in Psychology, 5, 1111. — Cognitive models of polyrhythm perception.
-
Lerdahl, F., & Jackendoff, R. (1983). A Generative Theory of Tonal Music. MIT Press. — Metric hierarchy and grouping structure.
-
Bjorklund, E. (2003). The theory of rep-rate pattern generation in the SNS timing system. Los Alamos National Laboratory Report SNS-NOTE-CNTRL-99. — Original algorithm for Euclidean rhythms (discovered independently; also Toussaint 2005).
-
Tymoczko, D. (2011). A Geometry of Music. Oxford University Press. Ch. 2 — Voice leading and pitch-class intervals as elements of ; connections to EP04.