In this lab, we’re going to step away from the textbook and into the recording studio. We’re going to test Zipf’s Law—the rule that says the most common word in a text will appear twice as often as the second most common, and three times as often as the third. Does this "1/n" relationship hold up when a beat is involved? Let’s find out.
The goal of this lab is to see if a 3-minute song contains enough "data" to trigger the Power Law. Usually, Zipf’s Law is easiest to see in massive books like Ulysses, but even in a short song, the "shape" of the language should start to emerge.
The Lab Setup
Select Your Subject: Pick a song with a decent amount of lyrics (avoid instrumental tracks or songs that are 90% "La La La").
The Raw Data: Print out the lyrics. Using a highlighter, find the most common word (The "Rank 1" word). It’s often "I," "you," "the," or the main word of the chorus.
The Count: Count how many times Rank 1 appears. Let's say it appears 30 times.
The Prediction: Based on Zipf’s Law, how many times should the Rank 2 word appear? (30 ÷ 2 = 15). How about Rank 3? (30 ÷ 3 = 10).
The Reality Check: Count the actual occurrences of the 2nd and 3rd most frequent words. How close was the math to the reality?
This isn't just a counting exercise; it’s an introduction to Rank-Size Distributions.
When students plot their song's words on a graph (Rank on the X-axis, Frequency on the Y-axis), they will see a steep curve that levels out into a "long tail." This is a Power Law curve. It’s the same curve that describes how wealth is distributed in a country or how many people live in different cities.
The most exciting part of this lab is discussing why it happens. Is it because the songwriter is lazy? Or is it because human brains are wired to balance "new information" with "familiar structure"?
In music, we need the "Rank 1" words to ground the song, giving our ears a place to rest between the more unique, descriptive words that give the song its meaning. Zipf’s Law is the mathematical proof of that balance.
Classroom Discussion Questions
The Chorus Factor: How does a repetitive chorus "distort" Zipf’s Law? Does it make the Rank 1 word even more dominant?
Genre Comparison: Do rap songs (which typically have a higher unique word count) follow the law more closely than pop songs?
The "Zero" Problem: What happens to the law when you get to the 50th or 100th ranked word?
Have fun with this lab. It is designed to introduce students in a fun way. Let me know what you think, I'd love to hear. Have a wonderful weekend.