Graveyard
I wrote all kinds of additional text and widgets that did not fit with the main story that much. Hence this bonus content. If you like the mini-course so far, or some of the widgets dumped below interest you, check them out!
Bonus chapters
I wrote three chapters that show some nice stuff connected to KL divergence & entropy.
Kolmogorov Complexity
Mandelbrot set looks incredibly complex, yet it's generated by a very short formula.
Kolmogorov complexity gives us the language to talk about all this, and much more.
For example, it tells us a lot about pseudorandomness.
Coin Flip Randomness Tester
Click coins to build a sequence, then see if it passes statistical tests for randomness
Current Sequence (0 flips)
Click coins above to build your sequence
It's also the ultimate answer to how similar files are.
More in the Kolmogorov chapter.
Multiplicative Weights Update
Here's a riddle that's behind a ton of recent algorithms in CS and ML. Say you wanna get rich trading stocks. Lucky you— investors share their tips every day. Each day , they give advice, and afterward you find out how they did. For investor , you get - how many dollars she gained that day.
Your strategy: Start with equal trust in everyone— for all investors. Each morning, randomly pick an investor based on your trust distribution and follow their advice. After seeing how everyone did, update your distribution to .
How should you update? What's best? 1
Try it yourself with the widget below:
Multiplicative Weights Update in Action
Select Algorithms
Choose Scenario
Also, here's how we can use it in algorithms, for noisy binary search problem.
Noisy Binary Search Visualization
If you got interested, continue to the multiplicative weights chapter.
Fisher Information
A typical US election poll asks 1000-2000 random people. Do the math and you'll find such polls are usually within 2-3% of the truth.2 Pretty wild—it doesn't even matter how many people live in the country, 1000 random ones get you within a few percent!
But here's the thing: US elections are super close. We already know both parties will get around 50%. So maybe we should poll more people (or combine polls) to get within 0.1%. How many people would that take?
Explore the relationship in the widget below.
Polling Error Calculator
Required Sample Size
Quick Scenarios
More in the fisher info chapter.
Bonus random content
More on entropy and KL divergence applications; cut from the third chapter.
Mutual Information Explorer
Joint Distribution P(Weather, Transport)
More max entropy distributions, cut from the max-entropy chapter.