Reservoir Sampling Visualizer

⚙️ Setup

Stream Size (n)

Items flowing past, drawn one at a time.

Reservoir Size (k)

How many to keep. Must be ≤ n.

Algorithm

R tests every item. L skips ahead, finishing in an expected O(k(1+log(n/k))) steps.

Step Delay (ms)

0 = instant. Use 0 for big runs.

Seed (optional)

Mulberry32 PRNG for reproducible runs.
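
For readers who want to reproduce a seeded run outside the page, here is a minimal Python port of the commonly published Mulberry32 routine. It is a sketch: the name mulberry32 and the closure-based interface are choices made for illustration, and whether the visualizer's own generator matches it bit for bit is an assumption.

```python
from typing import Callable

MASK32 = 0xFFFFFFFF  # keep every intermediate value in 32 bits


def mulberry32(seed: int) -> Callable[[], float]:
    """Return a function that yields floats in [0, 1) from a 32-bit seed."""
    state = seed & MASK32

    def next_float() -> float:
        nonlocal state
        # Advance the 32-bit counter by the Mulberry32 increment.
        state = (state + 0x6D2B79F5) & MASK32
        t = state
        # Two multiply-xorshift mixing rounds, truncated to 32 bits.
        t = ((t ^ (t >> 15)) * (t | 1)) & MASK32
        t = ((t + ((t ^ (t >> 7)) * (t | 61))) & MASK32) ^ t
        # Final xorshift, then scale the 32-bit result into [0, 1).
        return ((t ^ (t >> 14)) & MASK32) / 4294967296

    return next_float
```

Two generators built from the same seed produce the same sequence, which is what makes a seeded run repeatable.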

📈 Uniformity Test

After a Monte Carlo run, every item j ∈ [1, n] should appear in the reservoir in a fraction k/n of the trials. The bars below show the empirical inclusion frequency per stream position. A flat line at k/n, up to sampling noise, is consistent with an unbiased algorithm.
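
The check described above can be sketched offline as follows. The function names are hypothetical, and the standard library's random.sample stands in for the sampler under test; any function taking (stream, k, rng) and returning k items could be swapped in.

```python
import math
import random
from typing import Callable, Iterable, List


def inclusion_frequencies(
    sampler: Callable[[Iterable[int], int, random.Random], List[int]],
    n: int,
    k: int,
    trials: int,
    seed: int = 0,
) -> List[float]:
    """Fraction of trials in which each stream position 1..n was sampled."""
    rng = random.Random(seed)
    counts = [0] * n
    for _ in range(trials):
        for item in sampler(range(1, n + 1), k, rng):
            counts[item - 1] += 1
    return [c / trials for c in counts]


def standard_error(n: int, k: int, trials: int) -> float:
    """Expected noise of one bar: sqrt(p(1-p)/trials) with p = k/n."""
    p = k / n
    return math.sqrt(p * (1 - p) / trials)


def stdlib_sampler(stream: Iterable[int], k: int, rng: random.Random) -> List[int]:
    # Stand-in sampler (an assumption for this sketch), not a reservoir method.
    return rng.sample(list(stream), k)
```

With n = 10, k = 3 and 20,000 trials, each bar should sit near 0.3 with a standard error of roughly 0.003, so deviations much larger than that point to bias rather than noise.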

Run a Monte Carlo simulation to populate the chart.

How It Works

Algorithm R: fill the reservoir with the first k items. For each subsequent item i (1-indexed, i > k), pick an integer j ∈ [1, i] uniformly at random. If j ≤ k, replace slot j with item i; otherwise discard the item. After n items, each one is in the final sample with probability k/n.
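
The steps of Algorithm R can be sketched in a few lines of Python. The function name reservoir_r and its rng parameter are illustrative assumptions, not the visualizer's own code.

```python
import random
from typing import Iterable, List, Optional, TypeVar

T = TypeVar("T")


def reservoir_r(
    stream: Iterable[T], k: int, rng: Optional[random.Random] = None
) -> List[T]:
    """Keep a uniform sample of up to k items from a stream of unknown length."""
    rng = rng or random.Random()
    reservoir: List[T] = []
    for i, item in enumerate(stream, start=1):  # i is the 1-indexed position
        if i <= k:
            reservoir.append(item)  # the first k items fill the reservoir
        else:
            j = rng.randint(1, i)  # uniform integer in [1, i], both ends included
            if j <= k:
                reservoir[j - 1] = item  # replace slot j (stored at index j - 1)
    return reservoir
```

For example, reservoir_r(range(1, 101), 5, random.Random(42)) returns five distinct values from 1 to 100. A stream shorter than k is returned whole.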

Algorithm L: instead of testing every item, draw a geometric "skip" count floor(log(random()) / log(1-W)), discard that many items, and write the next one into a uniformly random slot. W starts at 1 and is multiplied by exp(log(random())/k) once before the first skip and again after each replacement. Same uniformity guarantee, far fewer random draws on huge streams.
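
A hedged Python sketch of Algorithm L follows. The names reservoir_l, _open_unit and _skip_count are hypothetical, and the guards for W reaching 0 or 1 are defensive additions for floating-point edge cases rather than part of the textbook algorithm.

```python
import math
import random
import sys
from itertools import islice
from typing import Iterable, List, Optional, TypeVar

T = TypeVar("T")
_DONE = object()  # sentinel marking the end of the stream


def _open_unit(rng: random.Random) -> float:
    """Uniform draw on the open interval (0, 1), so log() is always finite."""
    x = rng.random()
    while x == 0.0:
        x = rng.random()
    return x


def _skip_count(w: float, rng: random.Random) -> int:
    """floor(log(u) / log(1 - w)), capped so islice accepts it."""
    if w >= 1.0:  # reachable only through rounding: skip nothing
        return 0
    if w <= 0.0:  # reachable only through underflow: skip everything
        return sys.maxsize
    ratio = math.log(_open_unit(rng)) / math.log1p(-w)
    return int(min(ratio, sys.maxsize))


def reservoir_l(
    stream: Iterable[T], k: int, rng: Optional[random.Random] = None
) -> List[T]:
    if k <= 0:
        return []
    rng = rng or random.Random()
    it = iter(stream)
    reservoir: List[T] = list(islice(it, k))  # fill with the first k items
    if len(reservoir) < k:
        return reservoir
    w = math.exp(math.log(_open_unit(rng)) / k)  # W = 1 times the first factor
    while True:
        skip = _skip_count(w, rng)
        # Discard `skip` items, then take the one that follows them.
        item = next(islice(it, skip, None), _DONE)
        if item is _DONE:
            return reservoir
        reservoir[rng.randrange(k)] = item  # overwrite a uniformly random slot
        w *= math.exp(math.log(_open_unit(rng)) / k)
```

Each replacement costs three random draws (skip, slot, and the update of W), so the number of draws grows with the number of replacements rather than with the stream length.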

P[item i in sample] = k/n for all i ∈ [1, n]
Expected random draws: R = n − k, L = O(k(1 + ln(n/k)))
Memory: O(k). The stream length n need not be known in advance.
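
The k/n claim can be checked by hand for Algorithm R. In the derivation below, item i > k enters the reservoir with probability k/i, and at each later step t it is overwritten only if the new item is accepted (probability k/t) and lands in its slot (probability 1/k). The product telescopes:

```latex
\begin{align*}
P[\text{item } i \text{ in sample}]
  &= \underbrace{\frac{k}{i}}_{\text{enters at step } i}
     \prod_{t=i+1}^{n}
     \underbrace{\left(1 - \frac{k}{t}\cdot\frac{1}{k}\right)}_{\text{survives step } t} \\
  &= \frac{k}{i} \prod_{t=i+1}^{n} \frac{t-1}{t}
   = \frac{k}{i}\cdot\frac{i}{n}
   = \frac{k}{n}, \qquad i > k.
\end{align*}
```

For i ≤ k the item enters with probability 1 and the product runs from t = k + 1 to n, which again gives k/n.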
