Hacker News

How does it deal with the audio ring buffers on the various devices? Does it just try to start them all at the same time, or does it take into account the sample position within the buffer?


Great question! There are two steps:

First, I do clock synchronization with a central server so that all clients can agree on a time reference.
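A common way to do this kind of clock sync is an NTP-style ping/pong exchange, where the client estimates the server's clock offset assuming roughly symmetric network delay. A minimal sketch (names are illustrative, not the project's actual code):

```javascript
// t0: client send time, t1: server receive time,
// t2: server send time, t3: client receive time (all in seconds)
function estimateOffset(t0, t1, t2, t3) {
  // Time actually spent on the wire, excluding server processing
  const roundTrip = (t3 - t0) - (t2 - t1);
  // Estimated (server clock - client clock), assuming symmetric delay
  const offset = ((t1 - t0) + (t2 - t3)) / 2;
  return { offset, roundTrip };
}

// Clients typically repeat the exchange and keep the sample with the
// smallest round trip, since it suffers least from queuing jitter.
function bestOffset(samples) {
  return samples.reduce((a, b) => (b.roundTrip < a.roundTrip ? b : a)).offset;
}
```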

Then, instead of directly manipulating the hardware audio ring buffers (which browsers don't allow), I use the Web Audio API's scheduling system to play audio in the future at a specific start time, on all devices.

So a central server relays messages from clients, telling them when to start and which sample position in the buffer to start from.
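Putting those two pieces together, a relayed start message can be translated into the `(when, offset)` arguments that the Web Audio API's `AudioBufferSourceNode.start(when, offset)` expects. A sketch, assuming a hypothetical message shape and the server-minus-client clock offset from the sync step, with all times in seconds:

```javascript
// msg = { serverStartTime, startSample, sampleRate } -- hypothetical shape
// clockOffset: estimated (server clock - client clock), in seconds
// clientNow:   current client wall-clock time, in seconds
// contextNow:  current AudioContext.currentTime, in seconds
function toStartArgs(msg, clockOffset, clientNow, contextNow) {
  // Map the server's start time onto the client's wall clock
  const localStartTime = msg.serverStartTime - clockOffset;
  // Map wall-clock time onto the AudioContext's own timeline
  const when = contextNow + (localStartTime - clientNow);
  // Convert the sample position into a seek offset in seconds
  const offset = msg.startSample / msg.sampleRate;
  return { when, offset };
}

// In a browser this would be used roughly as:
//   const { when, offset } =
//     toStartArgs(msg, clockOffset, Date.now() / 1000, ctx.currentTime);
//   source.start(when, offset);
```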


Interesting. Feels like this might still have some noticeable tens-of-milliseconds latency on Windows, where the default audio drivers still have high latency. The browser may intend to play the sound at time t, but when it calls the Windows API to play the sound, I'm guessing it doesn't apply a negative time offset?


So it doesn't need to use the microphone? I'm guessing that from the "works across the ocean" comment and this description. I would have thought you'd listen to the mic and sync based on surrounding audio somehow, but it's good to know that's not needed.


Yup, no microphone. It's all clock sync.


Another issue is seeking in compressed audio. When seeking (to sync), some APIs snap to frame boundaries.


I solved this by decompressing the whole file into memory as PCM.
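Once the file is raw PCM in memory, seeking is just indexing into the sample array, so there are no codec frame boundaries to snap to. A minimal sketch (the function name is hypothetical; in a browser the PCM would typically come from `AudioContext.decodeAudioData`):

```javascript
// pcm: decoded mono samples as a Float32Array
// sampleRate: samples per second of the decoded audio
// seconds: desired seek position
function seekPcm(pcm, sampleRate, seconds) {
  // Exact sample index -- no snapping to compressed-frame boundaries
  const startIndex = Math.round(seconds * sampleRate);
  // A view starting at that sample (no copy)
  return pcm.subarray(startIndex);
}
```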


This is my question: does it do interpolation or pitch bending?



