DACsOnline, ladders vs noise shaping
Learning With Errors
bits & voltages

Starter draft. The explanations lean on Schiit Happened (Jason Stoddard's book, shared free by the author) and the head-fi threads, paraphrased and cited. The five demos run real math: a quantized staircase, an R-2R ladder you can mismatch, an actual DFT of a noise-shaped signal, a Web Audio tone you can hear at each bit depth, and a reconstruction filter showing the time-frequency tradeoff.

aadharsh

every dac debate online collapses into multibit versus delta-sigma, and everyone gets religious about it. what's the actual engineering difference, underneath the tribalism?

DACs

Both turn a stream of numbers into a voltage. They split on how to rebuild the wave from the samples. Multibit builds each sample as one exact level. Delta-sigma approximates that level with a coarse switch running very fast, then hides the leftover error where your ears can't reach. Two roads to the same clean output.

aadharsh

start with multibit. that's the one that works the way I'd naively guess.

DACs

It does. An N-bit value selects one of 2N voltages, and a resistor ladder (R-2R) sums the bits into exactly that level. A 16-bit sample picks one of 65,536 voltages. The raw output is a staircase, one flat step per sample, which a filter then smooths into a curve.

the staircase · drag the bit depth

A multibit DAC builds each sample as one exact voltage, chosen from 2N levels a resistor ladder defines. Drag the bit depth: more bits means a finer staircase, closer to the smooth wave underneath.

aadharsh

back up. how does a ladder of resistors actually turn the bits into that exact voltage?

DACs

It is a chain of voltage dividers built from only two resistor values, R and 2R, one rung per bit. The trick: from any rung, the resistance looking back toward the smaller bits always works out to R, so each rung halves the contribution of the one above it. The most significant bit lands Vref/2 at the output, the next Vref/4, then Vref/8, on down. Switch a bit to Vref and it adds its rung's share, so the total is Vref times the binary value over 2N, all from two resistor values repeated.

the R-2R ladder · toggle bits, then add resistor error

An R-2R ladder turns N bits into one voltage from just two resistor values. Each raised bit adds half the previous bit's share: the MSB adds Vref/2, the next Vref/4, and so on. Toggle the bits, then crank the resistor tolerance and watch the transfer curve bend.

Vout = 0.500 Vref

The transfer curve over all 64 codes, ideal (straight) against actual. Errors in the big bits bend the line; push the tolerance and a step can run backward (red), a non-monotonic glitch. That is why a 20-bit ladder wants parts matched to about 1 part per million: the MSB resistor alone has to beat the size of the smallest step.

aadharsh

so a tiny error in the big bits wrecks it. that's where the cost goes.

DACs

Exactly. Those resistors have to be matched to brutal tolerances, which is instrumentation hardware, not audio-grade parts. Schiit's Yggdrasil runs a quad of 20-bit Analog Devices AD5791 ladders, the same parts that sit in MRI machines, because ordinary chips can't hold that line.

[src]
aadharsh

ok, delta-sigma. how do you get clean audio out of a switch that's basically on or off?

DACs

Speed plus a feedback loop. Run a 1-bit quantizer at 64 to 256 times the audio rate, and feed the quantization error back so the output works out to X(z)·z-1 + E(z)·(1 - z-1). That (1 - z-1) is a high-pass filter on the error: it shoves the noise up into ultrasonic frequencies, out of the band you hear, where a cheap analog filter wipes it. You trade amplitude precision for timing precision, which is exactly what digital is good at.

[src]
noise shaping · a live DFT

A delta-sigma DAC runs a coarse quantizer fast, with a feedback loop that shapes the quantization noise. Toggle it: the same 3-bit quantizer either way, but shaping pushes the error up out of the band you can hear (shaded).

3-bit quantizer, 1st-order shaping

Spectrum of the quantizer output. The spike is the tone; the floor is quantization noise. Shaping tilts that floor up and away from the shaded audible band.

aadharsh

huh. same coarse quantizer either way, but shaping pulls the floor down inside the band. that's the whole move?

DACs

That's the whole move. Stack more integrators and the term becomes (1 - z-1)2 or 3 for a steeper tilt. Because the quantizer is just a fast switch, it stays cheap and very linear, which is why nearly every DAC chip on the market (ESS, AKM, Cirrus) is delta-sigma and posts excellent THD and SNR.

aadharsh

both end up as a stairstep or a 1-bit stream. how does that become a smooth analog wave? something filters it, right?

DACs

The reconstruction filter, and it carries the other big optimization. The ideal one is a sinc, a perfect brick wall at 20 kHz, but a brick wall in frequency is an endless ripple in time. That is the Fourier tradeoff: you get sharpness in frequency or cleanliness in time, and tightening one loosens the other. A fast, sharp filter kills the images just above the band but rings; a slow filter rings less but rolls off some treble and lets a little imaging through. NOS DACs skip the filter (no ringing, images and all); oversampling first pushes the images far up, so a gentle filter can finish without ringing much.

reconstruction filter · time against frequency

A windowed-sinc reconstruction filter. Drag the sharpness: more taps steepen the frequency wall on the right, and the impulse response on the left rings longer to pay for it. You cannot tighten both at once.

Left: the impulse response in time, the sinc and its ringing. Right: the frequency response, flat through the shaded audio band and rolling off near Nyquist. A sharper wall on the right means longer ringing on the left.

aadharsh

so the filter shape is a taste call as much as a number on a spec sheet.

DACs

Largely yes, and a lot of the audible gap between two competent DACs lives right here, more than in the last decimal of THD. It is also Mike Moffat's whole angle: instead of the stock windowed-sinc he ships a custom closed-form filter, the kind of choice a measurement cannot fully settle.

aadharsh

so if delta-sigma measures better and costs less, why does multibit still have a cult around it?

DACs

Two honest reasons. First, coarse bits are audible when nothing shapes them away, and some listeners argue the modulator's behavior on quiet passages and decay sounds different in ways a steady-tone measurement won't show. Second, Mike Moffat, the digital half of Schiit, bet a company on multibit, the ladders plus that closed-form filter, and built a following that hears a difference. The measurable gap is tiny and often below audibility; the preference is real to the people who hold it and hard to reduce to one number.

[src]
aadharsh

I get the theory, but I still can't reconcile what people MEAN by the multibit sound versus delta-sigma. can you make that difference concrete?

DACs

The honest place it lives is low-level linearity: the soft tail of a note, where only the bottom bits are working. Multibit rounds to its nearest exact level, which down there bends the wave into harmonic distortion. Delta-sigma keeps the level honest on average and trades that for a shaped noise floor. Same quiet tone, two different imperfections. Here they are side by side, as spectra you can also play.

the contested ground · low-level linearity

Where the multibit-versus-delta-sigma argument actually lives: a quiet tone, the soft tail of a note, where only the lowest bits are working. Drag the bit depth. Multibit rounds to its nearest exact level, which at low level bends the wave into harmonic distortion; delta-sigma keeps the level honest on average but rides a shaped noise floor. Same tone, two flavors of imperfection.

multibit (round to level)
delta-sigma (noise-shaped)

Both spectra are the same quiet tone. Multibit sprouts harmonic spikes at the tone's multiples (distortion); delta-sigma stays a single clean spike on a raised noise floor (no harmonics). Push the bit depth toward 16 and both shrink below hearing.

aadharsh

so multibit adds harmonics, delta-sigma adds noise, and by 16-bit both basically vanish.

DACs

That is the whole reconciliation: pick your imperfection. Harmonic distortion can read as warmth or as grit; a noise floor reads as clean, though some hear it as veiling. On good 24-bit gear both sit below audibility, so the preference comes down to taste and system matching rather than a measurement. Real multibit DACs usually dither as well, which trades their distortion for a noise floor of their own and narrows the gap further.

hear the bit depth

Quantization noise is audible when the bit depth is low. Click to hear a 440 Hz tone quantized to each depth (about a second each, through your speakers).

Same tone, fewer levels. At 2-bit the error is loud hash; by 16-bit it's gone. Delta-sigma keeps the bits low and moves that hash above 20 kHz instead.

aadharsh

yeah, I can hear the hash at 2-bit, and it's gone by 16. and delta-sigma just shoves that hash above 20k instead of spending money to delete it.

DACs

Right. Multibit pays in hardware to make every level physically exact. Delta-sigma pays in cleverness to make the errors inaudible. Both arrive at clean sound; they just disagree about where to put the hard part, and the filter is where a lot of the audible character is decided.

aadharsh
end of first pass
This is a recorded conversation. The demos above run live: drag, toggle, and click to hear.