[3] Bosi, M., & Goldberg, R. E. (2003). Introduction to Digital Audio Coding and Standards . Springer.
Dialnorm = -31 + 10 * log10( (1/N) * Σ x_i² ) where ( x_i ) are the dialogue-gated samples.
DEE selects the frame size adaptively. For low-delay applications (e.g., video conferencing), it can force 256-sample frames in E-AC-3, reducing algorithmic delay to 5.3 ms at 48 kHz. Dialnorm is a metadata parameter that indicates the average loudness of dialogue relative to -31 dB FS. DEE implements a Leq(A) measurement over a sliding window of 5 seconds, as per ITU-R BS.1770-4. dolby encoding engine
[4] ITU-R BS.1770-4. (2015). Algorithms to measure audio programme loudness and true-peak audio level . International Telecommunication Union.
[5] PEAQ – ITU-R BS.1387-1. (2001). Method for objective measurements of perceived audio quality . Dolby Encoding Engine, AC-3, E-AC-3, Dolby Atmos, audio compression, broadcast, low-latency encoding, dialnorm. [3] Bosi, M
Abstract: The proliferation of multi-channel audio formats (5.1, 7.1, Atmos) and object-based audio has created a need for efficient, real-time encoding solutions. The Dolby Encoding Engine (DEE) serves as a software-based, enterprise-grade tool for transcoding Pulse Code Modulation (PCM) audio into Dolby Digital (AC-3), Dolby Digital Plus (E-AC-3), and Dolby Atmos metadata. This paper examines the system architecture of DEE, its psychoacoustic modeling core, bit-rate reduction techniques, and its role in live broadcast, over-the-top (OTT) streaming, and post-production workflows. We compare its latency profile against hardware-based encoders and analyze its compliance with the ATSC 3.0 and DVB standards. 1. Introduction As consumer audio moved from stereo to immersive formats, broadcasters and streaming services required a software-defined method to encode multichannel audio without dedicated DSP chips. Dolby Laboratories introduced the Dolby Encoding Engine (DEE) as a cross-platform (Linux, Windows, macOS) solution that integrates into transcoding pipelines (e.g., FFmpeg, AWS Elemental, Harmonic).
[2] ATSC Standard A/52:2020. (2020). Digital Audio Compression (AC-3, E-AC-3) . Advanced Television Systems Committee. Introduction to Digital Audio Coding and Standards
| Metric | DEE (Software) | DP580 (Hardware) | |--------------------------|-------------------------|---------------------------| | Latency (5.1, 384 kbps) | 18.2 ms | 16.0 ms | | CPU usage (per stream) | 12% (Xeon Gold 6248) | N/A | | Concurrent streams (1U) | 32 (at 3.0 GHz) | 1 (per card) | | Power consumption | 150 W (server) | 25 W (encoder) | | Metadata flexibility | Full (API-driven) | Limited (front panel) |