Scalable Audio Compression Using Wavelet Packet Decomposition and Embedded Zero Tree Coding

Abstract

Multimedia transmission over the Internet is becoming increasingly popular and important. In particular, scalable coding is desirable for heterogeneous networks with varying bandwidths. In this work, we propose an embedded zero-tree wavelet packet (EZWP) audio coding system, a scalable audio compression system based on wavelet packet decomposition and embedded zero-tree coding. We focus on multi-layer, low-bitrate coding that delivers high perceptual quality. In the base layer, each overlapped audio segment is first transformed by a wavelet packet decomposition. The locally significant coefficients are then extracted, quantized, and coded by variable-length coding. In the enhancement layer and the full-band layer, the residual signal, that is, the difference between the original signal and the output of the previous layer, is coded via embedded zero-tree wavelet (EZW) coding with a psychoacoustic model and arithmetic coding. The target bit rates for the three layers are 16, 32, and 64 kbps, respectively. The performance of the proposed coding system is only slightly inferior to that of MPEG-1 Layer 3, while it additionally provides bitrate scalability. It is therefore suitable for multimedia distribution over the Internet, which is composed of heterogeneous networks.
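
The layered structure described above can be illustrated with a minimal sketch. It is not the EZWP codec: the wavelet packet transform, zero-tree coding, psychoacoustic model, and entropy coding are all replaced here by a plain uniform quantizer, and the function names, sample values, and step sizes are hypothetical. The sketch only shows how each layer codes the residual left by the previous layers, so that a decoder can stop after any layer and still obtain a usable reconstruction.

```python
def quantize(values, step):
    """Uniform scalar quantizer: index of the nearest multiple of step."""
    return [round(v / step) for v in values]


def dequantize(indices, step):
    """Map quantizer indices back to reconstruction values."""
    return [i * step for i in indices]


def encode_layers(signal, steps):
    """Code the signal as successive residual layers, one per step size."""
    layers = []
    residual = list(signal)
    for step in steps:
        indices = quantize(residual, step)
        layers.append(indices)
        recon = dequantize(indices, step)
        # The next layer sees only what this layer failed to represent.
        residual = [r - q for r, q in zip(residual, recon)]
    return layers


def decode_layers(layers, steps, num_layers):
    """Reconstruct using only the first num_layers layers."""
    out = [0.0] * len(layers[0])
    for indices, step in zip(layers[:num_layers], steps[:num_layers]):
        out = [o + i * step for o, i in zip(out, indices)]
    return out


def max_error(a, b):
    """Largest absolute difference between two equal-length sequences."""
    return max(abs(x - y) for x, y in zip(a, b))


# Hypothetical coefficients and step sizes (base, enhancement, full band).
signal = [0.90, -0.35, 0.12, 0.58, -0.77, 0.04]
steps = [0.5, 0.125, 0.03125]

layers = encode_layers(signal, steps)
errors = [max_error(signal, decode_layers(layers, steps, k)) for k in (1, 2, 3)]
print(errors)
```

With round-to-nearest quantization, the reconstruction error after decoding k layers is bounded by half the step size of layer k, so each additional layer received tightens the bound. In the proposed system the analogous refinement comes from EZW coding of the residual rather than from a finer uniform quantizer.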