Scalable Audio Compression
Using Wavelet Packet Decomposition and Embedded Zero Tree Coding
Abstract
Multimedia
transmission over Internet is getting popular and increasingly important.
In particular, scalable coding is desirable for heterogeneous network with
varies bandwidths. In this work, we propose a scalable embedded zero tree wavelet
packet (EZWP) audio coding system that is a scalable audio compression
system using wavelet packet decomposition and embedded zero-tree coding. We
focus on multi-layer low bitrate coding which delivers high perceptual quality.
In the base layer, the overlapped audio segment is first transformed by wavelet
packet. Then the local significant coefficients are extracted, quantized,
and coded by variable length coding. In the enhancement layer and the full
band layer, the residual signal that is the difference between the original
and the output of the previous layer is coded via EZW with psychoacoustic
model and arithmetic coding. The target bit rates for three layers are 16,
32, and 64 Kbps, respectively. The performance of the proposed coding system
is only slightly inferior to MPEG-1 layer 3 while it provides bitrate scalability.
Therefore, it is suitable for multimedia distribution over Internet that is
composed of heterogeneous networks.