Performance of ADPCM in Frame Erasures

Introduction :

       Many present and proposed wireless communications systems employ the speech standards, Adaptive Differential Pulse Code (ADPCM), because it offers high quality, low delay, and low complexity. When transmission errors occur,  the coded speech will have error propagations. And, transmission  impairments may cause the loss of entire frames of coded speech, the performance of speech will decay rapidly. So, we will measure the performance of speech in frame erasures.

¡@

¡@

ADPCM 

              

      

¡@

      ADPCM is a waveform speech coding. it is used to adaptive quantization and adaptive prediction.

¡@

¡@

Objective :

      In this final project, we report the performance of the ADPCM when frame erasures occur. And, we try to find  better methods which can reconstruct speech during erasures to improve the quality of the corrupted speech.

¡@

¡@

Four Methods of Speech Reconstruction During Erasures :

                    a) Silence Substitution :

       When a frame erasure is detected, all the bits in that frame are  replaced by the lowest quantization level. A simple error detection scheme or a signal from the channel decoder indicating a frame error is sufficient to perform silence substitution.

                   b) Fixed Frame Repeat :

       The input to the decoder is buffered and when a frame erasure is detected, samples delayed by 10 ms were introduced to the decoder input. If frame erasures are longer than 25 ms, the input is scaled down to the lowest quantization level.

                    c) State Reserving :

       The state is saved when frame erasure is detected. The state in the end of frame erasures is replaced by reserving the state saved in the beginning of frame erasure.

                    d) Encoder and Decoder Reset simultaneously :

       The encoder and decoder reset simultaneously at a constant interval. The quality of encoder and decoder reset simultaneously is poor than that of no reset during no frame erasure. But, in high Frame Error Rate (FER) the performance of encoder and decoder reset simultaneously is better than that of no reset.

¡@

Simulation :

                    Environment :

                                    a) speech samples
                                        - sampling rate : 8 kHz
                                        - speech length : 6.97 sec

                                    b) ADPCM
                                         - linear
                                        - quantization bits : 4 bits (data rate : 32 kbps)

                                    c) frame length
                                        - 20 ms

                                    d) frame error rate (FER)
                                        - 0.1¡B0.05¡B0.025
¡B0.01

                                    e) segment SNR (SSNR)
                                       - segment length : 16 ms

                   Experiment results :

                                    a) SNR & SSNR at different data rate

Data Rate  16 k 24 k 32 k 40 k
SNR 16.55715 21.63328 22.68998 23.68455
SSNR 18.44427 25.3381 31.42663 34.93507

                                                            

¡@                                           

¡@

¡@

¡@

¡@

¡@

¡@

¡@

¡@

¡@

¡@

¡@

                                   b) Silence Substitution

¡@

¡@

¡@

¡@

¡@

¡@

¡@

                                                    original decoded speech

                                                     reconstructed speech

¡@

                                    c) Fixed Frame Repeat

                                                       original speech

                                                       reconstructed speech

¡@

                                      d) State Reserving

                                                        original speech

                                                         reconstructed speech

¡@

                                       e) Encoder and Decoder Reset Simultaneously

 

                                                         original speech

                                                        reconstructed speech

¡@

                                        e) SSNR of Experiments

FER 0 0.01 0.025 0.05 0.1
Silence Substitution 31.32092 29.34896 25.77643 19.43691 11.23208
Fixed Frame Repeat 31.32092 29.38698 26.10405 20.15194 13.22028
State Reserving 31.32092 29.96891 27.2553 22.87107 15.63003
Encoder & Decoder Reset simultaneously 25.20946 24.80734 24.12372 22.92385 20.95093

¡@

¡@

 Conclusions :

       Silence substitution is the simplest method to reconstruct the corrupted speech. And, the performance of state reserving method is better than others. In high FER the performance of encoder & decoder reset simultaneously is better than others and avoid error propagation efficiently.

¡@Back