MPEG-4 Video Standard Rate Control
1. Introduction
2. Functionality of MPEG-4 Video
3. Rate control method
4. Simulation result
5. Conclusion
6. Future work
7. VQ slides
1. Introduction
1.1 MPEG-4 AV Object
Multimedia scene composed of "objects"
- Natural and SNHC (Synthetic/Natural Hybrid Coding) audio/video
- 2-D and 3-D graphics
- Animated faces, text
- Arbitrary or rectangular shapes
Each object is encoded independently
The coding scheme can differ for individual objects
From low bit rates to (virtually) lossless quality
1.2 Features of MPEG-4
Error Resilience
- Access on mobile networks (efficient coding and low bit rates are still important)
Scalable Coding
- Scalability based on (audio/visual) objects
- Different quality, priority, and error protection possible for different objects
Intellectual Property Rights (IPR)
- Identification (V.1) and protection (V.2)
1.3 MPEG-4 vs. other video standards
Existing standards
- H.261: frame-based, low bit rates
- H.263: frame-based, very low bit rates
- MPEG-2: frame-based, high bit rates
MPEG-4
- Object-based
- From very low bit rates to very high bit rates
- Both natural and synthetic
- Interactive at object level
1.4 Data Structure in Visual Part of MPEG-4
1.5 MPEG-4 Video Decoding
1.6 MPEG-4 Video Tools
2. Functionality of MPEG-4 Video
Content-based interactivity
Coding efficiency parameters
- Bit rates: 5 kbit/s to 5 Mbit/s
- Resolution: small to DTV
- Progressive/interlaced
Error resilience/robustness
- Mobile environments
Scalability
- Spatial and temporal
2.1 Scalability
Object scalability
- Achieved by the data structure used and the shape coding
Temporal scalability
- Achieved by the generalized scalability mechanism
Spatial scalability
- Achieved by the generalized scalability mechanism
2.2 Scalable Coding General Scheme
2.3 Temporal scalability
Temporal scalability is achievable for both rectangular frames and arbitrarily shaped VOPs
The base layer is encoded as conventional MPEG-4 video
The enhancement layer is encoded using one of the following two mechanisms:
- Type 1
- Type 2
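The base/enhancement split can be illustrated with a minimal sketch. It assumes the simplest possible layering, in which every second frame goes to the base layer and the remaining frames form the enhancement layer; the function names are hypothetical and no real MPEG-4 bit-stream syntax is involved.

```python
def split_temporal_layers(frames, base_step=2):
    """Split a frame sequence into a base layer and an enhancement layer.

    Every `base_step`-th frame goes to the base layer; the rest go to the
    enhancement layer. Each entry keeps its original time index so that a
    decoder can interleave the layers again.
    """
    base = [(t, f) for t, f in enumerate(frames) if t % base_step == 0]
    enhancement = [(t, f) for t, f in enumerate(frames) if t % base_step != 0]
    return base, enhancement


def merge_temporal_layers(base, enhancement=None):
    """Rebuild display order from the base layer and, optionally, the enhancement layer."""
    entries = list(base) + list(enhancement or [])
    return [f for _, f in sorted(entries, key=lambda e: e[0])]


frames = [f"vop{t}" for t in range(6)]
base, enh = split_temporal_layers(frames)
low_rate = merge_temporal_layers(base)        # base layer only: half the frame rate
full_rate = merge_temporal_layers(base, enh)  # both layers: full frame rate
```

A decoder that receives only the base layer still produces a valid, lower frame-rate sequence; adding the enhancement layer restores the full frame rate.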
2.3.1 Temporal Enhancement Type
2.3.2 Temporal scalability: Type 1
Only a portion of the VOP in the base layer is enhanced
2.3.3 Temporal scalability: Type 2
The entire VOP in the base layer is enhanced
2.4 Spatial scalability
The base layer is coded as conventional MPEG-4 video
The enhancement layer is encoded using prediction mechanisms from the base layer
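Prediction from the base layer can be sketched as follows. This is a toy illustration, not the MPEG-4 algorithm: it assumes 2x2 averaging for downsampling and pixel repetition for upsampling, and it leaves the residual unquantized, so the two layers together reconstruct the frame exactly. All function names are hypothetical.

```python
def downsample(frame):
    """Halve the resolution by averaging each 2x2 block (even dimensions assumed)."""
    return [
        [
            (frame[y][x] + frame[y][x + 1] + frame[y + 1][x] + frame[y + 1][x + 1]) // 4
            for x in range(0, len(frame[0]), 2)
        ]
        for y in range(0, len(frame), 2)
    ]


def upsample(frame):
    """Double the resolution by pixel repetition (stand-in for a real interpolation filter)."""
    out = []
    for row in frame:
        wide = [p for p in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


def encode_spatial_layers(frame):
    """Return the low-resolution base layer and the enhancement residual."""
    base = downsample(frame)
    prediction = upsample(base)  # prediction of the full-size frame from the base layer
    residual = [[o - p for o, p in zip(orow, prow)]
                for orow, prow in zip(frame, prediction)]
    return base, residual


def decode_spatial_layers(base, residual=None):
    """Decode the base layer alone, or the base layer plus the enhancement residual."""
    if residual is None:
        return base
    prediction = upsample(base)
    return [[p + r for p, r in zip(prow, rrow)]
            for prow, rrow in zip(prediction, residual)]


frame = [
    [10, 12, 20, 22],
    [14, 16, 24, 26],
    [30, 32, 40, 42],
    [34, 36, 44, 46],
]
base, residual = encode_spatial_layers(frame)
```

In a real codec both layers would be transformed and quantized, so the enhancement layer would refine the picture rather than restore it exactly.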
3. Rate control method
Coding scalability
- Object scalability
- Spatial / temporal scalability
- SNR scalability
SNR scalability
- Set the target bit rate → decide the quantization level
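The step from a target bit rate to a quantization level can be sketched with a simple feedback loop. This is not the VM rate control algorithm; it assumes a hypothetical rate model in which the bits per frame fall as 1/QP, a QP range of 1 to 31, and a 10 % tolerance band around the target. All names and numbers are illustrative.

```python
def next_qp(qp, bits_used, target_bits, qp_min=1, qp_max=31, max_step=2):
    """Choose the quantizer for the next frame from the bits spent on the previous one."""
    if bits_used > 1.1 * target_bits:
        qp += max_step   # over budget: quantize more coarsely
    elif bits_used < 0.9 * target_bits:
        qp -= max_step   # under budget: quantize more finely
    return max(qp_min, min(qp_max, qp))


def toy_encoder_bits(complexity, qp):
    """Hypothetical rate model: bits per frame fall roughly as 1/QP."""
    return complexity / qp


target_bits = 64000 / 10   # assumed 64 kbit/s at 10 frames/s
qp = 31
history = []
for _ in range(30):
    bits = toy_encoder_bits(complexity=60000, qp=qp)
    history.append((qp, bits))
    qp = next_qp(qp, bits, target_bits)
```

With these assumed numbers the loop lowers QP in steps of 2 until the bits per frame fall inside the tolerance band, then holds that value.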
3.1 Quantization
Method 1: Similar to that of H.263
Method 2: Similar to that of MPEG-2
Optimized non-linear quantization of DC coefficients
Quantization matrices and loading mechanism
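The difference between the two methods can be sketched as follows. This is a simplified illustration: the first quantizer assumes the H.263-style reconstruction rule |rec| = QP(2|level| + 1), reduced by 1 when QP is even, and the second assumes an MPEG-2-style per-coefficient weight where a weight of 16 leaves the step size at 2·QP. The special DC handling and the exact rounding of the real standards are omitted, and the function names are hypothetical.

```python
def quantize_h263(coef, qp, intra=True):
    """Uniform quantizer with step 2*QP; inter blocks get a dead zone of QP/2."""
    mag = abs(coef)
    if intra:
        level = mag // (2 * qp)
    else:
        level = max(0, mag - qp // 2) // (2 * qp)
    return level if coef >= 0 else -level


def dequantize_h263(level, qp):
    """Reconstruct a coefficient: QP*(2*|level|+1), minus 1 when QP is even."""
    if level == 0:
        return 0
    mag = qp * (2 * abs(level) + 1)
    if qp % 2 == 0:
        mag -= 1
    return mag if level > 0 else -mag


def quantize_weighted(coef, qp, weight):
    """Matrix-based quantizer: the step 2*QP is scaled by weight/16, with rounding."""
    step16 = 2 * qp * weight                       # step size multiplied by 16
    level = (16 * abs(coef) + step16 // 2) // step16
    return level if coef >= 0 else -level


def dequantize_weighted(level, qp, weight):
    """Reconstruct a coefficient as (2 * |level| * weight * QP) / 16."""
    mag = (2 * abs(level) * weight * qp) // 16
    return mag if level >= 0 else -mag


# A larger matrix weight gives a coarser step for that frequency position.
fine = dequantize_weighted(quantize_weighted(100, 4, 16), 4, 16)
coarse = dequantize_weighted(quantize_weighted(100, 4, 32), 4, 32)
```

The weight would normally come from an 8x8 quantization matrix indexed by frequency position, which is what the loading mechanism lets the encoder replace.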
3.2 Configuration parameters for rate control
General parameters
- Type of control (fixed QP, VM4, VM5+, ...)
- Control algorithm (global or individual, different algorithm in each case)
- Statistical models for rate and distortion
Parameters for global control
- Global bit rate
- Buffer size
Parameters for individual control
- Bit rate per VO
- Buffer size for each VO
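The parameter groups above could be collected in a structure like the following sketch. The field names and values are hypothetical and do not correspond to the actual keys of the encoder's .ctl or .cfg files; the sketch only shows how the global and individual settings relate.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class VORateParams:
    bit_rate: int       # bits per second for this VO
    buffer_size: int    # buffer size in bits for this VO


@dataclass
class RateControlConfig:
    control_type: str = "fixed_qp"             # e.g. "fixed_qp", "vm4", "vm5+"
    mode: str = "global"                       # "global" or "individual"
    global_bit_rate: Optional[int] = None      # used when mode == "global"
    global_buffer_size: Optional[int] = None   # used when mode == "global"
    per_vo: Dict[int, VORateParams] = field(default_factory=dict)

    def validate(self):
        """Check that the parameters required by the chosen mode are present."""
        if self.mode == "global":
            if self.global_bit_rate is None or self.global_buffer_size is None:
                raise ValueError("global control needs a bit rate and a buffer size")
        elif self.mode == "individual":
            if not self.per_vo:
                raise ValueError("individual control needs parameters for each VO")
        else:
            raise ValueError(f"unknown mode: {self.mode}")
        return self


# Illustrative values only.
cfg = RateControlConfig(
    control_type="vm5+",
    mode="global",
    global_bit_rate=64000,
    global_buffer_size=32000,
).validate()
```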
3.3 Selection of type
Fixed QP
- For each type of VOP (I, P, B): no rate control
VM4
- Primitive version of rate control; works independently for every VOP
VM5+
- All other algorithms (for either global or independent control)
TM5
- MPEG-2 algorithm
4. Simulation result
4.1 Simulation Environments
Program: Momusys Version 2.0
Test sequence: Akiyo_qcif.yuv
150 frames
4.2 Simulation Result
Global: target bit rate = 2 × buffer size
5. Conclusion
MPEG-4 is smarter and more flexible
- Content-based interactivity
- Bit-stream editing
- Scene authoring
More functionality
- For error-prone environments
- For rate control
6. Future work
Keep studying MPEG-4 and its software
Understand all the parameters in the .ctl and .cfg files
Encode multiple objects into a visual scene
Become familiar with scalable coding technology and rate control methods
Discuss the issues of delivering MPEG-4 bit-streams over the Internet