Download A Fast MB Mode Decision Algorithm for MPEG-2 to H.264 P

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
A FAST MB MODE DECISION
ALGORITHM FOR MPEG-2 TO
H.264 P-FRAME TRANSCODING
PEDRO CUENCA, MEMBER, IEEE,
LUIS OROZCO-BARBOSA, MEMBER, IEEE,
GERARDO FERNÁNDEZ-ESCRIBANO,
ANTONIO GARRIDO,
HARI KALVA
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS
FOR VIDEO TECHNOLOGY, 2008
Outline
2




Introduction
Fast MB Mode Decision Using Machine Learning
Performance Evaluation
Conclusion
Introduction
1/3
3

Motivation:
make transcoding from MPEG-2 to H.264 seamless.

Hypothesis:
the MB mode decision in H.264 have a correlation with the
distribution of the motion compensated residual in MPEG-2
video.
Introduction
2/3
4
the H.264 MB mode computation
problem is posed as a data
classification problem.

the MPEG-2 MB coding mode
and residual have to be classified
into one of the several H.264
coding modes.

Fig. 1. Relationship between MPEG-2 MB
residual and H.264 MB coding mode.
Introduction
3/3
5

Method:
use machine learning tools to exploit the correlation and
construct decision trees to classify the MPEG-2 MBs into one of
the coding modes in H.264.
6
Fast MB Mode Decision
Using Machine Learning
1/14
Fig. 2. Process for building decision trees for
MPEG-2 to H.264 transcoding.
7
Fast MB Mode Decision
Using Machine Learning
2/14

WEKA data mining tool :
machine learning software written
in Java and supports several
standard data mining tasks.

the J48 algorithm:
implemented in the WEKA data mining tool was used to
create the WEKA decision trees.
 the J48 algorithm is an implementation of the C4.5
algorithm which widely used as a reference for building
decision trees.

8
Fast MB Mode Decision
Using Machine Learning
3/14

Attribute-Relation File Format (ARFF):
The file used by the WEKA data mining program, contain the
existing relationship between a set of attributes.
An ARFF file has two sections:
(1) header: contains the name of the relation, the attributes
and their types.
(2) section: containing the data.
9
Fast MB Mode Decision
Using Machine Learning
4/14

Training sets:

the MPEG-2 sequences encoded at high quality since no
B-frames have been used.
 use
H.264 encoder with a QP of 25 and the R-D
optimization enable.

Goal:
develop a single, generalized, decision tree to be used for the
MPEG-2 to H.264 transcoding process. It’s found that Flower
sequence was good for a large number of videos.
10
Fast MB Mode Decision
Using Machine Learning
5/14

The Decision Tree for the proposed transcoder is a hierarchical
decision tree consisting of three different WEKA trees.
Fig. 3. Decision tree.
11
Fast MB Mode Decision
Using Machine Learning
6/14
A. Creating the Training Files



mean and variance of each one
of the 4x4 residual subblocks.
MB mode in MPEG-2.
coded block pattern (CBPC) used
in MPEG-2.
12
Fast MB Mode Decision
Using Machine Learning
7/14
B. Decision Tree
decision tree
Works as follow
Node 1
Input: MPEG-2 MB information.
Output: First level decision that classifies the MB as Skip, Intra, Inter8x8 or Inter-16x16.
Rule:
MPEG-2 MB mode
H.264 MB mode
MC not coded
Inter-16x16
intra
Intra or Inter-8X8
skip
skip
13
Fast MB Mode Decision
Using Machine Learning
8/14
B. Decision Tree
decision tree
Node 2
Works as follow
Input: 16x16 MBs classified by the Node 1.
Output: 16x16 submode decision used for coding the MB into
16x16, 16x8 or 8x16.
Rule: This tree examines if there are continuous 16x8 or 8x16
subblocks that might result in a better prediction.
14
Fast MB Mode Decision
Using Machine Learning
9/14
B. Decision Tree
decision tree
Node 3
Works as follow
Input: The MBs classified by Node 1 as 8x8.
Output: 8x8 submode decision used for coding the MB into
8x8, 8x4, 4x8 or 4x4.
Rule:
(1)Evaluates only the H.264 8x8 modes using the third WEKA tree
and selects the best option.
(2)This node is different from the others since this one only uses four
means and four variances to make the decision.
15
Fast MB Mode Decision
Using Machine Learning
10/14
B. Decision Tree
decision tree
Node 4
Works as follow
Input:
(1) skip-mode MBs in the MPEG-2 bit stream classified by Node 1
(2) the 16x16 MBs classified by Node 2
Output: Select skip or inter-16x16.
Rule: Evaluates only the H.264 16x16 mode (without the submodes
16x8 or 8x16). Then, the node selects the best option.
16
Fast MB Mode Decision
Using Machine Learning
11/14


MB mode decision and threshold used in the
decision tree depend on the QP used in the
H.264 encoding stage.
The mean and variance threshold will have to
be different at each QP.
17
Fast MB Mode Decision
Using Machine Learning
12/14
Solution(1):
method: Develop the decision trees for each QP and use the
appropriate decision tree depending on the QP
selected.
drawback: It's complex since implies to switch between 52
different decision trees resulting in 156 WEKA
trees for a transcoder.
18
Fast MB Mode Decision
Using Machine Learning
13/14
Solution(2):
method:
 Develop a single decision tree and adjust the mean and
variance threshold used by the trees based on the QP of
25.

For QP values higher than 25, the thresholds are
decreased and for QP values lower than 25 thresholds
are oportionally increased. The threshold are adjusted by
2.5% for a change in QP of 1.
19
Fast MB Mode Decision
Using Machine Learning
14/14
Fig. 2. Process for building decision trees
for MPEG-2 to H.264 transcoding
.
.
Fig. 4. Proposed transcoder
Performance Evaluation
1/8
20
Input:
(1) A high quality MPEG-2 video.
(2) QP ranging from 5 up to 45 in steps of 5.
(3) The size of the GOP is 12 frames;where the first
frame was I-frame, and the rest of the frames were
P-frames.
(4) The rate control and CABAC algorithms were disabled for
all the simulations.
(5) The number of reference in P-frames was set to 1.
(6) The motion search range was set to 16 pels with a MV
resolution of 1/4 pel.
Performance Evaluation
2/8
21
Fig. 6. MB mode decisions generated by the proposed algorithm for the first
P-frame in the Ayersroc, Paris, and Foreman sequence.
Full estimation
of H.264
Proposed
algorithm
Performance Evaluation
3/8
22
RD-results:
R-D-cost without FME option
or R-D-cost with FME option
Test sequence:
Martin, Ayersroc, Paries,
Tempete, News, Foreman
Fromat:
CCIR, CIF, QCIF
Performance Evaluation
4/8
23
Performance Evaluation
5/8
24
RD-results:
SAE-cost without FME option
or SAE-cost with FME option
Performance Evaluation
6/8
25
Performance Evaluation
7/8
26
Reference
transcoder
Proposed
transcoder
WIN
Performance Evaluation
8/8
27
Conclusion
28


The proposed algorithm uses machine learning
techniques to develop decision tree decide MPEG-2
to H.264 coding mode, considerably reducing the
computational complexity .
It can be applied to develop other transcoders as
well.