Difference between revisions of "JEG no-reference hybrid HEVC"
m (→Database) |
m (→Database) |
||
Line 19: | Line 19: | ||
[[File:Db_hevc_src02.jpg]] | [[File:Db_hevc_src02.jpg]] | ||
[[File:Db_hevc_src03.jpg]] | [[File:Db_hevc_src03.jpg]] | ||
+ | [[File:Db_hevc_src04.jpg]] | ||
+ | [[File:Db_hevc_src05.jpg]] | ||
+ | [[File:Db_hevc_src06.jpg]] | ||
+ | [[File:Db_hevc_src07.jpg]] | ||
+ | [[File:Db_hevc_src08.jpg]] | ||
+ | [[File:Db_hevc_src09.jpg]] | ||
+ | [[File:Db_hevc_src10.jpg]] | ||
===Key to file naming=== | ===Key to file naming=== |
Revision as of 12:08, 17 September 2014
JEG no-reference hybrid HEVC
JEG is currently developing a database of HEVC-encoded video sequences to eventually develop a Hybrid No Reference metric suited for HEVC.
Contents |
Main idea
Project Setup
Database
Link: ftp://ftp.ivc.polytech.univ-nantes.fr/VQEG/JEG/HYBRID/hevc_database/
Folders:
- ENC : compressed video sequences
- FR : available quality measures (see below)
- SRC : uncompressed original video sequences used for encoding and quality measures
Key to file naming
Example filename:
src01_960x544p25.yuv_LDGOP4_8000001_2_16_64_8_4.265
Convention
src<SEQN>_<WIDTH>x<HEIGHT>p25.yuv_<GOPTYPESIZE>_<RATECONTROL>_<REFRESH>_<INTRAPERIOD>_<SEARCHRANGE>_<BITDEPTH>_<SLICEARGUMENT>.265
Meaning of parameters (Refer to https://hevc.hhi.fraunhofer.de/svn/svn_HEVCSoftware/branches/HM-11.0-dev/doc/software-manual.pdf for further details on the encoder)
SEQN = sequence number, from 01 to 10
WIDTH = picture width, possibile values: 960, 1280, 1920
HEIGHT = picture height, possibile values: 544, 720, 1080
GOPTYPESIZE = GOP type and size, possibile values: GOP2, GOP4, GOP8, LDGOP4
GOP2 : GOP size 2 GOP4 : GOP size 4 GOP8 : GOP size 8 LDGOP4 : GOP size 4 in low-delay configuration
RATECONTROL = parameter for rate control, possibile values: 26, 32, 38, 46 or a number >= 500000 (ending with 0 or 1)
The number <= 46 corresponds to the -q encoder option The number >= 500000 corresponds to the --TargetBitrate encoder option The last digit (0 or 1) if number >= 500000 corresponds to the --LCULevelRateControl encoder option 26 : fixed QP = 26 32 : fixed QP = 32 38 : fixed QP = 38 46 : fixed QP = 46 500000 : rate control at the frame level, 500 kbit/s 500001 : rate control at the LCU level, 500 kbit/s 1000000 : rate control at the frame level, 1 Mbit/s 1000001 : rate control at the LCU level, 1 Mbit/s 2000000 : rate control at the frame level, 2 Mbit/s 2000001 : rate control at the LCU level, 2 Mbit/s 4000000 : rate control at the frame level, 4 Mbit/s 4000001 : rate control at the LCU level, 4 Mbit/s 8000000 : rate control at the frame level, 8 Mbit/s 8000001 : rate control at the LCU level, 8 Mbit/s 16000000 : rate control at the frame level, 16 Mbit/s 16000001 : rate control at the LCU level, 16 Mbit/s
REFRESH = intra refresh type, possible values: 1, 2
The number corresponds to the --DecodingRefreshType encoder option 1 : Applies a non-IDR clean random access point (open GOP). 2 : Applies an IDR random access point (closed GOP).
INTRAPERIOD = intra frame period, possible values: 8, 16, 32, 64
The number corresponds to the --IntraPeriod encoder option 8 : one intra every 8 16 : one intra every 16 32 : one intra every 32 64 : one intra every 64
SEARCHRANGE = search range for motion estimation (around the predictor), possibile values: only 64
The number corresponds to the --SearchRange encoder option
BITDEPTH = bit depth used for coding, possibile values: only 8
The number corresponds to the --InternalBitDepth encoder option
SLICEARGUMENT = specifies how slicing is performed for each picture, possibile values: 0, 2, 4, 1500
The number corresponds to the --SliceMode and --SliceArgument encoder options 0 : only one slice per frame (SliceMode = 0 and SliceArgument = 0) 2 : 2 slices per frame (max 270,120,75 CTU per slice when WIDTH=1920,1280,960) (SliceMode = 1 and SliceArgument = 270 if 1920, 120 if 1280, 75 if 960) 4 : 4 slices per frame (max 130,60,34 CTU per slice when WIDTH=1920,1280,960) (SliceMode = 1 and SliceArgument = 130 if 1920, 60 if 1280, 34 if 960) 1500 : each slice is maximum 1500 bytes (SliceMode = 2 and SliceArgument = 1500)
Current status
The database is composed of 59520 sequences (10 different contents at three different resolutions each, encoded with 1984 different encoding parameters).
For all 59520 HEVC sequences in the database, the following objective quality metrics are available:
- PSNR (as computed by the VQMT software)
- SSIM (as computed by the VQMT software)
- VIF (as computed by the VQMT software)
- VQM (as computed by the software available at NTIA/ITS )
- PVQM
The database is currently being extended by computing the previous measures on sequences which have loss impairments. 25 different loss patterns are being used.
Glenn is computing PSNR, SSIM, VIF on all sequences when they are subject to 25 different loss patterns.
Enrico is computing VQM, PVQM on the 960x544 encoded sequences when they are subject to 25 different loss patterns. As of Sep 17, 2014 seq01 and seq10 are available (ask by email using: masala at polito for the results).