Difference between revisions of "JEG no-reference hybrid HEVC"

From VQEG JEG Wiki
Jump to: navigation, search
m (Current status)
m
Line 101: Line 101:
 
     1500 : each slice is maximum 1500 bytes
 
     1500 : each slice is maximum 1500 bytes
 
         (SliceMode = 2 and SliceArgument = 1500)
 
         (SliceMode = 2 and SliceArgument = 1500)
 +
 +
==Downloads==
  
 
==Current status==
 
==Current status==

Revision as of 14:56, 10 October 2014

JEG no-reference hybrid HEVC

JEG is currently developing a database of HEVC-encoded video sequences to eventually develop a Hybrid No Reference metric suited for HEVC.

Contents

Main idea

Project Setup

Database

Link: ftp://ftp.ivc.polytech.univ-nantes.fr/VQEG/JEG/HYBRID/hevc_database/

Folders:

  • ENC : compressed video sequences
  • FR : available quality measures (see below)
  • SRC : uncompressed original video sequences used for encoding and quality measures

Sequences (from src01 to src10):

src01 src02 src03 src04 src05 src06 src07 src08 src09 src10

Key to compressed file naming

Example filename:

 src01_960x544p25.yuv_LDGOP4_8000001_2_16_64_8_4.265

Convention

 src<SEQN>_<WIDTH>x<HEIGHT>p25.yuv_<GOPTYPESIZE>_<RATECONTROL>_<REFRESH>_<INTRAPERIOD>_<SEARCHRANGE>_<BITDEPTH>_<SLICEARGUMENT>.265

Meaning of parameters (Refer to https://hevc.hhi.fraunhofer.de/svn/svn_HEVCSoftware/branches/HM-11.0-dev/doc/software-manual.pdf for further details on the encoder)

SEQN = sequence number, from 01 to 10

WIDTH = picture width, possibile values: 960, 1280, 1920

HEIGHT = picture height, possibile values: 544, 720, 1080

GOPTYPESIZE = GOP type and size, possibile values: GOP2, GOP4, GOP8, LDGOP4

 GOP2 : GOP size 2
 GOP4 : GOP size 4
 GOP8 : GOP size 8
 LDGOP4 : GOP size 4 in low-delay configuration

RATECONTROL = parameter for rate control, possibile values: 26, 32, 38, 46 or a number >= 500000 (ending with 0 or 1)

 The number <= 46 corresponds to the -q encoder option
 The number >= 500000 corresponds to the --TargetBitrate encoder option
 The last digit (0 or 1) if number >= 500000 corresponds to the --LCULevelRateControl encoder option
   26 : fixed QP = 26
   32 : fixed QP = 32
   38 : fixed QP = 38
   46 : fixed QP = 46
   500000 : rate control at the frame level, 500 kbit/s
   500001 : rate control at the LCU level, 500 kbit/s
   1000000 : rate control at the frame level, 1 Mbit/s
   1000001 : rate control at the LCU level, 1 Mbit/s
   2000000 : rate control at the frame level, 2 Mbit/s
   2000001 : rate control at the LCU level, 2 Mbit/s
   4000000 : rate control at the frame level, 4 Mbit/s
   4000001 : rate control at the LCU level, 4 Mbit/s
   8000000 : rate control at the frame level, 8 Mbit/s
   8000001 : rate control at the LCU level, 8 Mbit/s
   16000000 : rate control at the frame level, 16 Mbit/s
   16000001 : rate control at the LCU level, 16 Mbit/s

REFRESH = intra refresh type, possible values: 1, 2

 The number corresponds to the --DecodingRefreshType encoder option
   1 : Applies a non-IDR clean random access point (open GOP).
   2 : Applies an IDR random access point (closed GOP).

INTRAPERIOD = intra frame period, possible values: 8, 16, 32, 64

 The number corresponds to the --IntraPeriod encoder option
   8 : one intra every 8
   16 : one intra every 16
   32 : one intra every 32
   64 : one intra every 64

SEARCHRANGE = search range for motion estimation (around the predictor), possibile values: only 64

 The number corresponds to the --SearchRange encoder option

BITDEPTH = bit depth used for coding, possibile values: only 8

 The number corresponds to the --InternalBitDepth encoder option

SLICEARGUMENT = specifies how slicing is performed for each picture, possibile values: 0, 2, 4, 1500

 The number corresponds to the --SliceMode and --SliceArgument encoder options
   0 : only one slice per frame
       (SliceMode = 0 and SliceArgument = 0)
   2 : 2 slices per frame (max 270,120,75 CTU per slice when WIDTH=1920,1280,960)
       (SliceMode = 1 and SliceArgument =  270 if 1920, 120 if 1280, 75 if 960)
   4 : 4 slices per frame (max 130,60,34 CTU per slice when WIDTH=1920,1280,960)
       (SliceMode = 1 and SliceArgument =  130 if 1920, 60 if 1280, 34 if 960)
   1500 : each slice is maximum 1500 bytes
       (SliceMode = 2 and SliceArgument = 1500)

Downloads

Current status

The database is composed of 59520 sequences (10 different contents at three different resolutions each, encoded with 1984 different encoding parameters).

For all 59520 HEVC sequences in the database, the following objective quality metrics are available:

  • PSNR (as computed by the VQMT software)
  • SSIM (as computed by the VQMT software)
  • VIF (as computed by the VQMT software)
  • VQM (as computed by the software available at NTIA/ITS )
  • PVQM (as computed by the PVQM_EM software)

The database is currently being extended by computing the previous measures on sequences which have loss impairments. 25 different loss patterns are being used.

Glenn is computing PSNR, SSIM, VIF on all sequences when they are subject to 25 different loss patterns.

Enrico is computing VQM, PVQM on the 960x544 encoded sequences when they are subject to 25 different loss patterns. As of Sep 17, 2014 seq01 and seq10 are available (ask by email using: masala at polito for the results).