Quality Feature Learning via Multi-Channel CNN and GRU for No-Reference Video Quality Assessment

Ngai Wing Kwong, Yui Lam Chan, Sik Ho Tsang, Daniel Pak Kong Lun

Research output: Journal article publicationJournal articleAcademic researchpeer-review

6 Citations (Scopus)

Abstract

Nowadays, video quality assessment (VQA) plays a vital role in video-related industries to predict human perceived video quality to maintain the quality of service. Although many deep neural network-based VQA methods have been proposed, the robustness and performance are limited by small scale of available human-label data. Recently, some transfer learning-based methods and pre-trained models in other domains have been adopted in VQA to compensate for the lack of enormous training samples. However, they result in a domain gap between the source and target domains, which provides sub-optimal feature representation for VQA tasks and deteriorates the accuracy. Therefore, in the paper, we propose quality feature learning via a multi-channel convolutional neural network (CNN) with a gated recurrent unit (GRU), taking into account both the motion-aware information and human visual perception (HVP) characteristics to solve the above issue for no-reference VQA. First, inspired by self-supervised learning (SSL), the multi-channel CNN is pre-trained on the image quality assessment (IQA) domain without using human annotation labels. Then, semi-supervised learning is applied on top of the pre-trained multi-channel CNN to fine-tune the model to transfer the domain from IQA to VQA while considering motion-aware information for better frame-level quality feature representation. After that, several HVP features are extracted with frame-level quality feature representation as the input of the GRU model to obtain the final precise predicted video quality. Finally, the experimental results demonstrate the robustness and validity of our model, which is superior to the state-of-the-art approaches and is closely related to human perception.

Original languageEnglish
Article number10076422
Pages (from-to)28060-28075
Number of pages16
JournalIEEE Access
Volume11
DOIs
Publication statusPublished - 2023

Keywords

  • Fine-tuning strategy
  • gated recurrent unit
  • human visual perception
  • motion-aware information
  • multi-channel convolutional neural network
  • no reference video quality assessment
  • self-supervised learning
  • semi-supervised learning

ASJC Scopus subject areas

  • General Computer Science
  • General Materials Science
  • General Engineering
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Quality Feature Learning via Multi-Channel CNN and GRU for No-Reference Video Quality Assessment'. Together they form a unique fingerprint.

Cite this