About | Contact Us | Register | Login
ProceedingsSeriesJournalsSearchEAI
2nd International ICST Conference on Mobile Multimedia Communications

Research Article

Priority coding for video-telephony applications based on visual attention

Cite
BibTeX Plain Text
  • @INPROCEEDINGS{10.1145/1374296.1374329,
        author={Nicolas Tsapatsoulis and Konstantinos Rapantzikos and Yannis Avrithis},
        title={Priority coding for video-telephony applications based on visual attention},
        proceedings={2nd International ICST Conference on Mobile Multimedia Communications},
        publisher={ACM},
        proceedings_a={MOBIMEDIA},
        year={2006},
        month={9},
        keywords={visual attention perceptional video coding saliency map video telephony},
        doi={10.1145/1374296.1374329}
    }
    
  • Nicolas Tsapatsoulis
    Konstantinos Rapantzikos
    Yannis Avrithis
    Year: 2006
    Priority coding for video-telephony applications based on visual attention
    MOBIMEDIA
    ACM
    DOI: 10.1145/1374296.1374329
Nicolas Tsapatsoulis1,*, Konstantinos Rapantzikos2,*, Yannis Avrithis3,*
  • 1: University of Cyprus, Cyprus
  • 2: National Technical University of Athens, Greece
  • 3: Technical University of Athens, Greece
*Contact email: nicolast@ucy.ac.cy, rap@image.ntua.gr, iavr@image.ntua.gr

Abstract

In this paper we investigate the utilization of visual saliency maps for ROI-based video coding of video-telephony applications. Visually salient areas indicated in the saliency map are considered as ROIs. These areas are automatically detected using an algorithm for visual attention (VA) which builds on the bottom-up approach proposed by Itti et al. A top-down channel emulating the visual search for human faces performed by humans has been added, while orientation, intensity and color conspicuity maps are computed within a unified multi-resolution framework based on wavelet subband analysis. Priority encoding, for experimentation purposes, is utilized in a simple manner: Frame areas outside the priority regions are blurred using a smoothing filter and then passed to the video encoder. This leads to better compression of both Intra-coded (I) frames (more DCT coefficients are zeroed in the DCT-quantization step) and Inter coded (P, B) frames (lower prediction error). In more sophisticated approaches, priority encoding could be incorporated by varying the quality factor of the DCT quantization table. Extended experiments concerning both static images as well as low-quality video show the compression efficiency of the proposed method. The comparisons are made against standard JPEG and MPEG-1 encoding respectively.

Keywords
visual attention perceptional video coding saliency map video telephony
Published
2006-09-20
Publisher
ACM
http://dx.doi.org/10.1145/1374296.1374329
Copyright © 2006–2025 ACM
EBSCOProQuestDBLPDOAJPortico
EAI Logo

About EAI

  • Who We Are
  • Leadership
  • Research Areas
  • Partners
  • Media Center

Community

  • Membership
  • Conference
  • Recognition
  • Sponsor Us

Publish with EAI

  • Publishing
  • Journals
  • Proceedings
  • Books
  • EUDL