Fusing disparate object signatures for salient object detection in video

Zhigang Tu, Zuwei Guo, Wei Xie, Mengjia Yan, Junsong Yuan, R.C. Veltkamp, Baoxin Li

    Research output: Contribution to journalArticleAcademicpeer-review

    Abstract

    We present a novel spatiotemporal saliency model for object detection in videos. In contrast to previous methods focusing on exploiting or incorporating different saliency cues, the proposed method aims to use object signatures which can be identified by any kinds of object segmentation methods. We integrate two distinctive saliency maps, which are respectively computed from object proposals of an appearance-dominated method and a motion-dominated algorithm, to obtain a refined spatiotemporal saliency maps. This enables the method to achieve good robustness and precision in identifying salient objects in videos under various challenging conditions. First, an improved appearance-based and a modified motion-based segmentation approaches are separately utilized to extract two kinds of candidate foreground objects. Second, with these captured object signatures, we design a new approach to filter the extracted noisy object pixels and label foreground superpixels in each object signature channel. Third, we introduce a foreground connectivity saliency measure to compute two types of saliency maps, from which an adaptive fusion strategy is exploited to obtain the final spatiotemporal saliency maps for salient object detection in a video. Both quantitative and qualitative experiments on several challenging video benchmarks demonstrate that the proposed method outperforms existing state-of-the-art approaches.
    Original languageEnglish
    Pages (from-to)285-299
    Number of pages15
    JournalPattern Recognition
    Volume72
    DOIs
    Publication statusPublished - Dec 2017

    Keywords

    • Spatiotemporal saliency computation
    • Salient video object detection
    • Object signatures
    • Fusion

    Fingerprint

    Dive into the research topics of 'Fusing disparate object signatures for salient object detection in video'. Together they form a unique fingerprint.

    Cite this