• KSII Transactions on Internet and Information Systems
    Monthly Online Journal (eISSN: 1976-7277)

Dual-stream Co-enhanced Network for Unsupervised Video Object Segmentation


Abstract

Unsupervised Video Object Segmentation (UVOS) is a highly challenging problem in computer vision as the annotation of the target object in the testing video is unknown at all. The main difficulty is to effectively handle the complicated and changeable motion state of the target object and the confusion of similar background objects in video sequence. In this paper, we propose a novel deep Dual-stream Co-enhanced Network (DC-Net) for UVOS via bidirectional motion cues refinement and multi-level feature aggregation, which can fully take advantage of motion cues and effectively integrate different level features to produce high-quality segmentation mask. DC-Net is a dual-stream architecture where the two streams are co-enhanced by each other. One is a motion stream with a Motion-cues Refine Module (MRM), which learns from bidirectional optical flow images and produces fine-grained and complete distinctive motion saliency map, and the other is an appearance stream with a Multi-level Feature Aggregation Module (MFAM) and a Context Attention Module (CAM) which are designed to integrate the different level features effectively. Specifically, the motion saliency map obtained by the motion stream is fused with each stage of the decoder in the appearance stream to improve the segmentation, and in turn the segmentation loss in the appearance stream feeds back into the motion stream to enhance the motion refinement. Experimental results on three datasets (Davis2016, VideoSD, SegTrack-v2) demonstrate that DC-Net has achieved comparable results with some state-of-the-art methods.


Statistics

Show / Hide Statistics

Statistics (Cumulative Counts from December 1st, 2015)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article

[IEEE Style]
H. Zhu, H. Yin, Y. Liu, N. Chen, "Dual-stream Co-enhanced Network for Unsupervised Video Object Segmentation," KSII Transactions on Internet and Information Systems, vol. 18, no. 4, pp. 938-958, 2024. DOI: 10.3837/tiis.2024.04.007.

[ACM Style]
Hongliang Zhu, Hui Yin, Yanting Liu, and Ning Chen. 2024. Dual-stream Co-enhanced Network for Unsupervised Video Object Segmentation. KSII Transactions on Internet and Information Systems, 18, 4, (2024), 938-958. DOI: 10.3837/tiis.2024.04.007.

[BibTeX Style]
@article{tiis:90792, title="Dual-stream Co-enhanced Network for Unsupervised Video Object Segmentation", author="Hongliang Zhu and Hui Yin and Yanting Liu and Ning Chen and ", journal="KSII Transactions on Internet and Information Systems", DOI={10.3837/tiis.2024.04.007}, volume={18}, number={4}, year="2024", month={April}, pages={938-958}}