KSII Transactions on Internet and Information Systems
Monthly Online Journal (eISSN: 1976-7277)

Geometric and Semantic Improvement for Unbiased Scene Graph Generation

Vol. 17, No. 10, October 31, 2023
DOI: 10.3837/tiis.2023.10.003

Abstract

Scene graphs are structured representations that clearly convey objects and the relationships between them, but they are often heavily biased because the relation labels in the dataset follow a highly skewed, long-tailed distribution. Indeed, the visual world itself and its descriptions are biased. Unbiased Scene Graph Generation (USGG) therefore trains models to eliminate long-tail effects as much as possible, rather than altering the dataset directly. To this end, we propose Geometric and Semantic Improvement (GSI) for USGG. First, to fully exploit the feature information in the images, we design enhancement modules for the geometric and semantic dimensions. The geometric module is based on the observation that the position information of neighboring object pairs influences one another, and it improves the overall recall of relationships in the dataset. The semantic module further processes the embedded word vectors to enhance the acquisition of semantic information. Then, to improve the recall of the tail data, we design the Class Balanced Seesaw Loss (CBSLoss), which improves prediction recall by penalizing body or tail relations that are judged incorrectly. The experimental findings demonstrate that GSI performs better than mainstream models on the mean Recall@K (mR@K) metric in three tasks, and that it addresses the long-tailed imbalance in the Visual Genome 150 (VG150) dataset better than most existing methods.
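To illustrate why mR@K is the metric of interest for long-tailed data, the following is a minimal sketch of how a mean Recall@K score could be computed. It is not the authors' evaluation code: the function name `mean_recall_at_k`, the triplet format, and the toy data are assumptions made for illustration, and matching is simplified to exact label agreement (real scene graph evaluation also checks bounding-box overlap).

```python
from collections import defaultdict


def mean_recall_at_k(gt_per_image, pred_per_image, k):
    """Simplified mean Recall@K (hypothetical helper, not from the paper).

    gt_per_image:   list of sets of (subject, predicate, object) triplets.
    pred_per_image: list of lists of triplets, sorted by descending confidence.
    Recall is computed per predicate class for each image, averaged over
    images for each class, then averaged over classes.
    """
    per_predicate_recalls = defaultdict(list)
    for gt, preds in zip(gt_per_image, pred_per_image):
        top_k = set(preds[:k])
        hits = defaultdict(int)
        totals = defaultdict(int)
        for triplet in gt:
            predicate = triplet[1]
            totals[predicate] += 1
            if triplet in top_k:
                hits[predicate] += 1
        for predicate, total in totals.items():
            per_predicate_recalls[predicate].append(hits[predicate] / total)
    if not per_predicate_recalls:
        return 0.0
    class_means = [sum(v) / len(v) for v in per_predicate_recalls.values()]
    return sum(class_means) / len(class_means)


# Toy example: "on" is a frequent (head) predicate, "wearing" a rarer one.
gt_per_image = [{
    ("man", "on", "horse"),
    ("horse", "on", "grass"),
    ("man", "wearing", "hat"),
}]
pred_per_image = [[
    ("man", "on", "horse"),
    ("man", "on", "hat"),      # the rare predicate is missed
    ("horse", "on", "grass"),
]]

print(mean_recall_at_k(gt_per_image, pred_per_image, k=3))  # 0.5
```

In this toy case, plain Recall@3 would be 2/3 because two of the three ground-truth triplets are recovered, whereas the per-class average is 0.5: the missed "wearing" relation counts as much as the well-predicted "on" class. This is why a model biased toward head predicates scores lower on mR@K.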


Cite this article

[IEEE Style]
R. Zhang, P. Xu, K. Kang, and Y. Yang, "Geometric and Semantic Improvement for Unbiased Scene Graph Generation," KSII Transactions on Internet and Information Systems, vol. 17, no. 10, pp. 2643-2657, 2023. DOI: 10.3837/tiis.2023.10.003.

[ACM Style]
Ruhui Zhang, Pengcheng Xu, Kang Kang, and You Yang. 2023. Geometric and Semantic Improvement for Unbiased Scene Graph Generation. KSII Transactions on Internet and Information Systems, 17, 10, (2023), 2643-2657. DOI: 10.3837/tiis.2023.10.003.

[BibTeX Style]
@article{tiis:56202,
  title={Geometric and Semantic Improvement for Unbiased Scene Graph Generation},
  author={Ruhui Zhang and Pengcheng Xu and Kang Kang and You Yang},
  journal={KSII Transactions on Internet and Information Systems},
  doi={10.3837/tiis.2023.10.003},
  volume={17},
  number={10},
  year={2023},
  month={October},
  pages={2643--2657}
}