Robust interactive image segmentation via graph-based manifold ranking

Hong Li; Wen Wu; Enhua Wu

doi:10.1007/s41095-015-0024-2

Computational Visual Media 2015, 1(3): 183-195 https://doi.org/10.1007/s41095-015-0024-2

Research Article |

Open Access | Issue | Published: 06 November 2015

Robust interactive image segmentation via graph-based manifold ranking

Show Author's Information Hide Author's Information Hong Li^¹(

), Wen Wu^¹, Enhua Wu^{¹^,²}

1 Department of Computer and Information Science, University of Macau, Macau 999078, China.

2 Chinese Academy of Sciences, Beijing 100000, China.

Keywords:

interactive image segmentation, manifold ranking, graph structure, graph edge weights, relevance inference

Cite this article:

Li H, Wu W, Wu E. Robust interactive image segmentation via graph-based manifold ranking. Computational Visual Media, 2015, 1(3): 183-195. https://doi.org/10.1007/s41095-015-0024-2

Download citation

EndNote(RIS)

BibTeX

645

Views

Downloads

Citations

Crossref

N/A

WoS

Scopus

CSCD

Abstract Full text About this article

Abstract

Interactive image segmentation aims at classifying the image pixels into foreground and background classes given some foreground and background markers. In this paper, we propose a novel framework for interactive image segmentation that builds upon graph-based manifold ranking model, a graph-based semi-supervised learning technique which can learn very smooth functions with respect to the intrinsic structure revealed by the input data. The final segmentation results are improved by overcoming two core problems of graph construction in traditional models: graph structure and graph edge weights. The user provided scribbles are treated as the must-link and must-not-link constraints. Then we model the graph as an approximatively k-regular sparse graph by integrating these constraints and our extended neighboring spatial relationships into graph structure modeling. The content and labels driven locally adaptive kernel parameter is proposed to tackle the insufficiency of previous models which usually employ a unified kernel parameter. After the graph construction, a novel three-stage strategy is proposed to get the final segmentation results. Due to the sparsity and extended neighboring relationships of our constructed graph and usage of superpixels, our model can provide nearly real-time, user scribble insensitive segmentations which are two core demands in interactive image segmentation. Last but not least, our framework is very easy to be extended to multi-label segmentation, and for some less complicated scenarios, it can even get the segmented object through single line interaction. Experimental results and comparisons with other state-of-the-art methods demonstrate that our framework can efficiently and accurately extract foreground objects from background.

Full text

Abstract

Full text

Outline

About this article

Robust interactive image segmentation via graph-based manifold ranking

Show Author's information Hide Author's Information Hong Li^¹(

), Wen Wu^¹, Enhua Wu^{¹^,²}

1 Department of Computer and Information Science, University of Macau, Macau 999078, China.

2 Chinese Academy of Sciences, Beijing 100000, China.

Abstract

Keywords: interactive image segmentation, manifold ranking, graph structure, graph edge weights, relevance inference

References(31)

[1]

Hu S.-M.; Chen T.; Xu K.; Cheng M.-M.; Martin R. R. Internet visual media processing: A survey with graphics and vision applications. The Visual Computer Vol. 29, No. 5, 393-405, 2013.

DOI Google Scholar

[2]

Comaniciu D.; Meer P. Mean shift: A robust approach toward feature space analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 24, No. 5, 603-619, 2002.

DOI Google Scholar

[3]

Xiao C.; Liu M. Efficient mean-shift clustering using Gaussian KD-tree. Computer Graphics Forum Vol. 29, No. 7, 2065-2073, 2010.

DOI Google Scholar

[4]

Vese L. A.; Chan T. F. A multiphase level set framework for image segmentation using the Mumford and Shah model. International Journal of Computer Vision Vol. 50, No. 3, 271-293, 2002.

DOI Google Scholar

[5]

Li C.; Xu C.; Gui C.; Fox M. D. Distance regularized level set evolution and its application to image segmentation. IEEE Transactions on Image Processing Vol. 19, No. 12, 3243-3254, 2010.

DOI Google Scholar

[6]

Liu Y.; Yu Y. Interactive image segmentation based on level sets of probabilities. IEEE Transactions on Visualization and Computer Graphics Vol. 18, No. 2, 202-213, 2012.

DOI Google Scholar

[7]

Shi J.; Malik J. Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 22, No. 8, 888-905, 2000.

DOI Google Scholar

[8]

Huang H.; Zhang L.; Zhang H.-C. RepSnapping: Efficient image cutout for repeated scene elements. Computer Graphics Forum Vol. 30, No. 7, 2059-2066, 2011.

DOI Google Scholar

[9]

Eigen D.; Fergus R. Nonparametric image parsing using adaptive neighbor sets. In: Proceedings of 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2799-2806, 2012.

DOI

[10]

Boykov Y. Y.; Jolly M.-P. Interactive graph cuts for optimal boundary & region segmentation of objects in N–D images. In: Proceedings of the Eighth IEEE International Conference on Computer Vision, Vol. 1, 105-112, 2001.

[11]

Rother C.; Kolmogorov V.; Blake A. "GrabCut": Interactive foreground extraction using iterated graph cuts. ACM Transactions on Graphics Vol. 23, No. 3, 309-314, 2004.

DOI Google Scholar

[12]

Li Y.; Sun J.; Tang C.-K.; Shum H.-Y. Lazy snapping. ACM Transactions on Graphics Vol. 23, No. 3, 303-308, 2004.

DOI Google Scholar

[13]

Price B. L.; Morse B.; Cohen S. Geodesic graph cut for interactive image segmentation. In: Proceedings of the 2010 IEEE Conference on Computer Vision and Pattern Recognition, 3161-3168, 2010.

DOI

[14]

Gulshan V.; Rother C.; Criminisi A.; Blake A.; Zisserman A. Geodesic star convexity for interactive image segmentation. In: Proceedings of the 2010 IEEE Conference on Computer Vision and Pattern Recognition, 3129-3136, 2010.

DOI

[15]

Mortensen E. N.; Barrett W. A. Intelligent scissors for image composition. In: Proceedings of the 22nd Annual Conference on Computer Graphics and Interactive Techniques, 191-198, 1995.

DOI

[16]

Sundaramoorthi G.; Yezzi A.; Mennucci A. C. Coarse-to-fine segmentation and tracking using Sobolev active contours. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 30, No. 5, 851-864, 2008.

DOI Google Scholar

[17]

Nguyen T. N. A. N.; Cai J.; Zhang J.; Zheng J. Robust interactive image segmentation using convex active contours. IEEE Transactions on Image Processing Vol. 21, No. 8, 3734-3743, 2012.

DOI Google Scholar

[18]

Grady L. Random walks for image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 28, No. 11, 1768-1783, 2006.

DOI Google Scholar

[19]

Kim T. H.; Lee K. M.; Lee S. U. Generative image segmentation using random walks with restart. In: Lecture Notes in Computer Science, Vol. 5304. Berlin Heidelberg: Springer264-275, 2008.

DOI

[20]

Yang W.; Cai J.; Zheng J.; Luo J. User-friendly interactive image segmentation through unified combinatorial user inputs. IEEE Transactions on Image Processing Vol. 19, No. 9, 2470-2479, 2010.

DOI Google Scholar

[21]

Adams R.; Bischof L. Seeded region growing. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 16, No. 6, 641-647, 1994.

DOI Google Scholar

[22]

Ning J.; Zhang L.; Zhang D.; Wu C. Interactive image segmentation by maximal similarity based region merging. Pattern Recognition Vol. 43, No. 2, 445-456, 2010.

DOI Google Scholar

[23]

Noma A.; Graciano A. B. V.; Cesar Jr. R. M.; Consularo L. A.; Bloch I. Interactive image segmentation by matching attributed relational graphs. Pattern Recognition Vol. 45, No. 3, 1159-1179, 2012.

DOI Google Scholar

[24]

Greig D. M.; Porteous B. T.; Seheult A. H. Exact maximum a posteriori estimation for binary images. Journal of the Royal Statistical Society. Series B (Methodological) Vol. 51, No. 2, 271-279, 1989.

DOI Google Scholar

[25]

Zhou D.; Bousquet O.; Lal T. N.; Weston J.; Schölkopf B. Learning with local and global consistency. In: Proceedings of Advances in Neural Information Processing Systems, 321-328, 2003.

[26]

Zhou D.; Weston J.; Gretton A.; Bousquest O.; Schölkopf B. Ranking on data manifolds. In: Proceedings of Advances in Neural Information Processing Systems, 2004. Available at http://papers.nips.cc/paper/2447-ranking-on-data-manifolds.pdf.

[27]

Shen J.; Du Y.; Wang W.; Li X. Lazy random walks for superpixel segmentation. IEEE Transactions on Image Processing Vol. 23, No. 4, 1451-1462, 2014.

DOI Google Scholar

[28]

Achanta R.; Shaji A.; Smith K.; Lucchi A.; Fua P.; Susstrunk S. SLIC superpixels compared to state-of-the-art superpixel methods. IEEE Transactions on Pattern Analysis and Machine Intelligence Vol. 34, No. 11, 2274-2282, 2012.

DOI Google Scholar

[29]

Liu J. Y.; Sun J.; Shum H. Y. Paint selection. In: Proceedings of ACM SIGGRAPH 2009 papers, Article No. 69, 2009.

DOI

[30]

Kim T. H.; Lee K. M.; Lee S. U. Nonparametric higher-order learning for interactive segmentation. In: Proceedings of 2010 IEEE Conference on Computer Vision and Pattern Recognition, 3201-3208, 2010.

DOI

[31]

Sinop A. K.; Grady L. A seeded image segmentation framework unifying graph cuts and randomwalker which yields a new algorithm. In: Proceedings of IEEE 11th International Conference on Computer Vision, 1-8, 2007.

DOI

About this article

Publication history

Acknowledgements

Rights and permissions

Publication history

Revised: 11 August 2015

Accepted: 15 September 2015

Published: 06 November 2015

Issue date: September 2015

Copyright

Acknowledgements

The authors would like to thank the anonymous reviewers for their valued suggestions which helped a lot to improve the manuscript. This work has been supported by NSFC (National Natural Science Foundation of China, No. 61272326), the research grant of University of Macau (No. MYRG202(Y1-L4)-FST11-WEH), and the research grant of University of Macau (No. MYRG2014-00139-FST).

Rights and permissions

This article is published with open access at Springerlink.com

This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

Other papers from this open access journal are available free of charge from http://www.springer.com/journal/41095. To submit a manuscript, please go to https://www.editorialmanager.com/cvmj.