AI Chat Paper
Note: Please note that the following content is generated by AMiner AI. SciOpen does not take any responsibility related to this content.
{{lang === 'zh_CN' ? '文章概述' : 'Summary'}}
{{lang === 'en_US' ? '中' : 'Eng'}}
Chat more with AI
Article Link
Collect
Submit Manuscript
Show Outline
Outline
Show full outline
Hide outline
Outline
Show full outline
Hide outline
Regular Paper

SMEC: Scene Mining for E-Commerce

State Key Laboratory of Software Development Environment, Beihang University, Beijing 100191, China
School of Data Science and Engineering, East China Normal University, Shanghai 200062, China
JD.com, Inc., Beijing 100176, China
Baidu, Inc., Beijing 100085, China
Show Author Information

Abstract

Scene-based recommendation has proven its usefulness in E-commerce, by recommending commodities based on a given scene. However, scenes are typically unknown in advance, which necessitates scene discovery for E-commerce. In this article, we study scene discovery for E-commerce systems. We first formalize a scene as a set of commodity categories that occur simultaneously and frequently in real-world situations, and model an E-commerce platform as a heterogeneous information network (HIN), whose nodes and links represent different types of objects and different types of relationships between objects, respectively. We then formulate the scene mining problem for E-commerce as an unsupervised learning problem that finds the overlapping clusters of commodity categories in the HIN. To solve the problem, we propose a non-negative matrix factorization based method SMEC (Scene Mining for E-Commerce), and theoretically prove its convergence. Using six real-world E-commerce datasets, we finally conduct an extensive experimental study to evaluate SMEC against 13 other methods, and show that SMEC consistently outperforms its competitors with regard to various evaluation measures.

Electronic Supplementary Material

Download File(s)
JCST-2101-11277-Highlights.pdf (347.3 KB)

References

[1]
Sarwar B, Karypis G, Konstan J, Riedl J. Item-based collaborative filtering recommendation algorithms. In Proc. the 10th International Conference on World Wide Web, May 2001, pp.285–295. DOI: 10.1145/371920.372071.
[2]
Rendle S, Freudenthaler C, Schmidt-Thieme L. Factorizing personalized Markov chains for next-basket recommendation. In Proc. the 19th International Conference on World Wide Web, Apr. 2010, pp.811–820. DOI: 10.1145/1772690.1772773.
[3]
Zhou G R, Zhu X Q, Song C R, Fan Y, Zhu H, Ma X, Yan Y H, Jin J Q, Li H, Gai K. Deep interest network for click-through rate prediction. In Proc. the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Jul. 2018, pp.1059–1068. DOI: 10.1145/3219819.3219823.
[4]
Kiapour M H, Yamaguchi K, Berg A C, Berg T L. Hipster wars: Discovering elements of fashion styles. In Proc. the 13th European Conference on Computer Vision, Sept. 2014, pp.472–488. DOI: 10.1007/978-3-319-10590-1_31.
[5]
Liu Z W, Luo P, Qiu S, Wang X G, Tang X O. DeepFashion: Powering robust clothes recognition and retrieval with rich annotations. In Proc. the 2016 IEEE Conference on Computer Vision and Pattern Recognition, Jun. 2016, pp.1096–1104. DOI: 10.1109/CVPR.2016.124.
[6]
Kang W C, Kim E, Leskovec J, Rosenberg C, McAuley J. Complete the look: Scene-based complementary product recommendation. In Proc. the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Jun. 2019, pp.10524–10533. DOI: 10.1109/CVPR.2019.01078.
[7]

Sun Y Z, Han J W, Yan X F, Yu P S, Wu T Y. PathSim: Meta path-based top- K similarity search in heterogeneous information networks. Proceedings of the VLDB Endowment, 2011, 4(11): 992–1003. DOI: 10.14778/3402707. 3402736.

[8]
Huang Z P, Zheng Y D, Cheng R, Sun Y Z, Mamoulis N, Li X. Meta structure: Computing relevance in large heterogeneous information networks. In Proc. the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Aug. 2016, pp.1595–1604. DOI: 10.1145/2939672.2939815.
[9]
Fang Y, Lin W Q, Zheng V W, Wu M, Chang K C C, Li X L. Semantic proximity search on graphs with metagraph-based learning. In Proc. the 32nd IEEE International Conference on Data Engineering, May 2016, pp.277–288. DOI: 10.1109/ICDE.2016.7498247.
[10]
Zhao H, Yao Q M, Li J D, Song Y Q, Lee D L. Meta-graph based recommendation fusion over heterogeneous information networks. In Proc. the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Aug. 2017, pp.635–644. DOI: 10.1145/3097983.3098063.
[11]
Wang X, Ji H Y, Shi C, Wang B, Ye Y F, Cui P, Yu P S. Heterogeneous graph attention network. In Proc. the 28th World Wide Web Conference, May 2019, pp.2022–2032. DOI: 10.1145/3308558.3313562.
[12]
Liu J L, Wang C, Gao J, Han J W. Multi-view clustering via joint nonnegative matrix factorization. In Proc. the 13th SIAM International Conference on Data Mining, Dec. 2013, pp.252–260. DOI: 10.1137/1.9781611972832.28.
[13]
Qiu J Z, Dong Y X, Ma H, Li J, Wang K S, Tang J. Network embedding as matrix factorization: Unifying DeepWalk, LINE, PTE, and node2vec. In Proc. the 11th ACM International Conference on Web Search and Data Mining, Feb. 2018, pp.459–467. DOI: 10.1145/3159652.3159706.
[14]

Linden G, Smith B, York J. Amazon.com recommendations: Item-to-item collaborative filtering. IEEE Internet Computing, 2003, 7(1): 76–80. DOI: 10.1109/MIC.2003.1167344.

[15]
Zhang H, Chen X, Ma S. Dynamic news recommendation with hierarchical attention network. In Proc. the 19th IEEE International Conference on Data Mining, Nov. 2019, pp.1456–1461. DOI: 10.1109/ICDM.2019.00190.
[16]
Wang G, Guo Z Y, Li X, Yin D W, Ma S. SceneRec: Scene-based graph neural networks for recommender systems. In Proc. the 24th International Conference on Extending Database Technology, Mar. 2021, pp.397–402. DOI: 10.5441/002/EDBT.2021.41.
[17]

Xie J R, Kelley S, Szymanski B K. Overlapping community detection in networks: The state-of-the-art and comparative study. ACM Computing Surveys, 2013, 45(4): Article No. 43. DOI: 10.1145/2501654.2501657.

[18]

Dunn J C. A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters. Journal of Cybernetics, 1973, 3(3): 32–57. DOI: 10.1080/01969 727308546046.

[19]
Lee D D, Seung H S. Algorithms for non-negative matrix factorization. In Proc. the 13th International Conference on Neural Information Processing Systems, Jan. 2000, pp.556–562. DOI: 10.5555/3008751.3008829.
[20]

Duan L, Ma S, Aggarwal C, Ma T J, Huai J P. An ensemble approach to link prediction. IEEE Trans. Knowledge and Data Engineering, 2017, 29(11): 2402–2416. DOI: 10.1109/TKDE.2017.2730207.

[21]

Wang F, Li T, Wang X, Zhu S H, Ding C. Community discovery using nonnegative matrix factorization. Data Mining and Knowledge Discovery, 2011, 22(3): 493–521. DOI: 10.1007/S10618-010-0181-Y.

[22]
Yang J, Leskovec J. Overlapping community detection at scale: A nonnegative matrix factorization approach. In Proc. the 6th ACM International Conference on Web Search and Data Mining, Feb. 2013, pp.587–596. DOI: 10.1145/2433396.2433471.
[23]
Wang X, Jin D, Cao X C, Yang L, Zhang W X. Semantic community identification in large attribute networks. In Proc. the 30th AAAI Conference on Artificial Intelligence, Feb. 2016, pp.265–271. DOI: 10.5555/3015812.3015851.
[24]
Kuang D, Park H, Ding C H Q. Symmetric nonnegative matrix factorization for graph clustering. In Proc. the 12th SIAM International Conference on Data Mining, Apr. 2012, pp.106–117. DOI: 10.1137/1.9781611972825.10.
[25]
Yang J, McAuley J, Leskovec J. Community detection in networks with node attributes. In Proc. the 13th IEEE International Conference on Data Mining, Dec. 2013, pp.1151–1156. DOI: 10.1109/ICDM.2013.167.
[26]
Liu Y, Niculescu-Mizil A, Gryc W. Topic-link LDA: Joint models of topic and author community. In Proc. the 26th Annual International Conference on Machine Learning, Jun. 2009, pp.665–672. DOI: 10.1145/1553374.1553460.
[27]
McAuley J, Leskovec J. Learning to discover social circles in ego networks. In Proc. the 25th International Conference on Neural Information Processing Systems, Dec. 2012, pp.539–547. DOI: 10.5555/2999134.2999195.
[28]
Coscia M, Rossetti G, Giannotti F, Pedreschi D. DEMON: A local-first discovery method for overlapping communities. In Proc. the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Aug. 2012, pp.615–623. DOI: 10.1145/2339530.2339630.
[29]
Zhang X C, Li H X, Liang W X, Luo J B. Multi-type co-clustering of general heterogeneous information networks via nonnegative matrix tri-factorization. In Proc. the 16th IEEE International Conference on Data Mining, Dec. 2016, pp.1353–1358. DOI: 10.1109/ICDM.2016.0185.
[30]
Zhou Y, Liu L, Buttler D. Integrating vertex-centric clustering with edge-centric clustering for meta path graph analysis. In Proc. the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Aug. 2015, pp.1563–1572. DOI: 10.1145/2783258.2783328.
[31]

Sun Y Z, Norick B, Han J W, Yan X F, Yu P S, Yu X. Pathselclus: Integrating meta-path selection with user-guided object clustering in heterogeneous information networks. ACM Trans. Knowledge Discovery from Data, 2013, 7(3): Article No. 11. DOI: 10.1145/2500492.

[32]
Shi Y, He X W, Zhang N J, Yang C, Han J W. User-guided clustering in heterogeneous information networks via motif-based comprehensive transcription. In Proc. the 2019 European Conference on Machine Learning and Knowledge Discovery in Databases, Sept. 2019, pp.361–377. DOI: 10.1007/978-3-030-46150-8_22.
[33]
Elsner U. Graph partitioning: A survey. Technical Report, SFB 393/97-27, Technische Universität Chemnitz, Chemnitz, Germany, 1997. https://core.ac.uk/download/pdf/153227811.pdf, Jan. 2024.
[34]
Tumasjan A, Sprenger T O, Sandner P G, Welpe I M. Predicting elections with Twitter: What 140 characters reveal about political sentiment. In Proc. the 4th International Conference on Weblogs and Social Media, May 2010, pp.178–152. DOI: 10.1609/icwsm.v4i1.14009.
[35]
Azarbonyad H, Dehghani M, Beelen K, Arkut A, Marx M, Kamps J. Words are malleable: Computing semantic shifts in political and media discourse. In Proc. the 2017 ACM on Conference on Information and Knowledge Management, Nov. 2017, pp.1509–1518. DOI: 10.1145/3132847.3132878.
[36]

Cui P, Wang X, Pei J, Zhu W W. A survey on network embedding. IEEE Trans. Knowledge and Data Engineering, 2019, 31(5): 833–852. DOI: 10.1109/TKDE.2018.2849727.

[37]
Lu Y F, Shi C, Hu L M, Liu Z Y. Relation structure-aware heterogeneous information network embedding. In Proc. the 33rd AAAI Conference on Artificial Intelligence, Jul. 2019, pp.4456–4463. DOI: 10.1609/AAAI.V33I01.33014456.
[38]
Tang J, Qu M, Wang M Z, Zhang M, Yan J, Mei Q Z. LINE: Large-scale information network embedding. In Proc. the 24th International Conference on World Wide Web, May 2015, pp.1067–1077. DOI: 10.1145/2736277.2741093.
[39]
Wang D, Cui P, Zhu W. Structural deep network embedding. In Proc. the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Aug. 2016, pp.1225–1234. DOI: 10.1145/2939672.2939753.
[40]
Hu R J, Aggarwal C C, Ma S, Huai J. An embedding approach to anomaly detection. In Proc. the 32nd IEEE International Conference on Data Engineering, May 2016, pp.385–396. DOI: 10.1109/ICDE.2016.7498256.
[41]
Perozzi B, Al-Rfou R, Skiena S. DeepWalk: Online learning of social representations. In Proc. the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Aug. 2014, pp.701–710. DOI: 10.1145/2623330.2623732.
[42]
Grover A, Leskovec J. node2vec: Scalable feature learning for networks. In Proc. the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Aug. 2016, pp.855–864. DOI: 10.1145/2939672.2939754.
[43]
Fu T Y, Lee W C, Lei Z. Hin2vec: Explore meta-paths in heterogeneous information networks for representation learning. In Proc. the 2017 ACM on Conference on Information and Knowledge Management, Nov. 2017, pp.1797–1806. DOI: 10.1145/3132847.3132953.
[44]
Dong Y X, Chawla N V, Swami A. metapath2vec: Scalable representation learning for heterogeneous networks. In Proc. the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Aug. 2017, pp.135–144. DOI: 10.1145/3097983.3098036.
[45]
Kipf T N, Welling M. Semi-supervised classification with graph convolutional networks. In Proc. the 15th International Conference on Learning Representations, Apr. 2017.
[46]
Hamilton W L, Ying R, Leskovec J. Inductive representation learning on large graphs. In Proc. the 31st International Conference on Neural Information Processing Systems, Dec. 2017, pp.1024–1034. DOI: 10.5555/3294771.3294869.
[47]
Sun Y Z, Han J W. Mining Heterogeneous Information Networks: Principles and Methodologies. Springer, 2012: 1–159. DOI: 10.1007/978-3-031-01902-9.
[48]

Sun Y Z, Han J W. Mining heterogeneous information networks: A structural analysis approach. ACM SIGKDD Explorations Newsletter, 2012, 14(2): 20–28. DOI: 10.1145/ 2481244.2481248.

[49]
Xu W, Liu X, Gong Y H. Document clustering based on non-negative matrix factorization. In Proc. the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Jul. 2003, pp.267–273. DOI: 10.1145/860435.860485.
[50]

Koren Y, Bell R, Volinsky C. Matrix factorization techniques for recommender systems. Computer, 2009, 42(8): 30–37. DOI: 10.1109/MC.2009.263.

[51]
Boyd S, Vandenberghe L. Convex Optimization. Cambridge University Press, 2004. DOI: 10.1017/CBO9780511804441.
[52]
McAuley J, Targett C, Shi Q F, Van Den Hengel A. Image-based recommendations on styles and substitutes. In Proc. the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval, Aug. 2015, pp.43–52. DOI: 10.1145/2766462.2767755.
[53]
Yang J, Leskovec J. Community-affiliation graph model for overlapping network community detection. In Proc. the 12th IEEE International Conference on Data Mining, Dec. 2012, pp.1170–1175. DOI: 10.1109/ICDM.2012.139.
[54]

Fortunato S. Community detection in graphs. Physics Reports, 2010, 486(3/4/5): 75–174. DOI: 10.1016/J.PHYSREP.2009.11.002.

[55]
Gregory S. Fuzzy overlapping communities in networks. arXiv: 1010.1523, 2010. https://arxiv.org/abs/1010.1523, Jan. 2024.
Journal of Computer Science and Technology
Pages 192-210
Cite this article:
Wang G, Li X, Guo Z-Y, et al. SMEC: Scene Mining for E-Commerce. Journal of Computer Science and Technology, 2024, 39(1): 192-210. https://doi.org/10.1007/s11390-021-1277-0

99

Views

0

Crossref

0

Web of Science

0

Scopus

0

CSCD

Altmetrics

Received: 08 January 2021
Accepted: 24 October 2021
Published: 25 January 2024
© Institute of Computing Technology, Chinese Academy of Sciences 2024
Return