Journal Home > Volume 18 , Issue 4

The Internet of Things (IoT) implies a worldwide network of interconnected objects uniquely addressable, via standard communication protocols. The prevalence of IoT is bound to generate large amounts of multisource, heterogeneous, dynamic, and sparse data. However, IoT offers inconsequential practical benefits without the ability to integrate, fuse, and glean useful information from such massive amounts of data. Accordingly, preparing us for the imminent invasion of things, a tool called data fusion can be used to manipulate and manage such data in order to improve process efficiency and provide advanced intelligence. In order to determine an acceptable quality of intelligence, diverse and voluminous data have to be combined and fused. Therefore, it is imperative to improve the computational efficiency for fusing and mining multidimensional data. In this paper, we propose an efficient multidimensional fusion algorithm for IoT data based on partitioning. The basic concept involves the partitioning of dimensions (attributes), i.e., a big data set with higher dimensions can be transformed into certain number of relatively smaller data subsets that can be easily processed. Then, based on the partitioning of dimensions, the discernible matrixes of all data subsets in rough set theory are computed to obtain their core attribute sets. Furthermore, a global core attribute set can be determined. Finally, the attribute reduction and rule extraction methods are used to obtain the fusion results. By means of proving a few theorems and simulation, the correctness and effectiveness of this algorithm is illustrated.


menu
Abstract
Full text
Outline
About this article

An Efficient Multidimensional Fusion Algorithm for IoT Data Based on Partitioning

Show Author's information Jin ZhouLiang HuFeng WangHuimin LuKuo Zhao( )
College of Computer Science and Technology, Jilin University, Changchun 130012, China
College of Software, Changchun University of Technology, Changchun 130012, China.

Abstract

The Internet of Things (IoT) implies a worldwide network of interconnected objects uniquely addressable, via standard communication protocols. The prevalence of IoT is bound to generate large amounts of multisource, heterogeneous, dynamic, and sparse data. However, IoT offers inconsequential practical benefits without the ability to integrate, fuse, and glean useful information from such massive amounts of data. Accordingly, preparing us for the imminent invasion of things, a tool called data fusion can be used to manipulate and manage such data in order to improve process efficiency and provide advanced intelligence. In order to determine an acceptable quality of intelligence, diverse and voluminous data have to be combined and fused. Therefore, it is imperative to improve the computational efficiency for fusing and mining multidimensional data. In this paper, we propose an efficient multidimensional fusion algorithm for IoT data based on partitioning. The basic concept involves the partitioning of dimensions (attributes), i.e., a big data set with higher dimensions can be transformed into certain number of relatively smaller data subsets that can be easily processed. Then, based on the partitioning of dimensions, the discernible matrixes of all data subsets in rough set theory are computed to obtain their core attribute sets. Furthermore, a global core attribute set can be determined. Finally, the attribute reduction and rule extraction methods are used to obtain the fusion results. By means of proving a few theorems and simulation, the correctness and effectiveness of this algorithm is illustrated.

Keywords: Internet of Things, data fusion, multidimensional data, partitioning, rough set theory

References(21)

[1]
ITU Strategy and Policy Unit (SPU), ITU internet reports 2005: The internet of things. Geneva: International Telecommunication Union (ITU), 2005.
[2]
O. Vermesan, M. Harrison, H. Vogt, K. Kalaboukas, M. Tomasella, K. Wouters, S. Gusmeroli, and S. Haller, Internet of things strategic research roadmap. EPoSS: European Technology Platform on Smart Systems Integration, 2009.
[3]
P. Barnaghi, W. Wang, C. Henson, and K. Taylor, Semantics for the Internet of Things: Early progress and back to the future, International Journal on Semantic Web and Information Systems, vol. 8, no. 1, pp. 1-21, 2012.
[4]
C. C. Aggarwal, The Internet of Things: A survey and form the date-centric perspective, in Managing and Mining Sensor Data. New York, USA: Springer, 2013, pp. 383-428.
DOI
[5]
Z. Pawlak, Rough sets and intelligent data analysis, Information Sciences, vol. 147, no. 124, pp. 1-12, 2002.
[6]
Z. Pawlak, Rough sets theory and its applications, Journal of Communications and Information Technology, no. 3, pp. 7-10, Mar. 2002.
[7]
M. Krysikiewicz, Rough set approach to incomplete information system, Information Sciences, vol. 112, nos. 1-4, pp. 39-49, 1998.
[8]
L. Wald, Some terms of reference in data fusion, IEEE Transactions on Geosciences and Remote Sensing, vol. 37, no. 3, pp. 1190-1193, 1999.
[9]
E. F. Nakamura, A. A. F. Loureiro, and A. C. Frery, Information fusion for wireless sensor networks: Methods, models, and classifications, ACM Computing Surveys, vol. 39, no. 3, pp. 1-55, 2007.
[10]
M. Compton, P. Barnaghi, L. Bermudez, R. Garc¨ªa-Castro, O. Corcho, S. Cox, J. Graybeal, M. Hauswirth, C. Henson, A. Herzog, V. Huang, K. Janowicz, W. D. Kelsey, D. Le Phuoc, L. Lefort, M. Leggieri, H. Neuhaus, A. Nikolov, K. Page, A. Passant, A. Sheth, and K. Taylor, The SSN ontology of the W3C semantic sensor network incubator group, Journal of Web Semantics, vol. 17, pp. 25-32, 2012.
[11]
C. Henson, A. Sheth, and K. Thirunarayan, Semantic perception: Converting sensory observations to abstractions, IEEE Internet Computing, vol. 16, no. 2, pp. 26-34, 2012.
[12]
H. Patni, C. Henson, and A. Sheth, Linked sensor data, in Proc. 2010 International Symposium on Collaborative Technologies and Systems (CTS 2010), Chicago, USA, 2010, pp. 1-9.
DOI
[13]
M. Rinne, S. Torma, and E. Nuutila, SPARQL-based applications for RDF-encoded sensor data, in Proc. 5th International Workshop on Semantic Sensor Networks 2012 (SSN12), Boston, Massachusetts, USA, 2012, pp. 81-96.
[14]
A. K. Joshi, P. Jain, P. Hitzler, P. Z. Yeh, K. Verma, A. P. Sheth, and M. Damova, Alignment-based querying of linked open data, in Proc. on the Move to Meaningful Internet Systems: OTM 2012, Confederated International Conferences: CoopIS, DOA-SVI, and ODBASE 2012, Rome, Italy, 2012, pp. 1-18.
DOI
[15]
A. Rajaraman, J. Leskovec, and J. D. Ullman, Mining of Massive Datasets. Cambridge University Press, 2010.
DOI
[16]
N. Pham and R. Pagh, A near-linear time approximation algorithm for angle-based outlier detection in high-dimensional Data, in Proc. 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’12), Beijing, China, 2012, pp. 877-885.
DOI
[17]
S. Kahramanli, M. Hacibeyoglu, and A. Arslan, Attribute reduction by partitioning the minimized discernibility function, International Journal of Innovative Computing, Information and Control, Computer Research and Development, vol. 7, no. 5, pp. 2167-2186, 2011.
[18]
E. Oren, S. Kotoulas, G. Anadiotis, R. Srebes, A. ten Teiji, and F. van Harmelen, Marvin: Distributed reasoning over large-scale semantic web data, Journal of Web Semantics, vol.7, no. 4, pp. 305-316, 2009.
[19]
K. Thangavel and A. Pethalakshmi, Dimensionality reduction based on rough set theory: A review, Applied Soft Computing, vol. 1, no. 9, pp. 1-12, 2009.
[20]
Y. Qian, J. Liang, W. Pedrycz, and C. Dang, Positive approximation: An accelerator for attribute reduction in rough set theory, Artificial Intelligence, vol. 174, nos. 9-10, pp. 597-618, 2010.
[21]
D. Cao, X. Qiao, G. Judith, X. Li, and L. Meng, Mining data correlation from multi-faceted sensor data in the Internet of Things, China Communications, vol. 8, no. 1, pp. 132-138, 2011.
Publication history
Copyright
Acknowledgements
Rights and permissions

Publication history

Received: 25 June 2013
Accepted: 05 July 2013
Published: 05 August 2013
Issue date: August 2013

Copyright

© The author(s) 2013

Acknowledgements

This work was supported in part by the National High-Tech Research and Development (863) Program of China (No. 2011AA010101), the National Natural Science Foundation of China (Nos. 61103197, 61073009, and 61240029), the Science and Technology Key Project of Jilin Province (No. 2011ZDGG007), the Youth Foundation of Jilin Province of China (No. 201101035), the Fundamental Research Funds for the Central Universities of China (No. 200903179), China Postdoctoral Science Foundation (No. 2011M500611), the 2011 Industrial Technology Research and Development Special Project of Jilin Province (No. 2011006-9), the 2012 National College Students’ Innovative Training Program of China, and European Union Framework Program: MONICA Project under the Grant Agreement Number PIRSES-GA-2011-295222.

Rights and permissions

Return