location:Home > 2019 VOL.2 Apr No.2 > High-Dimensional Big Data Subspace Clustering Integrated Processing Method Based on Cloud Computing

2019 VOL.2 Apr No.2

  • Title: High-Dimensional Big Data Subspace Clustering Integrated Processing Method Based on Cloud Computing
  • Name: Anthony Hill
  • Company: The University of Auckland, Auckland, New Zealan
  • Abstract:

    In the field of data mining, high-dimensional data is generally sparse, there are a lot of redundant dimension, it is easy to cover the real structure of the data. Therefore, based on the high-dimensional data of a large cloud, subspace clustering to study the integration processing method. Features tend to have high dimensional data redundancy. These characteristics affect the accuracy of the redundant cluster, increasing the time complexity of the algorithm. If the advance of the high-dimensional feature selection data to select a proper subset of features, can effectively reduce the complexity of the process time and improve the accuracy of analysis results. Subspace clustering integrated approach is a way to analyze the problem of computer learning more learners integrate.

  • Keyword: Keywords: cloud computing; subspace; integration; method
  • DOI: 10.12250/jpciams2019020114
  • Citation form: Anthony Hill.High-Dimensional Big Data Subspace Clustering Integrated Processing Method Based on Cloud Computing[J]. Computer Informatization and Mechanical System, 2019, vol. 2, pp. 12-18.
Reference:

[1]Zhao He.Research on subspace ensemble learning method for high-dimensional large data[D].University of Chinese Academy of Sciences (Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences), 2017, 40 (14):2230-2232.

[2]Shuai Hui, Yuan Xiaotong, Liu Qingshan.Sparse subspace clustering based on L_0 constraint[J].Journal of Nanjing University (Natural Science), 2018, 54 (1).2017 (1): 1285-1289.

[3]Tian Shuai, Chen Yi.Summary of Visual Analysis of High Dimensional Data Based on Subspace Clustering[J].Computer Engineering and Applications, 2017,40(22): 2339-2341.

[4]Qiu Yunfei, Vivien, Liu Daqian. Overlapped subspace clustering algorithm based on probabilistic model [J]. Pattern recognition and artificial intelligence, 2017, 32 (7): 2333-2337.

[5]Song Kuiyong, Wang Nianbin, Wang Hongbin, et al. [J].Computer Engineering and Applications, 2017,24 (7): 2371-2379.

[6] Wang Guofeng, Liu Chuanyi, Pan Hezhong, et al. [J].Journal of Computer Science, 2017,34(2): 165-172.

[7]Zhao Boxin.Research on Outlier Detection Algorithms for High Dimensional Large Data Sets Based on Grid Partition[J].Management and Technology of SMEs (first ten days),.2017, 29 (9):1519-16983.

[9]Zhao He.Research on subspace ensemble learning method for high-dimensional large data[D].University of Chinese Academy of Sciences (Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences), 2017, 32 (20): 1205-1212.

[10] He Xinggao, Li Chanjuan, Wang Ruijin, et al. Research on dimensionality reduction algorithm for high-dimensional sparse large data based on information entropy [J].Journal of University of Electronic Science and Technology, 2017 (2): 1225-1227.

[11]Wen Heping, Yu Simin, Lu Jinhu.Encryption algorithm based on Hadoop large data platform and non-degenerate high-dimensional discrete hyperchaotic system[J].Acta Physics, 2017, 40 (5): 258-262.

[12]Liu Ming, Han Pu, Zhang Ting. Model Identification of Reheated Steam Temperature System for Ultra-Supercritical Secondary Reheating Unit [J].Computer Simulation,  2017,34 (11): 1180-1183.


Tsuruta Institute of Medical Information Technology
Address:[502,5-47-6], Tsuyama, Tsukuba, Saitama, Japan TEL:008148-28809 fax:008148-28808 Japan,Email:jpciams@hotmail.com,2019-09-16