Implementing Vertices Principal Component Analysis under Various Weighting Schemes for Interval Valued Observations with Applications to Data Mining

Md Anwarul Islam  Bhuiyan; Sohana Jahan; Mohammad Babul Hasan

doi:10.3329/dujs.v72i1.71184

Authors

Md Anwarul Islam Bhuiyan Department Mathematics, University of Dhaka, Dhaka-1000, Bangladesh
Sohana Jahan Department Mathematics, University of Dhaka, Dhaka-1000, Bangladesh
Mohammad Babul Hasan Department Mathematics, University of Dhaka, Dhaka-1000, Bangladesh

Keywords:

Data Mining, Interval Valued Data, Principal Component Analysis, Vertices Principal Component Analysis, K-Nearest Neighbor, Distance Matrix

Abstract

Data mining is the technique for deriving valuable data from a more extensive collection of raw data. It is the process of looking for irregularities, trends, and correlations in huge data sets in order to forecast results. Although a number of techniques have been developed to perform data mining on conventional data in the past years, there are huge scope to work with Interval Valued data (IVD). Working with IVD has been shown to be of significant importance when it comes to identifying the objective entity in a precise manner or representing incomplete knowledge on life situations. Unlike classical data where each object is represented by a point, in IVD the objects are represented by regions in Rp. In this paper, an extension of Principle Component Analysis (PCA) known as Vertices Principal Components method for interval-valued information has been explored. It additionally incorporated the relative contributions of the vertices depending on different choices of weighting schemes. A new idea for classification of the supervised IVD is proposed which is based on the idea of K-Nearest Neighbor (KNN) technique. The proposed approach is implemented on several benchmarking data sets. Numerical results suggest the proper choice of weighting schemes for each of the data set that will lead to better recognition rate.

Dhaka Univ. J. Sci. 72(1): 46-55, 2024 (January)

Abstract
269

PDF
338

Implementing Vertices Principal Component Analysis under Various Weighting Schemes for Interval Valued Observations with Applications to Data Mining

Authors

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

How to Cite

Make a Submission

Information