A new soft set-based technique for clustering attribute selection in educational data mining

Determining the best clustering attribute is an essential step in data clustering, since it offers a relatively simple and efficient basis for attribute-based data clustering. Five well-known rough and soft set-based techniques for selecting a clustering attribute, namely TR, MMR, MDA, NSS, a...


Bibliographic Details
Main Author: Suhirman, .
Format: Thesis
Language:English
Published: 2016
Subjects: QA75 Electronic computers. Computer science; T Technology (General)
Online Access:http://umpir.ump.edu.my/id/eprint/15254/
http://umpir.ump.edu.my/id/eprint/15254/1/FSKKP%20-%20SUHIRMAN%20-%20CD%209878.pdf
http://umpir.ump.edu.my/id/eprint/15254/2/FSKKP%20-%20SUHIRMAN%20-%20CD%209878%20-%20CHAP%201.pdf
http://umpir.ump.edu.my/id/eprint/15254/3/FSKKP%20-%20SUHIRMAN%20-%20CD%209878%20-%20CHAP%203.pdf
id ump-15254
recordtype eprints
spelling ump-15254 2016-11-09T06:46:13Z http://umpir.ump.edu.my/id/eprint/15254/ 2016-01 Thesis NonPeerReviewed application/pdf en Suhirman, . (2016) A new soft set-based technique for clustering attribute selection in educational data mining. PhD thesis, Universiti Malaysia Pahang. http://iportal.ump.edu.my/lib/item?id=chamo:96859&theme=UMP2
repository_type Digital Repository
institution_category Local University
institution Universiti Malaysia Pahang
building UMP Institutional Repository
collection Online Access
language English
topic QA75 Electronic computers. Computer science
T Technology (General)
description Determining the best clustering attribute is an essential step in data clustering, since it offers a relatively simple and efficient basis for attribute-based data clustering. Five well-known rough and soft set-based techniques for selecting a clustering attribute have been proposed: TR, MMR, MDA, NSS, and MAR. MAR achieves better computational time than the other four. However, execution time remains an outstanding issue for MAR because of the iterative process used to determine the relative attribute. This research proposes an alternative soft set-based technique for selecting a clustering attribute, named Maximum Degree of Domination in Soft set theory (MDDS). In this technique, the notion of multi-soft sets is described first. Second, the domination of soft sets and its degree are defined. Finally, the maximum degree of domination is used to determine the best clustering attribute. The proposed technique is evaluated on eighteen UCI benchmark machine learning datasets and compared with the results obtained with MAR. The results show that MDDS performs fairly well in reducing computation time, outperforming MAR by up to 43.99%. Furthermore, MDDS scales well, i.e. its execution time tends to increase linearly as the data size increases. In addition, accuracy on the eight datasets that have a class attribute increased by 3.23%. The proposed MDDS technique was also used to solve a real-world clustering problem in Educational Data Mining. The datasets were taken from a survey of a few courses at the Information Engineering and Architecture Departments of the University Technology of Yogyakarta (UTY), Indonesia, over the last four years. The dominant attribute of the assessment dataset was determined using MDDS because of its improved efficiency and accuracy, so that decisions can be made faster and more accurately.
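The three steps named in the description (build multi-soft sets, compute a degree of domination, pick the maximum) can be illustrated with a minimal Python sketch. This is not the thesis's algorithm: the abstract does not give the formal definition of domination, so `domination_degree` below is a stand-in assumption (the fraction of objects whose class under one attribute is fully contained in a class of another attribute). The function names, the `TOY` table, and its attribute names are all hypothetical.

```python
from collections import defaultdict


def multi_soft_set(table):
    """Decompose a categorical table into one soft set per attribute.

    `table` maps object id -> {attribute: value}. Each soft set maps an
    attribute value to the set of objects taking that value.
    """
    soft_sets = defaultdict(lambda: defaultdict(set))
    for obj, row in table.items():
        for attr, value in row.items():
            soft_sets[attr][value].add(obj)
    return {attr: dict(classes) for attr, classes in soft_sets.items()}


def domination_degree(dominant, other, n_objects):
    """ASSUMED stand-in measure, not the thesis's definition.

    Fraction of objects whose class in `dominant` is a subset of
    some class in `other`.
    """
    covered = set()
    for objs in dominant.values():
        if any(objs <= target for target in other.values()):
            covered |= objs
    return len(covered) / n_objects


def select_clustering_attribute(table):
    """Return the attribute with the maximum degree and all scores."""
    soft_sets = multi_soft_set(table)
    n = len(table)
    scores = {
        a: max(
            (domination_degree(fa, fb, n)
             for b, fb in soft_sets.items() if b != a),
            default=0.0,
        )
        for a, fa in soft_sets.items()
    }
    best = max(scores, key=scores.get)
    return best, scores


# Hypothetical toy survey table (not data from the thesis).
TOY = {
    1: {"grade": "A", "attend": "high", "major": "CS"},
    2: {"grade": "A", "attend": "high", "major": "Arch"},
    3: {"grade": "B", "attend": "low", "major": "CS"},
    4: {"grade": "C", "attend": "low", "major": "Arch"},
}

if __name__ == "__main__":
    print(select_clustering_attribute(TOY))
```

In this toy table every `grade` class falls inside a single `attend` class, so `grade` receives the highest score under the assumed measure. A different definition of domination could rank the attributes differently.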
format Thesis
author Suhirman, .
title A new soft set-based technique for clustering attribute selection in educational data mining
title_sort new soft set-based technique for clustering attribute selection in educational data mining
publishDate 2016
url http://umpir.ump.edu.my/id/eprint/15254/
http://umpir.ump.edu.my/id/eprint/15254/1/FSKKP%20-%20SUHIRMAN%20-%20CD%209878.pdf
http://umpir.ump.edu.my/id/eprint/15254/2/FSKKP%20-%20SUHIRMAN%20-%20CD%209878%20-%20CHAP%201.pdf
http://umpir.ump.edu.my/id/eprint/15254/3/FSKKP%20-%20SUHIRMAN%20-%20CD%209878%20-%20CHAP%203.pdf
first_indexed 2023-09-18T22:19:43Z
last_indexed 2023-09-18T22:19:43Z
_version_ 1777415574999007232