Analysis of the Effect of Data Scaling on Machine Learning Algorithm Performance to Identify Abalone’s Sex

Jurnal Teknologi dan Sistem Komputer

View Publication Info
 
 
Field Value
 
Title Analysis of the Effect of Data Scaling on Machine Learning Algorithm Performance to Identify Abalone’s Sex
 
Creator Arifin, Willdan Aprizal
Ariawan, Ishak
Rosalia, Ayang Armelita
Lukman, Lukman
Tufailah, Nabila
 
Subject data scaling; machine learning algorithms; min-max normalization; zero-mean standardization
 
Description Analyzing the performance of machine learning algorithms with the data scaling process gives a view of how effective the method is. In this study, min-max (normalization) and zero-mean (standardization) are data scaling techniques used in the abalone dataset. The stages carried out in this study included data normalization on the data of abalone physical measurements features. The model evaluation was carried out using k-fold cross-validation with the number of k-fold 10. Abalone datasets were normalized in machine learning algorithms: Random Forest, Naïve Bayesian, Decision Tree, and SVM (RBF kernels and linear kernels). The eight features of the abalone dataset show that machine learning algorithms did not too influence data scaling. There is an increase in the performance of SVM while Random Forest decrease in performance when the abalone dataset is applied into data scaling. Even so, Random Forest has the highest average balanced accuracy (74.87%) without data scaling.
 
Publisher Departemen Teknik Komputer, Fakultas Teknik, Universitas Diponegoro
 
Date 2021-10-31
 
Type info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion

 
Identifier https://jtsiskom.undip.ac.id/article/view/14105
10.14710/jtsiskom.2021.14105
 
Source Jurnal Teknologi dan Sistem Komputer; 2021: Publication In-Press
Jurnal Teknologi dan Sistem Komputer; 2021: Publication In-Press
2338-0403
 
Language en
 
Rights Copyright (c) 2021 Willdan Aprizal Arifin, Ishak Ariawan, Ayang Armelita Rosalia, Lukman Lukman, Nabila Tufailah
https://creativecommons.org/licenses/by-sa/4.0
 

Contact Us

The PKP Index is an initiative of the Public Knowledge Project.

For PKP Publishing Services please use the PKP|PS contact form.

For support with PKP software we encourage users to consult our wiki for documentation and search our support forums.

For any other correspondence feel free to contact us using the PKP contact form.

Find Us

Twitter

Copyright © 2015-2018 Simon Fraser University Library