Improving the accuracy of the machine learning predictive models for analyzing CHD dataset

Ivelin Georgiev Ivanov

Abstract


The problem to classify big data is an important one in machine learning. There are multiple ways to classify data, but the support vector machine (SVM) has become a great tool for the data scientist. In this paper we examine several modifications of the support vector machine algorithm that achieve better efficiency in terms of accuracy, F1 precision and CPU time when classifying test observations in comparison to the standard SVM algorithm. To make the modifications faster than standard SVM we use a special methodology which splits the input dataset into n folds and combine it with input data transformations. Each time we execute the process, one of the folds is saved as a test subset and the rest of the folds are applied for training.  The process is executed n times. In the proposed methodology we are looking for the pair of subsets which produces the highest accuracy result. This pair is saved as an output SVM model.

Full Text: PDF

Published: 2022-01-10

How to Cite this Article:

Ivelin Georgiev Ivanov, Improving the accuracy of the machine learning predictive models for analyzing CHD dataset, J. Math. Comput. Sci., 12 (2022), Article ID 50

Copyright © 2022 Ivelin Georgiev Ivanov. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

 

Copyright ©2024 JMCS