PERFORMANCE ANALYSIS OF PREDICTING SURVIVAL RATES IN IMBALANCED HEALTHCARE DATASET.

dc.contributor.authorR, Vani
dc.date.accessioned2020-09-29T05:25:59Z
dc.date.available2020-09-29T05:25:59Z
dc.date.issued2018-04
dc.description.abstractPredicting Patients health is a critical task in the Healthcare Industry. Healthcare datasets show a high degree of imbalance especially for rare diseases. The current work aims at predicting the post operative survival rate in thoracic surgery datasets. The dataset exhibits data imbalance with around 15% positive cases and remaining 85% negative cases. The commonly applicable machine learning techniques for prediction score poorly in predicting the positive cases in spite of high accuracy of the predictions for the negative cases. We use SMOTE (synthetic minority oversampling technique) to reduce the degree of imbalance and increase the positive samples proportion before the application of the following classifiers: Naive Bayes, Neural Networks, Random Forest, Boosting algorithms - Adaboost, Extreme Gradient boosting and Support Vector Machines and examine the results. The study shows that SVM and Naïve Bayes show significantly better performance on the imbalanced datasets than other models using synthetic datasets than under normal conditions.en_US
dc.identifier.issn0976-5697
dc.identifier.urihttp://www.ijarcs.info/index.php/Ijarcs/article/view/5523
dc.identifier.urihttps://dspace.psgrkcw.com/handle/123456789/1839
dc.language.isoenen_US
dc.publisherInternational Journal of Advanced Research in Computer Scienceen_US
dc.subjectclassificationen_US
dc.subjecthealthcareen_US
dc.subjectThoracicen_US
dc.subjectSMOTEen_US
dc.subjectSynthetic datasetsen_US
dc.subjectlung canceren_US
dc.subjectSVMen_US
dc.subjectNaive Bayesen_US
dc.titlePERFORMANCE ANALYSIS OF PREDICTING SURVIVAL RATES IN IMBALANCED HEALTHCARE DATASET.en_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
PERFORMANCE ANALYSIS OF PREDICTING SURVIVAL RATES IN IMBALANCED HEALTHCARE DATASET..docx
Size:
10.53 KB
Format:
Microsoft Word XML
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: