Abstract:
PCOS are the serious condition which affects female ovaries during their reproductive age of 15 to 45. This disease affects 5 to 10% of reproductive-age females. Although...Show MoreMetadata
Abstract:
PCOS are the serious condition which affects female ovaries during their reproductive age of 15 to 45. This disease affects 5 to 10% of reproductive-age females. Although it is difficult to fully resolve this issue,the PCOS affected women can be mitigated through proper exercise, by taking proper nutritious diet and maintaining the healthy BMI. Until they take a pregnancy test, the majority of women are unaware of the disease. The clinical dataset has the 541 instances and 45 attributes of unbalanced classes of 0 and 1 (no and yes) which has 364 instances of 0 (no) class and 177 instances of 1 (yes) class. Preprocessing is done for the unbalanced dataset by filling the null values and changing the datatype of all attributes to numeric datatypes. The unbalanced dataset is balanced by the balancing techniques of SMOTE and Random Over Sampling. Comparing the both balanced techniques through the accuracy the random oversampling gives the best.The supervised learning algorithms are Decision tree, KNN,Random Forest,AdaBoost, Logistic regression,Gradient boosting, cat boosting, XGBoosting, Linear SVM, Radial SVM, Linear discriminant analysis and Quadratic discriminant analysis are used.The supervised learning algorithms are trained and tested by splitting the dataset to 70% for training and 30% for testing. The ensemble stacking techniques are used by implementing the all models at the cross validation of 10.The xgboost gives the accuracy of 96% for the balanced dataset.
Date of Conference: 23-25 January 2023
Date Added to IEEE Xplore: 24 May 2023
ISBN Information: