PurposeParkinson's disease (PD) is one of the most prevalent neurodegenerative diseases in the global context. The presently available detecting process of PD is costly and labour-intensive. Along with a movement disorder, PD also affects speech differently. By causing variation in pitch, monotonicity, slurring of words, or slow speed of talking. This study uses different machine learning binary classification algorithms for the detection and classification of PD.MethodsThe publicly available Parkinson's disease speech features dataset is imbalanced, with only 192 healthy instances compared to 564 PD instances. Synthetic Minority Oversampling Technique - Edited Nearest Neighbours (SMOTE-ENN) algorithms rectify the class imbalance by oversampling and under sampling. Thus, it results in a balanced dataset free of noisy samples. Machine learning binary classifiers, Random Forest, K-Nearest Neighbours, Support Vector Machine, Extreme Gradient Boosting, Decision Tree, and Logistic Regression are investigated.ResultsThe classification algorithms have been analysed and compared based on several standard evaluation metrics. The classification model, resampling using SMOTE-ENN technique, and dimensionality reduction using principal component analysis (PCA) have been performed on the dataset to enhance the performance and prevent overfitting. The combination of SMOTE-ENN and Support Vector Machine (SVM) yields a good accuracy of 96.5%.ConclusionSpeech features are a predictive and non-intrusive method, thus making the diagnostic process painless and straightforward. The reported results are promising to aid the diagnosis of PD so that treatment can be administered as early as possible. Thus, the primary findings are beneficial to detect PD at an early stage with optimal accuracy.