In this project, we reduced an imbalanced dataset to a balanced dataset using Under-sampling approach by applying Consensus Clustering using 'Simple Majority Voting' consensus function and further saw the increase in the accuracy of disease prediction by running multiple classifiers with bagging and boosting technique.
This project explains why and how are the Bagged Models better than the Complete Model. Bagged Model parameters have tighter confidence interval and a lower bias.
Reducing imbalanced dataset (Undersampling) by Consensus Clustering (Simple Majority Voting function) and validating the changes using different classifier model with bagging and boosting techniques.
The objective of this project is to build a “Risk Analytics model” to understand the renewal potential and claim propensity of Existing Customers under Personal Auto Insurance Lines. The final models for claims and renewals were able to accurately identifies over 70% of claims & 85% renewals.