Skip to content
#

boosted-trees

Here are 21 public repositories matching this topic...

This repository implements the basic machine learning classifiers for the problem of Yelp reviews classification. We assume the problem to be a binary classification problem. The models implemented are Naive Bayes, Logistic Regression, Support Vector Machine (linear), Decision Trees, Bagged Decision Trees, Random Fforests, and Boosted Decision Trees.

  • Updated May 6, 2018
  • Python

This project aims at developing, validating, and testing several classification statistical models that could predict whether or not an office room is occupied using several data features, namely temperature (◦C), light (lx), humidity (%), CO2 (ppm), and a humidity ratio. The data is modeled using classification techniques i.e. Logistic regression, Classification tree, Bagging-Random forest, and Gradient boosted trees. These models were trained and then after evaluated against validation and test sets and using confusion matrices to obtain classification and misclassification rates. The logistic model was trained using glmnet R package, Tree package for classification tree model, randomForest for both Bagging and Random Forest Models, and gbm package for Gradient Boosted Model. The best accuracy was obtained from the Random Forest Model with a classification rate of 93.21% when it was evaluated against the test set. Light sensor is also the most significant variable in predicting whether the office room is occupied or not, this was observed in all the five models.

  • Updated Dec 21, 2020

Improve this page

Add a description, image, and links to the boosted-trees topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the boosted-trees topic, visit your repo's landing page and select "manage topics."

Learn more