Automatic categorization of documents, consists in assigning a category to a text based on the information it contains. We'll follow different approach of Supervised Machine Learning.
Python library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for improving text classification models. Helps improve your machine learning models
This Master´s Degree Thesis focuses on the application of Machine and Deep Learning algorithms to analyse and extract the value of texts and the search for hidden patterns from the European University research project called "Applications based on IoT and Big Data in the hospital environment".
NLP implementations like information-theoretic measures of distributional similarity, text preprocessing using shell commands, Naive Bayes text categorization model, Cocke-Younger-Kasami parsing.
Binary text categorization classifier which detects the presence of customer experience in a message. Trained with Twitter data. Tested with Twitter and Facebook data.