General Boilerplate Code

Datasets

Python

The following are all python notebooks containing boilerplate code for EDA, data cleaning, and various supervised classification models using scikit-learn or TensorFlow.

Text Data Examples (Most Updated)

Contains XGBoost, Random Forest, Logistic Regression, and KNN code.

Non-Text Data Examples (Most Updated)

Could be adapted to any non NN supervised classifier (just have to adjust hyper parameter tuning).

R – Coming Soon

The following are all R Markdown files containing boilerplate code for EDA, data cleaning, and various supervised classification models.