mlp visualization cleanup |
||
---|---|---|
.gitignore | ||
CNN.keras | ||
Project.ipynb | ||
README.md | ||
model_plot.png | ||
requirements.txt |
README.md
README
Project Overview
This Python notebook is a comprehensive machine learning project that performs data preprocessing, feature engineering, model training, and model evaluation. It uses various machine learning algorithms such as Random Forest, Support Vector Machine (SVM), Logistic Regression, Gradient Boosting, K-Nearest Neighbors (KNN), Naive Bayes, K-Means Clustering, Convolutional Neural Network (CNN), and Multi-Layer Perceptron (MLP) for classification tasks. The project also includes data visualization and model performance evaluation.
Getting Started
Prerequisites
- Python 3.7 or later
- pip (Python Package Installer)
Virtual Environment Setup
It's recommended to create a virtual environment to keep the dependencies required by this project separate from your other Python projects.
For Unix or MacOS, run:
python3 -m venv env
For Windows, run:
py -m venv env
This will create a new virtual environment in a folder named env
.
To activate the virtual environment, on Unix or MacOS, run:
source env/bin/activate
On Windows, run:
.\env\Scripts\activate
Dependencies Installation
After activating the virtual environment, you can install the necessary dependencies by running:
pip install -r requirements.txt
This command installs all the necessary libraries used in this project, including:
- pandas
- numpy
- matplotlib
- seaborn
- scikit-learn
- tensorflow
Usage
After setting up the environment and installing the dependencies, you can start the Jupyter notebook by running:
jupyter notebook
Then, navigate to the notebook file (.ipynb) in the Jupyter notebook web interface and open it.
Notebook Structure
The notebook is structured as follows:
-
Data Loading and Cleaning: The notebook starts by loading the data from a pickle file. If the file doesn't exist, it loads the data from a dataset directory, cleans it, and saves it to a pickle file.
-
Data Visualization: The notebook visualizes the data using various plots.
-
Model Training and Evaluation: The notebook trains various machine learning models on the data and evaluates their performance. It also saves the trained models to a file for future use.
-
Model Visualization: The notebook visualizes the performance of the trained models using various plots and metrics.