Member-only story
Mushroom dataset analysis and classification in python
Introduction
Mushroom classification is a beginner machine learning problem and the objective is to correctly classify if the mushroom is edible or poisonous by its specifications like cap shape, cap color, gill color, etc. using different classifiers.
In this project I have used the following classifiers to make the prediction:-
- Logistic Regression
- KNN Logistic Regression,
- SVM,
- Naive Bayes
- Decision Tree,
- Random Forest Classifier
If you want to see the original notebook in Kaggle please visit kaggle-milindsoorya.
Dataset
The dataset used in this project contains 8124 instances of mushrooms with 23 features like cap-shape, cap-surface, cap-color, bruises, odor, etc.
you can download the dataset from kaggle if you want to follow along locally — mushroom-dataset
The python libraries and packages we’ll use in this project are namely:
- NumPy
- Pandas
- Seaborn
- Matplotlib
- Graphviz