Skip to content

🔍Explore steel plate defect prediction with EDA, modelling, and multi-class classification 🛠

License

Notifications You must be signed in to change notification settings

hardikjp7/SteelPlate-Multiclass-EDA-Modeling

Repository files navigation

Playground Series - Season 4, Episode 3: EDA/Modelling for Multi-Class Prediction of Steel Plate Defects

This repository contains a Jupyter Notebook detailing the exploratory data analysis (EDA) and modeling process for multi-class prediction of steel plate defects. The notebook is part of the Playground Series - Season 4, Episode 3.

Overview

The notebook is structured into six main parts:

  1. Data loading and first exploration
  2. Target analysis
  3. EDA and data preparation
  4. Modeling
  5. Explainability
  6. Preparation of the submission

Part 1: Data loading and first exploration

The data is loaded and basic exploration is performed to understand the dataset's structure and features.

Part 2: Target analysis

An analysis of the target variables is conducted to understand their distribution and characteristics.

Part 3: EDA and data preparation

Exploratory data analysis (EDA) techniques are applied to understand the relationships between features and prepare the data for modeling.

Part 4: Modeling

Modeling is performed using XGBoost with a focus on optimizing hyperparameters and evaluating model performance.

Part 5: Explainability

The model's explainability is explored using SHAP values to understand feature importance and model predictions.

Part 6: Preparation of the submission

The final model predictions are prepared for submission, including ensembling strategies to improve performance.

For the full details and code implementation, please refer to the notebook in this repository.

About

🔍Explore steel plate defect prediction with EDA, modelling, and multi-class classification 🛠

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published