Dataset : https://www.kaggle.com/datasets/adrianmcmahon/imdb-india-movies
Description : Detailed analysis and Rating prediction for IMDb India movies (till 2021).
Difficulty : Intermediate
Tasks performed:
- Data Preprocessing :
- Feature selection
- Handling Null values
- Managing outliers
- Feature Engineering :
- One hot encoding
- Value manipulation
- Creating new columns
- Log Transformation of votes to manage skewness
- Data Analysis and Visualisation
- Box plots
- Violin plots
- Density graphs
- Scatter plots
- Bar graphs
- Model Building
- Train/test split
- Decision Trees
- Random Forest
- Gradient Boosting
- SVM
- Neural Networks
- XGBoost
- CatBoost
- Accuracy Metrics Evaluation
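
A few of the steps listed above (null handling, log transformation of votes, one-hot encoding, train/test split, a Random Forest model, and metric evaluation) can be sketched roughly as follows. This is a minimal illustration and not the notebook's actual code: the toy data, the column names (`Genre`, `Duration`, `Votes`, `Rating`), the median fill, and the choice of MAE and R2 as metrics are all assumptions made for the example.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_absolute_error, r2_score
from sklearn.model_selection import train_test_split

# Toy stand-in for the real dataset; column names and values are hypothetical.
rng = np.random.default_rng(0)
n = 200
df = pd.DataFrame({
    "Genre": rng.choice(["Drama", "Action", "Comedy"], size=n),
    "Duration": rng.normal(130, 20, size=n).round(),
    "Votes": rng.lognormal(mean=5, sigma=2, size=n).round(),
    "Rating": rng.uniform(1, 10, size=n).round(1),
})
df.loc[df.index % 25 == 0, "Duration"] = np.nan  # simulate missing values

# Handling null values: fill missing durations with the median (one option of many).
df["Duration"] = df["Duration"].fillna(df["Duration"].median())

# Log transformation: log1p compresses the long right tail of vote counts
# and is safe for rows with zero votes.
df["LogVotes"] = np.log1p(df["Votes"])
skew_before = df["Votes"].skew()
skew_after = df["LogVotes"].skew()

# One-hot encoding of the categorical column.
X = pd.get_dummies(df[["Genre", "Duration", "LogVotes"]], columns=["Genre"])
y = df["Rating"]

# Hold out 20% of the rows for evaluation.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

# Fit a Random Forest regressor and score it on the held-out rows.
model = RandomForestRegressor(n_estimators=50, random_state=42)
model.fit(X_train, y_train)
pred = model.predict(X_test)

mae = mean_absolute_error(y_test, pred)
r2 = r2_score(y_test, pred)
print(f"Votes skewness before: {skew_before:.2f}, after log1p: {skew_after:.2f}")
# The toy ratings are random noise, so these scores only show the mechanics.
print(f"MAE: {mae:.2f}, R2: {r2:.2f}")
```

With the real dataset, the same structure would apply after loading the CSV with `pd.read_csv`; the other listed models (Gradient Boosting, SVM, XGBoost, CatBoost) could be swapped in at the `model = ...` line, since they expose a similar `fit`/`predict` interface.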