Skip to content

AryanKKate/IMDb_analysis_for_bollywood

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

🎥 IMDb_analysis_for_bollywood 🎥

Dataset : https://www.kaggle.com/datasets/adrianmcmahon/imdb-india-movies

Description : Detailed analysis and Rating prediction for IMDb India movies (till 2021).

Difficulty : Intermediate

Tasks performed:

  1. Data Preprocessing :
    1. Feature selection /li>
    2. Handling Null values
    3. Managing outliers
  2. Feature Engineering :
    1. One hot encoding
    2. Value manupulation
    3. Creating new columns
    4. Log Transformation of votes to manage skewness
  3. Data Analysis and Visualisation
    1. Box plots
    2. Violin plots
    3. Density graphs
    4. Scatter plots
    5. Bar graphs
  4. Model Building
    1. Train_Test_Split
    2. Decision Trees
    3. Random Forest
    4. Gradient Boosting
    5. SVM
    6. Neural Networks
    7. XGBoost
    8. CatBoost
  5. Accuracy Metrics Evaluation

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published