Skip to content

Latest commit

 

History

History
59 lines (35 loc) · 1.73 KB

README.md

File metadata and controls

59 lines (35 loc) · 1.73 KB

app-rating-predictor

SC1015 Data Science Project Group Members: Aaron, Ivan, Yifei

Project Plan

Marking Rubrics

  1. 10% for coming up with your own problem definition based on a dataset

  2. 10% for data preparation and cleaning to suit the problem of your choice

  3. 20% for exploratory data analysis/visualization to gather relevant insights

  4. 20% for the use of machine learning techniques to solve specific problem

  5. 20% for the presentation of data-driven insights and the recommendations

  6. 10% for the quality of your final team presentation and overall impressions

  7. 10% for learning something new and doing something beyond this course

Problem statement

  1. Machine learning main goal: predict rating of the app using features.
  2. Which genre of apps has the highest rating?
  3. Which country makes the best apps?
  4. Does content rating, price(free / paid), ad supported apps has impact on the rating?
  5. Does size of the app affect total installs? (some people don't like to install large apps)
  6. Which type of games is the most successful?
  7. Best developers and their top categories.
  8. Developers that made the most apps.
  9. FAANG, which company made the best apps?
  10. How to get "High" Rating on Play Store?

Data preparation and cleaning

Dataset link: google-playstore-apps

Scrapper folder ./google-play-scrapper

Exploration data analysis

Machine learning techniques

Problem 1: Predicting the rating of the app (Numerical)

Linear Regression

Problem 2: Prediciting the app price (free/paid)

Classification

Problem 3: Predicting size of the app?

Presentation of data driven insights

Streamlit link