A Python Web Scraping Project to scrape topic titles, descriptions, and URLs from GitHub's topics page and organize them into a Pandas DataFrame or .csv file
- Scrapes TOP 30 topics information from GitHub.
- Cleans and structures data for easy analysis.
- Outputs data as a Pandas DataFrame and .csv file
- Python
- Requests
- os
- BeautifulSoup
- Pandas
Clone the repo:
git clone https://github.com/mShubham18/DataScience-Projects-GitHub-Top30-Repositories-Data-Scrapping.git
cd DataScience-Projects-GitHub-Top30-Repositories-Data-Scrapping
- Data folder contains all the output csv's
Notebook.ipynb
is the jupyter notebook file containing the entire project source code.