GitHub - zain-ul-din-zafar/medicine-crawler

A Webscrapper capable of scrapping all medicine data from dawaai.pk

Usage

You probably want to scrap the website for the medicine data you could use this data that I scrapped link is available here. if you want to scrap the latest data here are the instructions on how to do that.

clone this repo
cd into the cloned folder and run npx yarn which will install dependencies.
run the project by typing npx yarn dev command and it will start generating files inside the data folder. Don't forget to clear previous data before running it.

FAQ

1. How did you make 4000+ commits?

Instead of running this crawler locally, i decided to run it on the codespace which is free but the only limitation is that GitHub automatically closes them if they remain idle for 4 hours. To save the codespace state I wrote a handy dandy bash script that pushed my code to GitHub after crafting each page data.

Name		Name	Last commit message	Last commit date
Latest commit History 4,627 Commits
data		data
src		src
.gitignore		.gitignore
README.md		README.md
commit.bash		commit.bash
package-lock.json		package-lock.json
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
yarn.lock		yarn.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Usage

FAQ

About

Releases

Packages

Contributors 2

Languages

zain-ul-din-zafar/medicine-crawler

Folders and files

Latest commit

History

Repository files navigation

Usage

FAQ

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages