#006 : Bookstore Scraping

The platform/tools used:

  • Data cleaning & manipulation : Pandas
  • Database System : SQLite
  • Container : Docker

Project : https://github.com/nairkivm/book-scrape


One day, you are hired to gather information about books that your uncle wants to sell at his new bookstore. As you delve into this task, you begin collecting data on book titles, prices, and other relevant details. However, you don’t want to engage in the tedious process of copying and pasting back and forth between the web and your spreadsheet, do you?

Don’t worry, I got your back! I built this project for you to automatically fetch all the data that you want and store the data in an SQLite database with just one click!

project-006-workflow
Workflow of this project.

I used requests and Beautiful Soup module to get the raw HTML data and parse it into meaningful data. To organize and format the data, I used Pandas module before inserting it in the database.

Here’s the repository of this project. Good luck with your uncle’s new bookstore!

project-006-data-sample
Data sample.