Project Overview
This project develops the data architecture and processing pipelines for BADS Bike Shop, a fictional business that sells and rents bicycles. The dataset combines transactional data from Kaggle, customer data from Mockaroo, and simulated GPS and battery data for rental bikes. Two pipelines form the core of the solution: a batch pipeline for analytical insights and a stream pipeline for real-time operational monitoring, both implemented on Google Cloud using BigQuery and Dataproc. Their outputs feed two dashboards: the BI & KYC Dashboard for customer demographics and the Operations Dashboard for real-time bike tracking.
```
├───batch
│   ├───cleaned
│   ├───data
│   └───integration
├───pipelines
└───stream
    ├───data
    ├───kafka
    ├───notebooks
    └───producer
```
- Spark Notebooks: A collection of Jupyter notebooks containing the Spark programs for data processing (cleaning and integration).
- Data: The datasets used by the batch pipeline, included for completeness.
- Pipelines: A CI/CD pipeline that automates the conversion of Jupyter notebooks (.ipynb) into Python scripts (.py) and then uploads them to a cloud repository.
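The conversion step can be reproduced locally; a minimal sketch, assuming the standard `jupyter nbconvert` CLI is installed (the output folder name and source directory here are illustrative, and the upload step is left to the CI job):

```python
import subprocess
from pathlib import Path

def build_convert_command(notebook: Path) -> list[str]:
    """Build the nbconvert invocation that turns a .ipynb into a .py script."""
    return [
        "jupyter", "nbconvert",
        "--to", "script",          # emit a plain Python script
        "--output-dir", "build",   # hypothetical output folder
        str(notebook),
    ]

def convert_all(notebook_dir: Path) -> None:
    """Convert every notebook under notebook_dir; upload is handled by the CI job."""
    for nb in sorted(notebook_dir.glob("*.ipynb")):
        subprocess.run(build_convert_command(nb), check=True)

if __name__ == "__main__":
    convert_all(Path("batch"))  # hypothetical source directory
```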
- Data: Datasets that simulate streaming input, included for completeness and testing.
- Kafka: A docker-compose file to set up the Kafka consumer environment.
- Notebooks: A Spark program that processes data arriving from the Kafka stream.
- Producer: A Python program that simulates GPS stream data, acting as a stream producer that can run from a laptop.
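The producer side can be sketched with the standard library alone. The field names (`bike_id`, `lat`, `lon`, `battery_pct`) and the base coordinates below are illustrative assumptions, not taken from the project data, and the actual publish call to Kafka (e.g. via a Kafka client library) is reduced to a comment:

```python
import json
import random
import time

# Illustrative base coordinates; not from the project data.
BASE_LAT, BASE_LON = 51.4416, 5.4697

def make_reading(bike_id: int) -> dict:
    """Simulate one GPS/battery reading for a rental bike."""
    return {
        "bike_id": bike_id,
        "lat": BASE_LAT + random.uniform(-0.05, 0.05),
        "lon": BASE_LON + random.uniform(-0.05, 0.05),
        "battery_pct": round(random.uniform(5, 100), 1),
        "ts": time.time(),
    }

def stream_readings(n_bikes: int, n_rounds: int):
    """Yield JSON-encoded readings; a real producer would publish each message."""
    for _ in range(n_rounds):
        for bike_id in range(n_bikes):
            yield json.dumps(make_reading(bike_id))
            # producer.send("bike-telemetry", value=...)  # with a real Kafka client

if __name__ == "__main__":
    for msg in stream_readings(n_bikes=3, n_rounds=1):
        print(msg)
```

Each yielded message is a self-contained JSON document, which keeps the downstream Spark job free to parse records independently of producer batching.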
- Andy Huang
- Huub van de Voort
- Oumaima Lemhour
- Roman Nekrasov
- Tom Teurlings