Stars
This is a repo with links to everything you'd ever want to learn about data engineering
ClickHouse® is a real-time analytics database management system
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Terraform provider for managing Snowflake accounts
PyScript is an open source platform for Python in the browser. Try PyScript: https://pyscript.com Examples: https://tinyurl.com/pyscript-examples Community: https://discord.gg/HxvBtukrg2
The swiss army knife of lossless video/audio editing
Continuous Unix commit history from 1970 until today
A CLI tool that automatically writes commit messages for you.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Fully automated homelab from empty disk to running services with a single command.
GINO Is Not ORM - a Python asyncio ORM on SQLAlchemy core.
A python documentation linter which checks that the docstring description matches the definition.
Python assignments for the machine learning class by andrew ng on coursera with complete submission for grading capability and re-written instructions.
Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable,…
xlwings is a Python library that makes it easy to call Python from Excel and vice versa. It works with Excel on Windows and macOS as well as with Google Sheets and Excel on the web.
Python regular expressions made easy
Algorithms for Decision Making textbook
🎓 Path to a free self-taught education in Computer Science!
Track GitHub trending repositories in your favorite programming language by native GitHub notifications!
Python best practices guidebook, written for humans.
Curated coding interview preparation materials for busy software engineers
📚 Freely available programming books