

Flexible and powerful framework for managing multiple AI agents and handling complex conversations

Python 7,133 654 Updated Dec 8, 2025

What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?

TypeScript 17,092 1,305 Updated Sep 21, 2025

The open source coding agent.

TypeScript 39,625 3,348 Updated Dec 17, 2025

CC signals is a framework for a simple pact between those stewarding data, and those reusing it for AI development. CC signals provide a set of shared ground rules for an AI ecosystem that is mutua…

97 22 Updated Dec 4, 2025

A PyTorch native platform for training generative AI models

Python 4,853 642 Updated Dec 17, 2025

Jupyter Notebook 8 3 Updated Apr 29, 2025

Example projects and demos around data streaming, stream processing, change data capture, and more.

Java 11 1 Updated Aug 6, 2025

Self-contained worked examples of Apache Lucene features and functionality

Java 211 35 Updated Nov 26, 2025

This is a basic example about the setup and use of SQLMesh.

4 2 Updated Jan 10, 2025

A book describing how to set up and maintain Data Engineering infrastructure using Google Cloud Platform.

126 16 Updated Feb 22, 2021

Data Engineering on Google Cloud Platform

Jupyter Notebook 379 205 Updated Jul 29, 2024

List of changes announced for AWS that may break existing code

1,550 51 Updated May 20, 2025

Cloud native secrets management for developers - never leave your command line for secrets.

Rust 3,157 196 Updated Jul 30, 2024

Import Letterboxd movie list (diary) into trakt.tv

Python 106 20 Updated May 12, 2024

Data Analysis Workflows & Reproducibility Learning Resources

114 8 Updated Mar 4, 2021

This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurring topics across the PySpark repos we've encountered.

Python 1,202 156 Updated Sep 8, 2025

The official Python SDK for the Foundry API

Python 112 26 Updated Dec 16, 2025

JupyterLab desktop application, based on Electron.

TypeScript 4,172 460 Updated Dec 16, 2025

This repository is a production dbt pipeline example that models the profitability of an e-commerce business. Data is extracted and loaded to a BigQuery dwh by Airbyte. Data sources include Shopify,…

28 1 Updated Jun 14, 2024

Source for Google Click to Deploy solutions listed on Google Cloud Marketplace.

Python 766 462 Updated Sep 30, 2025

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ 16,266 3,944 Updated Dec 17, 2025

Apache DataFusion Python Bindings

Python 538 134 Updated Dec 14, 2025

Boring Data Tool

Rust 238 22 Updated Mar 21, 2024

Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.

Java 1,136 196 Updated Dec 17, 2025

Code for "Efficient Data Processing in Spark" Course

Python 350 71 Updated Oct 16, 2025

Devon: An open-source pair programmer

Python 3,463 284 Updated May 26, 2025

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 43,570 16,113 Updated Dec 17, 2025

Scalable and efficient data transformation framework - backwards compatible with dbt.

Python 2,815 320 Updated Dec 15, 2025

Python SQL Parser and Transpiler

Python 8,710 1,032 Updated Dec 17, 2025