Skip to content
View mayrop's full-sized avatar
🙈
Try your best, make it happen!
🙈
Try your best, make it happen!

Organizations

@justia @free-law-coalition

Block or report mayrop

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Qforia Best For specific Industry's Respect to wordlift and IPULL RANK

Python 2 3 Updated Sep 25, 2025

This solution accelerator leverages Azure AI Foundry, Azure AI Content Understanding, Azure OpenAI Service, and Azure AI Search to enable organizations to derive insights from volumes of conversati…

Python 383 229 Updated Nov 21, 2025

Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024

Python 2,527 229 Updated Nov 4, 2025

The best open-source python library to generate and process SAT's CFDI

Python 101 30 Updated Nov 2, 2025

PHP Common utilities for Mexican CFDI 3.2, 3.3 & 4.0

PHP 137 52 Updated Sep 26, 2025

Repository of AutoPatent.

JavaScript 161 23 Updated Nov 14, 2025

Code for my "Efficient Data Processing in SQL" book.

Python 60 19 Updated Aug 6, 2024

R Package of automated tools to retrieve, parse, clean, and analyze documents from the United States Supreme Court - including: oral argument transcripts, motions, applications, orders, and decision.

R 6 Updated Apr 11, 2025

MTEB: Massive Text Embedding Benchmark

Python 2,981 507 Updated Nov 23, 2025

This repository goes over how to handle massive variety in data engineering

Scala 307 68 Updated Jan 16, 2023

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 38,754 7,439 Updated Oct 29, 2025

data load tool (dlt) is an open source Python library that makes data loading easy 🛠️

Python 4,588 395 Updated Nov 23, 2025

OWASP Juice Shop: Probably the most modern and sophisticated insecure web application

TypeScript 12,019 15,299 Updated Nov 23, 2025

Google Tag Manager Variable Template for fetching nested properties from objects using dot notation

Smarty 6 Updated Sep 16, 2024

set of functions and operators for executing similarity queries

C 391 41 Updated May 29, 2025

"1 + 1 = 1 or Record Deduplication with Python" Jupyter Notebook

Jupyter Notebook 84 32 Updated Dec 8, 2022

Rapid fuzzy string matching in Python using various string metrics

Python 3,534 144 Updated Nov 24, 2025

Collect, aggregate, and visualize a data ecosystem's metadata

Java 2,061 379 Updated Nov 14, 2025

Data Pipeline Framework using the singer.io spec

Python 655 131 Updated Nov 22, 2025

🦾 Take control of your AI agents

Python 1,384 115 Updated Aug 22, 2025

Distributed query engine providing simple and reliable data processing for any modality and scale

Rust 4,827 349 Updated Nov 23, 2025

Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.

Python 2,264 187 Updated Nov 20, 2025

CAP database scripts.

HTML 192 44 Updated Sep 10, 2024

⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io

Python 2,236 248 Updated Nov 23, 2025

Source files used for an introduction to Twisted

Python 608 282 Updated Jun 20, 2020

Prefect tasks and subflows for interacting with shell commands.

Python 53 8 Updated Apr 26, 2024

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 17,344 2,849 Updated Nov 3, 2025

Event-driven networking engine written in Python.

Python 5,909 1,209 Updated Nov 19, 2025

dataform-ga4-sessions is a Dataform package to prepare session and event tables from Google Analytics 4 (GA4) BigQuery raw data

JavaScript 78 22 Updated Jun 1, 2025
Next