A portable embedded database using Arrow.

Rust 1,219 83 Updated Nov 11, 2025

Stack trace visualizer

Perl 19,009 2,066 Updated Oct 20, 2024

Get Method Sampling from Java Flight Recorder Dump and convert to FlameGraph compatible format.

Java 269 63 Updated Oct 25, 2023

Java Native Access

Java 8,845 1,693 Updated Nov 19, 2025

Mirror of the official PostgreSQL GIT repository. Note that this is just a *mirror* - we don't work with pull requests on github. To contribute, please see https://wiki.postgresql.org/wiki/Submitti…

C 19,195 5,253 Updated Nov 29, 2025

A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means

Java 2,109 232 Updated Feb 17, 2025

Stream summarizer and cardinality estimator.

Java 2,264 559 Updated Nov 28, 2019

LinDB is a scalable, high performance, high availability distributed time series database.

Go 3,043 280 Updated Oct 30, 2025

Scalable NameNode RPC Proxy for HDFS Federation

Java 86 16 Updated Apr 19, 2016

A high performance and generic framework for distributed DNN training

Python 3,714 494 Updated Oct 3, 2023

Deep Learning Pipelines for Apache Spark

Python 1,995 493 Updated Mar 30, 2023

Go configuration with fangs

Go 29,625 2,089 Updated Nov 20, 2025

StarCraft II Learning Environment

Python 8,219 1,174 Updated Jul 23, 2024

An end-to-end machine learning and data mining framework on Hadoop

Java 256 111 Updated May 13, 2024

Classical RecSys algorithms implemented by using TensorFlow Estimators

Python 184 73 Updated Nov 1, 2018

A Flexible and Powerful Parameter Server for large-scale machine learning

Java 6,783 1,591 Updated Oct 13, 2025

A small utility to modify the dynamic linker and RPATH of ELF executables

C 4,086 514 Updated Nov 24, 2025

An end-to-end machine learning and data mining framework on Hadoop

Java 1 Updated Apr 1, 2021

Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"

Java 1 Updated Dec 22, 2014

Aims to create distributed inverted indexes of the English Wikipedia dump using Hadoop.

Java 1 Updated Dec 28, 2013
PHP 1 Updated Dec 13, 2013

Liu Yang's implementation for Gibbs Sampling of LDA

Java 1 Updated Nov 26, 2013

Algorithm implementation for filling a polygon

3 Updated Oct 9, 2013

My project for the 2013 Object-Oriented Technology course at Fudan University.

Java 1 Updated Jun 8, 2013

An analysis of the principles behind Spark ML algorithms, with a detailed look at their source code implementations

1,959 824 Updated Mar 25, 2019

Apache Spark - A unified analytics engine for large-scale data processing

Scala 42,404 28,954 Updated Nov 30, 2025

Type-safe data migration tool for Slick, Git and beyond.

Scala 189 32 Updated Aug 19, 2024

Azkaban workflow manager.

Java 1 Updated May 29, 2016

A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.

Scala 13,237 3,579 Updated Nov 25, 2025

Notes talking about the design and implementation of Apache Spark

5,347 1,839 Updated Apr 2, 2024