Stars
Extract method sampling data from a Java Flight Recorder dump and convert it to a FlameGraph-compatible format.
Mirror of the official PostgreSQL GIT repository. Note that this is just a *mirror* - we don't work with pull requests on github. To contribute, please see https://wiki.postgresql.org/wiki/Submitti…
A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means
Stream summarizer and cardinality estimator.
LinDB is a scalable, high performance, high availability distributed time series database.
A high performance and generic framework for distributed DNN training
Deep Learning Pipelines for Apache Spark
An end-to-end machine learning and data mining framework on Hadoop
Classical RecSys algorithms implemented by using TensorFlow Estimators
A Flexible and Powerful Parameter Server for large-scale machine learning
A small utility to modify the dynamic linker and RPATH of ELF executables
junshiguo / shifu
Forked from ShifuML/shifu
An end-to-end machine learning and data mining framework on Hadoop
junshiguo / AMC
Forked from brett-pplx/AMC
Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"
junshiguo / EnWikiIndexing
Forked from Raysmond/EnWikiIndexing
Aims to create distributed inverted indexes of the English Wikipedia dump using Hadoop.
Liu Yang's implementation for Gibbs Sampling of LDA
junshiguo / xksystem
Forked from Raysmond/xksystem
My project for the 2013 Object-Oriented Technology course at Fudan University.
Apache Spark - A unified analytics engine for large-scale data processing
Type-safe data migration tool for Slick, Git and beyond.
A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.
Notes talking about the design and implementation of Apache Spark