Skip to content
View elmer-garduno's full-sized avatar

Block or report elmer-garduno

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Public collaboration of Scalable Single Cell Analytics

Python 12 5 Updated Nov 21, 2017

a futureproof crossword corpus toolset

Python 251 29 Updated May 19, 2025

Examples of using CloudML with genomic data.

Python 18 11 Updated May 24, 2019

A set of command line tools (in Java) for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF.

Java 1,034 380 Updated Oct 6, 2025

Example spark integrations.

Scala 1 Updated Oct 6, 2015

Apache Spark jobs such as Principal Coordinate Analysis.

Scala 75 38 Updated Jan 30, 2017

A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means

Java 2,103 232 Updated Feb 17, 2025

Caffe: a fast open framework for deep learning.

C++ 34,720 18,597 Updated Jul 31, 2024

Apache Spark - A unified analytics engine for large-scale data processing

Scala 42,266 28,926 Updated Nov 10, 2025

network-based vaccination game

JavaScript 84 41 Updated Nov 18, 2021

A simple demonstration of sub-sequence sampling as used for anomaly detection with EKG signals

Java 103 30 Updated Oct 13, 2020

A visualization grammar.

JavaScript 11,682 1,551 Updated Nov 9, 2025

Stanford Network Analysis Platform (SNAP) is a general purpose network analysis and graph mining library.

C++ 2,247 805 Updated Dec 10, 2023

ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.

Scala 1,039 316 Updated Jul 12, 2025

scikit-learn: machine learning in Python

Python 63,982 26,423 Updated Nov 10, 2025

aka "Bayesian Methods for Hackers": An introduction to Bayesian methods + probabilistic programming with a computation/understanding-first, mathematics-second point of view. All in pure Python ;)

Jupyter Notebook 28,236 7,948 Updated Jun 25, 2024

Breeze is/was a numerical processing library for Scala.

Scala 3,456 694 Updated Oct 4, 2025

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.

Java 9,995 2,719 Updated Nov 9, 2025

OpenRefine is a free, open source power tool for working with messy data and improving it

Java 11,583 2,093 Updated Nov 6, 2025

Streaming MapReduce with Scalding and Storm

Scala 2,129 265 Updated Jan 19, 2022

BlinkDB: Sub-Second Approximate Queries on Very Large Data.

Scala 660 120 Updated Feb 6, 2014

A python script for summarizing articles using nltk

Python 546 121 Updated Jul 19, 2016

Twitter common libraries for python and the JVM (deprecated)

Java 2,090 563 Updated Dec 7, 2019

Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.

Java 1,134 385 Updated Apr 10, 2023

An accelerated framework for manipulating and interpreting high-throughput sequencing data

C++ 26 7 Updated Jul 9, 2013

Lightning-fast cluster computing in Java, Scala and Python.

Scala 1,427 383 Updated Apr 8, 2014

SSE Stream Aggregator

Java 832 252 Updated Apr 10, 2023

pushState + ajax = pjax

JavaScript 16,683 1,953 Updated Nov 30, 2022

Machine Learning / Natural Language Processing / Information Retrieval

Python 715 292 Updated Feb 5, 2021
Next