Stars
4 stars written in Scala
The leader in Customer Data Infrastructure
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
REST job server for Apache Spark
DBpedia Spotlight is a tool for automatically annotating mentions of DBpedia resources in text. It implements the approach described in "Improving Efficiency and Accuracy in Multilingual Entity Extraction".