- Belgrade, Serbia
- https://www.datakolektiv.com/
- @GSMilovanovic
- in/gmilovanovic
Stars
ETL, Analytics, Versioning for Unstructured Data
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activelo…
Official repo for the #tidytuesday project
Schema.org - schemas and supporting software
OpenRefine is a free, open source power tool for working with messy data and improving it
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
Github mirror - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing)
Apache Superset is a Data Visualization and Data Exploration Platform