Starred repositories
An open source Minecraft reimplementation written from scratch. Mirror of https://gitlab.bixilon.de/bixilon/minosoft
Open source framework for processing, monitoring, and alerting on time series data
Lightweight alternative to github.com/prometheus/client_golang
"MiniRAG: Making RAG Simpler with Small and Open-Sourced Language Models"
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
SafeLine is a self-hosted WAF(Web Application Firewall) / reverse proxy to protect your web apps from attacks and exploits.
Sophos-ReversingLabs 20 million sample dataset
A Minecraft Launcher which is multi-functional, cross-platform and popular
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
A lightweight, powerful framework for multi-agent workflows
DeepEvolve is a research and coding agent for new algorithm discovery in different science domains with Deep Research and AlphaEvolve.
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Multilingual Document Layout Parsing in a Single Vision-Language Model
Updated lists of IP addresses/whitelists of good bots and crawlers. Includes GoogleBot, BingBot, DuckDuckBot, etc.
A simple to use Go (golang) package to generate or parse Twitter snowflake IDs
A distributed unique ID generator inspired by Twitter's Snowflake
High-performance, columnar, in-memory store with bitmap indexing in Go
PyTorch implementation of Deeplog: Anomaly detection and diagnosis from system logs through deep learning
A deep learning toolkit for log-based anomaly detection
A machine learning toolkit for log parsing [ICSE'19, DSN'16]
A large collection of system log datasets for AI-driven log analytics [ISSRE'23]
A robust streaming log template miner based on the Drain algorithm
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Model Context Protocol Servers
E-mails, subdomains and names Harvester - OSINT
Code for voicing silent speech from EMG. Official repository for the papers "Digital Voicing of Silent Speech" at EMNLP 2020 and "An Improved Model for Voicing Silent Speech" at ACL 2021. Also incl…
A topic-centric list of HQ open datasets.
The media player for language learning, with dual subtitles, AI-generated subtitles, real-time translation, and more!