Stars
MinerU-HTML: An SLM-powered HTML main content extractor that outputs clean HTML bodies. Perfect for Deep Research Agents, RAG applications, and training data generation.
[ACL 2025 Best Theme Paper] This is the official implementation for the paper: "Meta-rater: A Multi-dimensional Data Selection Method for Pre-training Language Models"
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
🔥🔥🔥AI-driven database tool and SQL client, The hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, DB2, SQLite, H2, ClickHouse, and more.
🔥 人人可用的开源 BI 工具,数据可视化神器。An open-source BI tool alternative to Tableau.
Sampling CPU and HEAP profiler for Java featuring AsyncGetCallTrace + perf_events
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance …
james-hadoop / metricflow
Forked from dbt-labs/metricflowMetricFlow allows you to define, build, and maintain metrics in code.
james-hadoop / dowhy
Forked from py-why/dowhyDoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphic…
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and provi…
MetricFlow allows you to define, build, and maintain metrics in code.
An easy to use, self-service open BI reporting and BI dashboard platform.
spring boot 实践学习案例,是 spring boot 初学者及核心技术巩固的最佳实践。
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…