Lists (2)
Sort Name ascending (A-Z)
Starred repositories
OCR, layout analysis, reading order, table recognition in 90+ languages
Python tools for creating suitable dataset for OpenAI's im2latex task: https://openai.com/requests-for-research/#im2latex
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Codebase for fine-tuning / evaluating nougat-based image2latex generation models
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
AgentFlow: In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Extract-0: A Specialized Language Model for Document Information
Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple regression tasks.
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …
Recursive-Open-Meta-Agent v0.1 (Beta). A meta-agent framework to build high-performance multi-agent systems.
Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini
Comfy, playful but productive theme for Obsidian. "Primary instantly puts you in a relaxed state that opens the door to creativity and exploration. Wonderfully executed down to the smallest details,"
🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data
Use your Mac trackpad as a weighing scale
My personal Obsidian vault template. A bottom-up approach to note-taking and organizing things I am interested in.
🔥 Clone and recreate any website as a modern React app in seconds
Renderer for the harmony response format to be used with gpt-oss
Hierarchical Reasoning Model Official Release
AI-powered desktop automation — open source, MIT-licensed, cross-platform, accessibility-first. Works across all apps and browsers. Inspired by GitHub Actions & Playwright. 100x faster than generic…
MCP server that execute applescript giving you full control of your Mac
🦎 Yo'Chameleon: Your Personalized Chameleon (CVPR 2025)
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.