Northeastern University
Starred repositories
Convert PDF to markdown + JSON quickly with high accuracy
Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.
An Obsidian plugin for displaying markdown notes as mind maps using Markmap.
Production-ready platform for agentic workflow development.
DSPy: The framework for programming—not prompting—language models
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
A repo listing papers related to LLM-based agents
Code for "WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"
PDF++: the most Obsidian-native PDF annotation & viewing tool ever. Comes with optional Vim keybindings.
A plugin for reading and annotating PDFs and EPUBs in Obsidian.
The Web framework for perfectionists with deadlines.
Robust Speech Recognition via Large-Scale Weak Supervision
⚡ The one-liner node.js http-proxy middleware for connect, express, next.js and more
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Templates and example code for creating Streamlit Components
Reference implementations of several LangChain agents as Streamlit apps
Streamlit — A faster way to build and share data apps.
A collection of my book notes on various subjects, mainly computer science
https://huyenchip.com/ml-interviews-book/
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Code for the paper "Language Models are Unsupervised Multitask Learners"
LLM training code for Databricks foundation models
A repository for research on medium-sized language models.