Stars
Assistant0: An AI Personal Assistant Secured with Auth0
Collection of reinforcement learning algorithms
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
Benchmarking LLM agents on Cyber Threat Investigation.
Example implementations of Claude's Memory Tool API - Next.js web app and Python CLI for building applications with persistent memory
An Extendible (General) Continual Learning Framework based on PyTorch - official codebase of Dark Experience for General Continual Learning
A simple yet powerful agent framework that delivers with open-source models
Automatic compile-time instrumentation of Go code
An automated data pipeline scaling RL to pretraining levels
Testing baseline LLM performance across various models
KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA (+ more DSLs)
dstack is an open-source control plane for running development, training, and inference jobs on GPUs—across hyperscalers, neoclouds, or on-prem.
Atlas is infrastructure for continual learning in LLM agents.
An open source benchmarking framework for IT automation
Post-training with Tinker
A framework for building, orchestrating and deploying AI agents and multi-agent workflows with support for Python and .NET.
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…
A lightweight, powerful framework for multi-agent workflows
A Survey of Reinforcement Learning for Large Reasoning Models
Optimize prompts, code, and more with AI-powered Reflective Text Evolution