jbarnes850

Follow

🎯

Focusing

Jarrod Barnes jbarnes850

🎯

Focusing

Follow

40 followers · 146 following

New York, New York

Achievements

Achievements

Lists (1)

Sort

✨ Inspiration

Stars

openai / openai-guardrails-python

OpenAI Guardrails - Python

Python 124 17 Updated Nov 8, 2025

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 36,129 4,210 Updated Nov 5, 2025

auth0-samples / auth0-assistant0

Assistant0: An AI Personal Assistant Secured with Auth0

TypeScript 57 56 Updated Nov 7, 2025

temporalio / sdk-python

Temporal Python SDK

Python 855 140 Updated Nov 7, 2025

rail-berkeley / rlkit

Collection of reinforcement learning algorithms

Python 2,800 564 Updated Jun 17, 2024

anthropics / claude-cookbooks

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Jupyter Notebook 27,237 2,730 Updated Nov 7, 2025

anthropics / skills

Public repository for Skills

Python 15,935 1,379 Updated Oct 18, 2025

microsoft / SecRL

Benchmarking LLM agents on Cyber Threat Investigation.

Jupyter Notebook 98 13 Updated Nov 5, 2025

TinyCloudLabs / web-sdk

TypeScript 2 Updated Aug 21, 2025

DataDog / datadog-agent

Main repository for Datadog Agent

Go 3,387 1,351 Updated Nov 8, 2025

shlokkhemani / claude-memory-tools

Example implementations of Claude's Memory Tool API - Next.js web app and Python CLI for building applications with persistent memory

TypeScript 42 4 Updated Oct 14, 2025

aimagelab / mammoth

An Extendible (General) Continual Learning Framework based on Pytorch - official codebase of Dark Experience for General Continual Learning

Python 749 134 Updated Aug 23, 2025

TencentCloudADP / youtu-agent

A simple yet powerful agent framework that delivers with open-source models

Python 3,773 369 Updated Nov 6, 2025

DataDog / orchestrion

Automatic compile-time instrumentation of Go code

Go 478 22 Updated Nov 6, 2025

SalesforceAIResearch / PretrainRL-pipeline

An automated data pipeline scaling RL to pretraining levels

Python 67 6 Updated Oct 11, 2025

arcprize / arc-agi-benchmarking

Testing baseline LLMs performance across various models

Python 317 49 Updated Nov 6, 2025

axon-rl / gem

A Gym for Agentic LLMs

Python 349 20 Updated Oct 30, 2025

ScalingIntelligence / KernelBench

KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA (+ more DSLs)

Python 651 81 Updated Nov 5, 2025

dstackai / dstack

dstack is an open-source control plane for running development, training, and inference jobs on GPUs—across hyperscalers, neoclouds, or on-prem.

Python 1,952 201 Updated Nov 7, 2025

Arc-Computer / atlas-sdk

Atlas is infrastructure for continual learning in LLM agents.

Python 6 3 Updated Nov 7, 2025

itbench-hub / ITBench

An open source benchmarking framework for IT automation

144 22 Updated Oct 18, 2025

thinking-machines-lab / tinker-cookbook

Post-training with Tinker

Python 1,523 120 Updated Nov 7, 2025

microsoft / agent-framework

A framework for building, orchestrating and deploying AI agents and multi-agent workflows with support for Python and .NET.

C# 4,823 681 Updated Nov 8, 2025

openai / frontier-evals

OpenAI Frontier Evals

Python 937 106 Updated Oct 31, 2025

THUDM / LongBench

LongBench v2 and LongBench (ACL 25'&24')

Python 1,009 107 Updated Jan 15, 2025

andyzorigin / cybench

HTML 165 64 Updated Jun 12, 2025

stanford-crfm / helm

Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…

Python 2,534 340 Updated Oct 21, 2025

openai / openai-agents-python

A lightweight, powerful framework for multi-agent workflows

Python 17,178 2,829 Updated Nov 8, 2025

TsinghuaC3I / Awesome-RL-for-LRMs

A Survey of Reinforcement Learning for Large Reasoning Models

2,003 112 Updated Nov 5, 2025

gepa-ai / gepa

Optimize prompts, code, and more with AI-powered Reflective Text Evolution

Jupyter Notebook 1,498 107 Updated Nov 7, 2025