techandy42

☀️

Being cracked.

Andy Lee techandy42

☀️

Being cracked.

21 | Prev @ Glean, Carta | 4A CS @ U of Waterloo

15 followers · 15 following

San Francisco, CA, United States
in/andy-lee-b68302232

Achievements

Tinker Public

The world's first System Design Engineer AI Agent.

TypeScript 1 Apache License 2.0 Updated Jul 12, 2025
Codex_Playground Public

A playground repo to play around with OpenAI Codex.

Python 1 Updated Jun 29, 2025
BICS_Plus Public

Benchmark that tests LLMs to find semantic bugs in large Python code.

Python 2 Apache License 2.0 Updated Jun 11, 2025
BioServices_DS Public

Dataset of BioService code containing signatures and docstrings.

Jupyter Notebook 1 Updated Jun 4, 2025
bioservices Public
Forked from cokelaer/bioservices

Access to Biological Web Services from Python.

Python 1 Other Updated May 19, 2025
MLGit Public

MLGit: Index Codebase into Natural Language Descriptions; Works Just Like Git.

Python 1 Apache License 2.0 Updated May 18, 2025
MLGit_Test_Repo_N2 Public

A Test Repo for MLGit; Python; Relative Imports.

Python 1 Apache License 2.0 Updated May 18, 2025
MLGit_Test_Repo_N1 Public

A Test Repo for MLGit; Python; Absolute Imports.

Python 1 Apache License 2.0 Updated May 18, 2025
techandy42 Public

README.md for my GitHub page.

2 Updated May 16, 2025
bug_in_the_code_stack Public

A new benchmark for measuring LLM's capability to detect bugs in large codebase.

Jupyter Notebook 3 5 Apache License 2.0 Updated May 3, 2025
watai_hammingai_project Public

WAT.ai x Hamming.ai Joint Project for Building Code Debugging Benchmarks and Models.

Python 5 Updated May 3, 2025
debugger_llm Public

Open-source datasets & models for LLM Judges to find and describe bugs in LLM-generated code.

Jupyter Notebook 2 Updated Nov 9, 2024
Codegen_Challenge_Submission Public

A Python import visualization program.

Jupyter Notebook 1 Updated Sep 14, 2024
bug_in_the_code_stack_v2 Public

Can LLMs find bugs that compilers can't?: A benchmark for measuring LLMs' capabilities in debugging large source code.

Jupyter Notebook 1 Updated May 29, 2024
eccc-hail-forecasting-project Public

Open-source ECCC repository for notebooks and documentations for the Hail Forecasting project by Hokyung (Andy) Lee.

Jupyter Notebook 1 Updated Apr 26, 2024
eccc-webcam-project Public

Open-source ECCC repository for notebooks and documentations for the Webcam project by Hokyung (Andy) Lee.

Jupyter Notebook 1 Updated Apr 26, 2024
LVEval Public
Forked from infinigence/LVEval

Repository of LV-Eval Benchmark

Jupyter Notebook 1 MIT License Updated Apr 15, 2024
babilong Public
Forked from booydar/babilong

BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.

Jupyter Notebook 1 Updated Apr 15, 2024
awesome-llm-metrics Public

An open-source framework that makes evaluating LLMs & prompt engineering x10 easier!

Python 3 Updated Mar 20, 2024
RagTagTeam Public

Startup co-founder matching platform built using Cohere for the WAT.AI RAG Challenge hackathon.

Jupyter Notebook 2 Updated Jan 26, 2024
GreenTechGuardians Public

A Circular Economy business idea evaluator tool built using Gen-AI.

Jupyter Notebook 4 3 Updated Jan 18, 2024
racecar_gym Public
Forked from axelbr/racecar_gym

A gym environment for a miniature racecar using the pybullet physics engine.

Python 1 MIT License Updated Jan 3, 2024
CrafterGPT Public

Leveraging Language Model to Play Procedurally-Generated Survival Games.

reinforcement-learning video-game language-model

Jupyter Notebook 3 Updated Dec 28, 2023
OpenAI_Gym_Atari_Space_Invaders_RL Public

Space Invaders agent trained using DQN/A2C models on OpenAI Gym Atari Environment.

reinforcement-learning openai-gym dqn a2c

Jupyter Notebook 3 Updated Dec 28, 2023
LLM_Reward_Model Public

Developing a LLM response ranking reward model using HFRL except it's GPT-3.5 instead of human.

language-model reward-model hfrl

Jupyter Notebook 2 Updated Dec 28, 2023
crafter Public
Forked from danijar/crafter

Benchmarking the Spectrum of Agent Capabilities

Python 1 MIT License Updated Dec 16, 2023
ExchangeAgent Public

Training a stock exchange agent with Reinforcement Learning algorithms and Decision Transformer.

reinforcement-learning stock-exchange decision-transformer

Jupyter Notebook 3 2 Updated Dec 15, 2023
FinancialBERT Public

Stock price prediction model built using BERT and regression model trained on textual financial news data.

stock-price-prediction bert machine-learning-research

Jupyter Notebook 29 4 Updated Dec 9, 2023
rank_llm Public
Forked from castorini/rank_llm

Repository for prompt-decoding using LLMs (GPT3.5, GPT4, and Vicuna)

Python 1 Apache License 2.0 Updated Nov 11, 2023
torchgym Public

A PyTorch library that provides major RL algorithms and functionalities for training OpenAI Gym agents.

reinforcement-learning openai-gym pytorch

Python 1 Updated Nov 8, 2023

Andy Lee techandy42

Achievements

Achievements

Tinker Public

Uh oh!

Codex_Playground Public

Uh oh!

BICS_Plus Public

Uh oh!

BioServices_DS Public

Uh oh!

bioservices Public

Uh oh!

MLGit Public

Uh oh!

MLGit_Test_Repo_N2 Public

Uh oh!

MLGit_Test_Repo_N1 Public

Uh oh!

techandy42 Public

Uh oh!

bug_in_the_code_stack Public

Uh oh!

watai_hammingai_project Public

Uh oh!

debugger_llm Public

Uh oh!

Codegen_Challenge_Submission Public

Uh oh!

bug_in_the_code_stack_v2 Public

Uh oh!

eccc-hail-forecasting-project Public

Uh oh!

eccc-webcam-project Public

Uh oh!

LVEval Public

Uh oh!

babilong Public

Uh oh!

awesome-llm-metrics Public

Uh oh!

RagTagTeam Public

Uh oh!

GreenTechGuardians Public

Uh oh!

racecar_gym Public

Uh oh!

CrafterGPT Public

Uh oh!

OpenAI_Gym_Atari_Space_Invaders_RL Public

Uh oh!

LLM_Reward_Model Public

Uh oh!

crafter Public

Uh oh!

ExchangeAgent Public

Uh oh!

FinancialBERT Public

Uh oh!

rank_llm Public

Uh oh!

torchgym Public

Uh oh!