Skip to content
View ash80's full-sized avatar
  • London, United Kingdom

Block or report ash80

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. RLHF_in_notebooks RLHF_in_notebooks Public

    RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks

    Jupyter Notebook 209 18

  2. diffusion-gpt diffusion-gpt Public

    From babyGPT to diffusion GPT: An annotated implementation of a character-level discrete diffusion model (adapted from Karpathy’s baby GPT).

    Jupyter Notebook 220 18

  3. backtracking_gpt backtracking_gpt Public

    A GPT agent with a Text Interface tool

    Python 15 1