Recursive Language Models (minimal version)

I received a lot of requests to put out a notebook or gist version of the codebase I've been using. Sadly it's a bit entangled with a bunch of random state, cost, and code execution tracking logic that I want to clean up while I run other experiments. In the meantime, I've re-written a simpler version of what I'm using so people can get started building on top and writing their own RLM implementations. Happy hacking!

I've provided a basic, minimal implementation of a recursive language model (RLM) with a REPL environment for OpenAI clients. Like the blogpost, we only implement recursive sub-calls with depth=1 inside the RLM environment. Enabling further depths is as simple as replacing the Sub_RLM class with the RLM_REPL class, but you may need to finagle the exec-based REPL environments to work better here (because now your sub-RLMs have their own REPL environments!).

In this stripped implementation, we exclude a lot of the logging, cost tracking, prompting, and REPL execution details of the experiments run in the blogpost. It's relatively easy to modify and build on top of this code to reproduce those results, but it's currently harder to go from my full codebase to supporting any new functionality.

Basic Example

We have all the basic dependencies in requirements.txt, although none are really necessary if you change your implementation (openai for LM API calls, dotenv for .env loading, and rich for logging).

In main.py, we have a basic needle-in-the-haystack (NIAH) example that embeds a random number inside ~1M lines of random words, and asks the model to go find it. It's a silly Hello World type example to emphasize that RLM.completion() calls are meant to replace LM.completion() calls.

Code Structure

In the rlm/ folder, the two main files are rlm_repl.py and repl.py.

rlm_repl.py offers a basic implementation of an RLM using a REPL environment in the RLM_REPL class. The completion() function gets called when we query an RLM.
repl.py is a simple exec-based implementation of a REPL environment that adds an LM sub-call function. To make the system truly recursive beyond depth=1, you can replace the Sub_RLM class with RLM_REPL (they all inherit from the RLM base class).

The functionality for parsing and handling base LLM clients are all in rlm/utils/. We also add example prompts here.

The rlm/logger/ folder mainly contains optional logging utilities used by the RLM REPL implementation. If you want to enable colorful or enhanced logging outputs, you may need to install the rich library as a dependency.

pip install rich

When you run your code, you'll see something like this:

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
media		media
rlm		rlm
.env-example		.env-example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Recursive Language Models (minimal version)

Basic Example

Code Structure

About

Uh oh!

Releases

Packages

Languages

License

alexzhang13/rlm

Folders and files

Latest commit

History

Repository files navigation

Recursive Language Models (minimal version)

Basic Example

Code Structure

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages