LLMs have been shown to use tools well. By offloading to dedicated tools the capabilities that LLMs struggle with, these models can become much more useful. Previous works use handcrafted examples of simple tool use during self-training. This style of data generation and training is a good complement to the self-supervised nature of LLMs, because most of the generation effort falls on the LLM itself. But as tools become more complex, it becomes harder to handwrite examples thorough enough to drive that generation. Here we introduce Sourceformer, which attempts to use a tool in the form of its raw source code, both for self-training and for benchmarking during evaluation. We propose a potentially viable method that lets tools grow in complexity and size as the input token sequence available to LLMs inevitably grows. We focus on one tool in particular, a calculator, as a proof of concept for this idea, although our results are subpar. Across three math benchmarks (SVAMP, MAWPS, and ASDiv), accuracy increases slightly for some versions of our model compared to the base model before finetuning.
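To make the calculator tool concrete, here is a minimal sketch of a Toolformer-style calculator call embedded in text, with a small helper that fills in the result. The annotation format follows the Toolformer paper; the function names and regex are illustrative assumptions, not the exact code in this repository.

```python
import operator
import re

# Hypothetical sketch of a Toolformer-style calculator tool.
# The [Calculator(expr)->] annotation format follows the Toolformer paper;
# the helpers below are illustrative, not this repo's implementation.

OPS = {"+": operator.add, "-": operator.sub, "*": operator.mul, "/": operator.truediv}

def calculator(expression: str) -> float:
    """Evaluate a simple left-to-right arithmetic expression like '400/1400'."""
    tokens = re.split(r"([+\-*/])", expression.replace(" ", ""))
    result = float(tokens[0])
    for op, operand in zip(tokens[1::2], tokens[2::2]):
        result = OPS[op](result, float(operand))
    return round(result, 2)

def fill_tool_calls(text: str) -> str:
    """Replace '[Calculator(expr)->]' placeholders with their computed results."""
    pattern = re.compile(r"\[Calculator\((.*?)\)->\]")
    return pattern.sub(lambda m: f"[Calculator({m.group(1)})-> {calculator(m.group(1))}]", text)

print(fill_tool_calls("Out of 1400 participants, 400 [Calculator(400/1400)->] passed."))
# Out of 1400 participants, 400 [Calculator(400/1400)-> 0.29] passed.
```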
My models and dataset are available through Hugging Face.
The codebase for the Sourceformer model is based on two open-source repositories. My requirements.txt lists the packages for both repositories plus DeepSpeed.
conceptofmind/toolformer is an open-source implementation of Toolformer: Language Models Can Teach Themselves to Use Tools by Meta AI.
I have copied their README.md as it stood when I used it and placed it in /model.md
arkilpatel/SVAMP hosts the SVAMP, ASDiv, and MAWPS math benchmarks. All three are cleaned up and compiled in the SVAMP paper's repository.
I have copied their README.md as it stood when I used it and placed it in /benchmarks.md
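For reference, below is a minimal sketch of how an evaluation loop over one of these benchmark files might look. The file name and the "Body"/"Question"/"Answer" keys are assumptions about the data layout in arkilpatel/SVAMP, and predict() is a hypothetical placeholder for the finetuned model.

```python
import json

# Hypothetical evaluation loop over an SVAMP-style benchmark file.
# The path and the "Body"/"Question"/"Answer" keys are assumptions about the
# data layout in arkilpatel/SVAMP; predict() stands in for the model call.

def predict(problem_text: str) -> float:
    """Placeholder: run the finetuned model and parse a numeric answer."""
    raise NotImplementedError

def evaluate(path: str) -> float:
    with open(path) as f:
        problems = json.load(f)
    correct = 0
    for item in problems:
        question = f"{item['Body']} {item['Question']}"
        try:
            if abs(predict(question) - float(item["Answer"])) < 1e-4:
                correct += 1
        except ValueError:
            pass  # unparseable prediction counts as wrong
    return correct / len(problems)

# accuracy = evaluate("SVAMP.json")
```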