Skip to content
View soldni's full-sized avatar
🏳️‍🌈
vibing!
🏳️‍🌈
vibing!

Organizations

@Georgetown-IR-Lab @allenai

Block or report soldni

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 515 42 Updated Mar 13, 2025

utilities for batched llm calls with retries

Python 43 1 Updated Jan 2, 2026

RSS reader for macOS and iOS.

Swift 9,449 620 Updated Jan 5, 2026

Our library for RL environments + evals

Python 3,701 465 Updated Jan 5, 2026

Code for the paper "BPE stays on SCRIPT"

Python 16 3 Updated Jan 5, 2026

Official Rust Implementation of Model2Vec

Rust 146 13 Updated Sep 29, 2025

📚 Freely available programming books

Python 380,008 65,711 Updated Jan 5, 2026

[ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale

Python 264 18 Updated Jul 8, 2025

FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…

105,841 27,966 Updated Jan 4, 2026

Data mapping framework for rust stuff

Rust 42 4 Updated Dec 29, 2025

Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.

Python 160 23 Updated Jun 18, 2024

Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows).

Python 11,652 685 Updated Jan 5, 2026

PyTorch building blocks for the OLMo ecosystem

Python 659 118 Updated Jan 5, 2026

GhoulBoii's Firefox Dots

CSS 7 Updated Jan 15, 2025

OLMost every training recipe you need to perform data interventions with the OLMo family of models.

Python 63 11 Updated Jan 5, 2026

Curated list of datasets and tools for post-training.

4,142 337 Updated Nov 10, 2025

Versatile typeface for code, from code.

JavaScript 21,510 645 Updated Jan 4, 2026

👻 Ghostty is a fast, feature-rich, and cross-platform terminal emulator that uses platform-native UI and GPU acceleration.

Zig 40,728 1,382 Updated Jan 5, 2026

😸 Soothing pastel theme for the high-spirited!

TypeScript 18,127 329 Updated Nov 25, 2025

A more intuitive version of du in rust

Rust 11,032 246 Updated Nov 24, 2025

A curated list of resources and examples of ASCII Art

150 10 Updated Apr 24, 2024

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Python 2,945 227 Updated Jun 19, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 16,589 1,306 Updated Jan 2, 2026

LLM.swift is a simple and readable library that allows you to interact with large language models locally with ease for macOS, iOS, watchOS, tvOS, and visionOS.

C++ 801 101 Updated Dec 6, 2025

Large Language Model (LLM) module for the Spezi Ecosystem

Swift 276 42 Updated Dec 6, 2025

BPE modification that implements removing of the intermediate tokens during tokenizer training.

Python 25 3 Updated Nov 25, 2024

A curated list of awesome model based RL resources (continually updated)

1,268 73 Updated Dec 20, 2025

Dockerized iCloud Client - make a local copy of your iCloud documents and photos, and keep it automatically up-to-date.

Python 1,721 65 Updated Jan 5, 2026

Tools for shrinking fastText models (in gensim format)

Jupyter Notebook 181 13 Updated May 3, 2024
Rust 5 2 Updated May 8, 2025
Next