-
MIT
- Boston, MA
Stars
[NeurIPS'24 Spotlight] Observational Scaling Laws
A framework for collecting a large human-sourced chain-of-thoughts dataset
[T-PAMI 2025] V3D: Video Diffusion Models are Effective 3D Generators
GPU programming related news and material links
A specification that python filesystems should adhere to.
A natural language interface for computers
Probabilistic programming with large language models
Machine Learning Engineering Open Book
A Data Streaming Library for Efficient Neural Network Training
Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.
A command-line productivity tool powered by AI large language models like GPT-4, will help you accomplish your tasks faster and more efficiently.
Matplotlib styles for scientific plotting
A framework for few-shot evaluation of language models.
Figure sizes, font sizes, fonts, and more configurations at minimal overhead. Fix your journal papers, conference proceedings, and other scientific publications.
Web interface for browsing, search and filtering recent arxiv submissions
Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate nearest neighbors, in Pytorch
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
An implementation of masked language modeling for Pytorch, made as concise and simple as possible
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Retrieve information from DBLP and update BibTex files automatically
🚪✊Knock Knock: Get notified when your training ends with only two additional lines of code
Code for ACL 2020 paper: "Extractive Summarization as Text Matching"
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch