Skip to content
View scarletcho's full-sized avatar
🍐
🍐
  • University of Texas at Austin
  • Austin, TX

Block or report scarletcho

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 84,527 9,566 Updated Nov 25, 2025

For our EMNLP 2020 paper “Are ‘Undocumented Workers’ the Same as ‘Illegal Aliens’? Disentangling Denotation and Connotation in Vector Spaces”.

Python 12 Updated Dec 4, 2020

Intrinsic Evaluation of pre-trained word embeddings, using large Word Association Dataset: SWOW (Small World of Words)

Jupyter Notebook 11 Updated Feb 28, 2024

A Github repository containing the LWOW project.

Python 13 2 Updated Oct 3, 2025
Python 6 Updated Sep 8, 2021

[CoNLL'21] MirrorWiC: On Eliciting Word-in-Context Representationsfrom Pretrained Language Models

Python 12 5 Updated Oct 31, 2021

Sparse and discrete interpretability tool for neural networks

Python 64 5 Updated Feb 12, 2024

Stanford NLP Python library for understanding and improving PyTorch models via interventions

Python 833 93 Updated Oct 13, 2025

Stanford NLP Python library for Representation Finetuning (ReFT)

Python 1,535 130 Updated Feb 6, 2025

Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.

11,653 1,909 Updated Aug 31, 2023

Learning to Describe Unknown Phrases with Local and Global Contexts

Python 21 1 Updated Jun 21, 2022

Interpretable Word Sense Representations via Definition Generation

Python 9 2 Updated Mar 6, 2025

Simple, unified interface to multiple Generative AI providers

Python 12,821 1,307 Updated Nov 11, 2025

Code relating to evaluation of models of compositional sentence semantics.

Jupyter Notebook 3 2 Updated Oct 17, 2024

data and scripts for the shared task "Task 1: Paraphrase and Semantic Similarity in Twitter (PIT)" at SemEval 2015

Python 43 11 Updated Nov 10, 2020

NeuralTalk is a Python+numpy project for learning Multimodal Recurrent Neural Networks that describe images with sentences.

Python 5,451 1,328 Updated Dec 22, 2020

ACL 2024 - Linguistically Conditioned Semantic Textual Similarity

5 Updated Sep 5, 2024

Data, codebook, and models to automatically detect storytelling.

Jupyter Notebook 26 1 Updated Apr 23, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,032 2,665 Updated Aug 12, 2024
15 Updated Jul 20, 2023

A set of media framing annotations, along with scripts for obtaining the corresponding news articles

Python 54 9 Updated Jun 11, 2019

[EMNLP 2023] C-STS: Conditional Semantic Textual Similarity

Python 73 7 Updated May 23, 2024

Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO

Python 54 3 Updated Sep 3, 2020

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 156,530 13,739 Updated Nov 22, 2025

Utilities intended for use with Llama models.

Python 7,352 1,276 Updated Oct 10, 2025

Agentic components of the Llama Stack APIs

4,281 638 Updated Aug 5, 2025

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,230 4,034 Updated Jul 17, 2024

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Python 2,007 225 Updated Oct 16, 2025

Resources & scripts for the paper "MTEB: Massive Text Embedding Benchmark"

Python 18 4 Updated Sep 22, 2024
Next