Skip to content
View daphnei's full-sized avatar

Block or report daphnei

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Calculate perplexity on a text with pre-trained language models. Support MLM (eg. DeBERTa), recurrent LM (eg. GPT3), and encoder-decoder LM (eg. Flan-T5).

Python 161 15 Updated Jun 20, 2025

A collection of small corpuses of interesting data for the creation of bots and similar stuff.

JavaScript 5,041 1,299 Updated Oct 6, 2025

Crawl BookCorpus

Python 847 109 Updated Jul 14, 2023

Easily fine tune GPT-2 to fill in missing text

Python 201 45 Updated Dec 8, 2022

Front-end for ChatEval platform. Written using React, Next.js and ES6 JavaScript.

JavaScript 2 3 Updated May 2, 2025

Python Implementation of HLL-tailcut with 4-bit buckets and MLE estimator

Python 1 Updated May 17, 2019

Public evaluation tool for non task driven neural open domain chatbots

Python 5 Updated May 4, 2022

A fast, efficient universal vector embedding utility package.

Python 1,651 122 Updated Aug 3, 2023

This repo contains code to process a once live-streamed video and annotate it with Tweets.

Jupyter Notebook 1 Updated May 22, 2018

Deep network that performs spectral clustering

Python 328 103 Updated Feb 23, 2023

Cluster paraphrases by word sense

Python 12 7 Updated Jan 3, 2019

LogLog space version of MinHash by combining ideas from HyperLogLog and b-bit MinHash

Python 57 8 Updated Feb 19, 2020

simple python3 module for scraping google images across many languages, based on input dictionaries

Python 9 5 Updated Nov 21, 2019
Python 1 Updated Sep 29, 2017