Skip to content
View brunovilar's full-sized avatar

Block or report brunovilar

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini

JavaScript 23,421 3,570 Updated Nov 9, 2025

Revisiting Pretrarining Objectives for Tabular Deep Learning

Python 65 11 Updated Aug 22, 2022

Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and mo…

Python 3,933 289 Updated Aug 27, 2025

🎓 Um caminho para a educação autodidata em Ciência da Computação!

18,695 1,485 Updated Oct 15, 2025

Beyond Accuracy: Behavioral Testing of NLP models with CheckList

Jupyter Notebook 2,042 209 Updated Jan 9, 2024

Example repo to kickstart integration with mlflow pipelines.

Python 77 64 Updated Nov 14, 2022
TypeScript 835 75 Updated Oct 15, 2025

JupyterLite demo deployed to GitHub Pages 🚀

Jupyter Notebook 411 238 Updated Aug 8, 2025
Jupyter Notebook 367 98 Updated Aug 8, 2024

The data factory for next gen AI

Python 143 68 Updated Oct 28, 2025

The website for the CMU Language Technologies Institute low resource NLP bootcamp 2020

Jupyter Notebook 603 67 Updated Jun 4, 2020

Dict2vec is a framework to learn word embeddings using lexical dictionaries.

Python 115 30 Updated Jan 8, 2021

My PhD thesis with all its source files, including all .tex files and images created, as well as the slides of my defense.

TeX 5 1 Updated Nov 9, 2020

📖 A curated list of resources dedicated to Natural Language Processing (NLP)

17,912 2,712 Updated Sep 13, 2025

Benchmarks of approximate nearest neighbor libraries in Python

Python 5,490 860 Updated Jun 10, 2025

Curated repository of notes from papers I'm reading, mostly NLP related. Updated regularly.

128 29 Updated Apr 19, 2021

Compute Sentence Embeddings Fast!

Jupyter Notebook 623 84 Updated Mar 2, 2023

sentence embedding by Smooth Inverse Frequency weighting scheme

Python 1,087 308 Updated Jul 23, 2019

A collection of modern/faster/saner alternatives to common unix commands.

32,559 816 Updated Sep 10, 2024

A pure python implementation of the Word Mover‘s Embedding Algorithm

Python 6 1 Updated Apr 24, 2021

WordMoversEmbeddings(WME) is a simple code for generating the vector representation of sentence/document for text classification and clustering.

C 83 15 Updated Dec 5, 2018

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

28,500 3,825 Updated Jul 18, 2024
Jupyter Notebook 58 11 Updated May 14, 2024

Best Practices on Recommendation Systems

Python 21,100 3,267 Updated Oct 13, 2025

Roadmap to becoming a data engineer in 2021

12,717 1,363 Updated Jan 25, 2022

Self-Supervised Euphemism Detection and Identification for Content Moderation, IEEE S&P (Oakland) 2021

Python 34 10 Updated Mar 26, 2025

PyTorch implementation for "Matching the Blanks: Distributional Similarity for Relation Learning" paper

Python 604 135 Updated Sep 24, 2023

This Universal Dependencies (UD) Portuguese treebank.

Common Lisp 52 13 Updated Sep 9, 2025

SemClinBr - a multi-institutional and multi-specialty semantically annotated corpus for Portuguese clinical NLP tasks

Java 31 5 Updated Mar 12, 2024
Python 100 19 Updated Feb 25, 2022
Next