- Berlin
- https://anapaulagomes.me/
Highlights
- Pro
Lists (23)
Sort Name ascending (A-Z)
AI
Anomaly detection
Civic Tech
Data
Datasets
Embeddings
Feature engineering
Graphs
Hugo
Interviews process
LLM
MacOS
Med
ML
NLP
PhD
Products
Remote
Scrapers
Sustaintability
Synthetic Data
Time series
Visualization
Stars
- All languages
- Astro
- C
- C#
- C++
- CSS
- Clojure
- DIGITAL Command Language
- Dart
- Dockerfile
- EJS
- Elixir
- Elm
- Go
- HTML
- Hack
- Haskell
- JSON
- Java
- JavaScript
- Jupyter Notebook
- Kotlin
- MDX
- Makefile
- Objective-C
- Objective-C++
- PHP
- PLpgSQL
- Perl
- Python
- R
- Rich Text Format
- Roff
- Ruby
- Rust
- SCSS
- SQL
- SVG
- Scala
- Shell
- Stata
- Stylus
- Swift
- TeX
- TypeScript
- Vala
- Vue
- Web Ontology Language
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
CRISP-T: Sense-making from Text and Numbers for Qualitative Research!
Easily generate and modify .docx files with JS/TS with a nice declarative API. Works for Node and on the Browser.
collections for advanced, novel multi-view clustering methods(papers , codes and datasets)
A little Python script to collect LaTeX sources for upload to the arXiv.
A Deep Learning Python Toolkit for Healthcare Applications.
Time series distances: Dynamic Time Warping (fast DTW implementation in C)
A Python implementation of similarity network fusion by Wang et al, 2014 (https://doi.org/10.1038/nmeth.2810)
ST-DBSCAN: Simple and effective tool for spatial-temporal clustering
A Python library for the Docker Engine API
Git Study Cards for devs that might need a refresher about git commands 🗂️
DHTI: a reference architecture for Gen AI in healthcare.
Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata.
Salvador Urban Network Transportation (SUNT): A Landmark Spatiotemporal Dataset for Public Transportation
State-of-the-Art Text Embeddings
🔒 End-to-end encrypted cloud for photos, videos and 2FA secrets.
Teaching statistics with games - lesson plans for teachers
The machine learning toolkit for time series analysis in Python
🦆 A curated list of awesome DuckDB resources
Workshop material for DuckDB workshop at Pydata DE
Richtlinien für Benutzeroberflächen, Code-Beispiele, Designtools und Ressourcen
Tools for curating biomedical training data for large-scale language modeling
✖️MEN - A Modular Toolkit for Cross-Lingual Medical Entity Normalization