santapo
🤖
Representation is all you need!

Starred repositories

EHR datasets preprocessing scripts

Jupyter Notebook 10 3 Updated Jan 31, 2024

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python 16,913 1,191 Updated Nov 15, 2025

Tool for generating high quality Synthetic datasets

Python 1,389 195 Updated Oct 28, 2025

Building blocks for foundation models.

573 28 Updated Jan 3, 2024

Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

Python 708 97 Updated Nov 18, 2025

A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).

1,742 117 Updated Nov 12, 2025

Robust machine learning for responsible AI

Python 504 59 Updated Jul 12, 2024

VietConizer: Vietnamese OCR with NVIDIA DALI

Python 15 1 Updated Jul 5, 2025

Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with co…

TypeScript 9,120 783 Updated Nov 18, 2025

📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools lik…

TypeScript 20,238 923 Updated Nov 18, 2025

Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, datasets, evaluations, and analyses.

1,048 90 Updated Nov 8, 2025

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,672 76 Updated Apr 18, 2025

A powerful tool for automated LLM fuzzing. It is designed to help developers and security researchers identify and mitigate potential jailbreaks in their LLM APIs.

Jupyter Notebook 871 110 Updated Jul 13, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,403 1,523 Updated Apr 24, 2025

A curated list of papers related to constrained decoding of LLM, along with their relevant code and resources.

289 12 Updated Oct 15, 2025

Fluent student-teacher redteaming

Jupyter Notebook 23 4 Updated Jul 25, 2024

Leverage Deep Learning to digitize old Vietnamese handwriting for historical document archiving (Made with national pride in every single line of code): https://www.kaggle.com/datasets/quandang/nom…

Jupyter Notebook 133 25 Updated Jun 11, 2024

Japanese OCR

Python 243 31 Updated Aug 7, 2021

A curated list of MLSecOps tools, articles and other resources on security applied to Machine Learning and MLOps systems.

399 63 Updated Aug 1, 2025

Protection against Model Serialization Attacks

Python 602 124 Updated Oct 20, 2025

Approximating neural network loss landscapes in low-dimensional parameter subspaces for PyTorch

Python 347 55 Updated Nov 30, 2023

Huly — All-in-One Project Management Platform (alternative to Linear, Jira, Slack, Notion, Motion)

TypeScript 23,695 1,639 Updated Nov 18, 2025

🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSy…

3,402 346 Updated Jul 25, 2025

Solution for Zindi's TechCabal Ewè Audio Translation Challenge

Python 5 2 Updated Oct 1, 2024

Open source Python library for converting PDF to DOCX.

Python 3,178 460 Updated May 28, 2025

A reading list on LLM based Synthetic Data Generation 🔥

1,463 89 Updated Jun 5, 2025

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 11,356 1,688 Updated Jul 2, 2025

Machine Learning Engineering Open Book

Python 15,773 968 Updated Oct 27, 2025

AI wearables. Put it on, speak, transcribe, automatically

C 7,226 1,237 Updated Nov 18, 2025