Skip to content
View cpenguf's full-sized avatar

Block or report cpenguf

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 70 8 Updated Jul 30, 2025

[Medical_NLP ➟ Awesome-AI4Med] medical-related LLMs, Multimodal systems, Datasets, Benchmarks, and more.

2,488 434 Updated Jan 14, 2026

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 5,052 401 Updated Jan 16, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 65,894 8,006 Updated Jan 17, 2026

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Python 849 61 Updated Aug 5, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 18,417 3,046 Updated Jan 16, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 8,803 851 Updated Jan 8, 2026

A one stop repository for generative AI research updates, interview resources, notebooks and much more!

HTML 23,868 5,151 Updated Jan 13, 2026

Curated list of data science interview questions and answers

5,378 1,223 Updated Sep 29, 2024

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 5,739 316 Updated Jan 16, 2026

A benchmark for few-shot evaluation of foundation models for electronic health records (EHRs)

Jupyter Notebook 209 28 Updated Jun 6, 2025

✨✨Latest Advances on Multimodal Large Language Models

17,197 1,103 Updated Dec 26, 2025

Multimodal Question Answering in the Medical Domain: A summary of Existing Datasets and Systems

312 36 Updated Oct 17, 2023

MIMIC Code Repository: Code shared by the research community for the MIMIC family of databases

Jupyter Notebook 3,104 1,654 Updated Nov 10, 2025

Tools for curating biomedical training data for large-scale language modeling

Python 488 118 Updated Dec 9, 2024

Efficient Retrieval Augmentation and Generation Framework

Python 1,758 166 Updated Jan 12, 2026

Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.

Python 868 106 Updated Oct 25, 2024

MedType: Improving Medical Entity Linking with Semantic Type Prediction

Python 114 11 Updated Feb 10, 2023

[Arxiv-2024] CheXagent: Towards a Foundation Model for Chest X-Ray Interpretation

Python 210 25 Updated Jan 7, 2025

Meditron is a suite of open-source medical Large Language Models (LLMs).

Python 2,134 205 Updated Apr 10, 2024

MedAlign is a clinician-generated dataset for instruction following with electronic medical records.

97 9 Updated May 17, 2025

A quick guide (especially) for trending instruction finetuning datasets

3,340 228 Updated Nov 28, 2023

This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as Flan-T5.

Python 357 38 Updated Jul 4, 2023

Approaching Clinical NER as a MRC problem

Python 11 1 Updated Apr 4, 2024
Python 25 6 Updated Aug 2, 2024

Papers and online resources related to machine learning fairness

75 6 Updated May 11, 2023

a library for named entity recognition developed by UF HOBI NLP lab featuring SOTA algorithms

Python 154 29 Updated Sep 13, 2023

A simple and working implementation of Electra, the fastest way to pretrain language models from scratch, in Pytorch

Python 235 46 Updated Jun 12, 2023

BertViz: Visualize Attention in Transformer Models

Python 7,876 859 Updated Jan 8, 2026
Next