-
Tsinghua University
- Beijing
-
14:05
(UTC -12:00)
Lists (12)
Sort Name ascending (A-Z)
📑 benchmarks etc.
🧩Categorization
datasets and implementations of algorithmsCourses
🏠 Homepage
🎰 Judgement & Decision Making
Datasets, open-source codes and bench marksLLM
🎶Musica!
📖 NLP
Starred repositories
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
freephdlabor: customizing personalized multiagent systems that researchs 24/7 on your own scientific problem
🤗 smolagents: a barebones library for agents that think in code.
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
Ready-to-use code and tutorial notebooks to boost your way into few-shot learning for image classification.
A beautiful, simple, clean, and responsive Jekyll theme for academics
RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)
《动手学大模型Dive into LLMs》系列编程实践教程
[ICLR 2025] <MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses>
This repository is a mirror. If you want to raise an issue or contact us, we encourage you to do it on Gitlab (https://gitlab.com/agrumery/aGrUM).
Learning materials for UCB CS169 : software engineering
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
Utility functions for handling MIDI data in a nice/intuitive way.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A simple, easy-to-hack GraphRAG implementation
An AI personal tutor built with Llama 3.1
SuperEasy 100% Local RAG with Ollama + Email RAG
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
The official repo for paper, LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods.
Train transformer language models with reinforcement learning.