Python 289 32 Updated Aug 13, 2024

A lightweight, powerful framework for multi-agent workflows

Python 16,580 2,715 Updated Oct 17, 2025

Muon is an optimizer for hidden layers in neural networks

Python 1,882 89 Updated Jul 12, 2025

Benchmarking the Spectrum of Agent Capabilities

Python 479 82 Updated Jan 23, 2024

"AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"

Python 7,654 1,029 Updated Oct 16, 2025
Python 2 Updated Jun 12, 2023

A collection of model counting (#SAT) benchmarks.

Python 6 2 Updated Jan 16, 2020

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 5,619 567 Updated Jan 16, 2025

Fully open reproduction of DeepSeek-R1

Python 25,552 2,395 Updated Sep 8, 2025

PyTorch native post-training library

Python 5,539 678 Updated Oct 17, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 60,347 7,312 Updated Oct 17, 2025

✨✨Latest Advances on Neuro-Symbolic Learning in the era of Large Language Models

170 10 Updated Jun 19, 2025

SatLM: SATisfiability-Aided Language Models using Declarative Prompting (NeurIPS 2023)

Python 50 11 Updated Jul 18, 2024

A toolkit for SAT-based prototyping in Python

Python 437 80 Updated Oct 13, 2025

The glucose SAT solver

C++ 124 21 Updated Jun 11, 2025

A minimalistic and high-performance SAT solver

C++ 1,106 413 Updated Apr 28, 2024

Get started with building Fullstack Agents using Gemini 2.5 and LangGraph

Jupyter Notebook 17,089 2,898 Updated Sep 10, 2025

Prover9 is an automated theorem prover for first-order and equational logic, and Mace4 searches for finite models and counterexamples.

C 53 14 Updated Jan 26, 2024

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 47,069 3,848 Updated Oct 17, 2025

Implementing DeepSeek R1's GRPO algorithm from scratch

Python 1,616 73 Updated Apr 18, 2025

A very simple GRPO implementation for reproducing R1-like LLM thinking.

Python 1,397 109 Updated Aug 5, 2025

[NeurIPS 2025] Thinkless: LLM Learns When to Think

Python 234 18 Updated Sep 26, 2025

LogicBench is a natural language question-answering dataset consisting of 25 different reasoning patterns spanning over propositional, first-order, and non-monotonic logics.

31 3 Updated May 2, 2024
Jupyter Notebook 5 1 Updated Oct 7, 2025

(ACL 2025 Main) Code for MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.01935

Python 175 17 Updated Oct 17, 2025

Framework and Language for Neurosymbolic Programming.

Rust 407 22 Updated May 1, 2025

One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)

Python 1,180 183 Updated Nov 28, 2024

An open collection of methodologies to help with successful training of large language models.

Python 536 44 Updated Feb 15, 2024