kiminh

Ramsey kiminh

127 followers · 2.7k following

Starred repositories

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,423 165 Updated Mar 20, 2025

weiruichen01 / distilling-the-essence

Distilling the Essence: Efficient Reasoning Distillation via Sequence Truncation

Python 1 Updated Dec 26, 2025

stepfun-ai / StepDeepResearch

Step-DeepResearch

Python 250 7 Updated Dec 25, 2025

WenbinZhu / TritonMQ

A Real-Time Fault-tolerant In-Memory Distributed Message Queue

Java 9 2 Updated Jun 25, 2017

aielte-research / LlamBERT

LlamBERT implements a hybrid approach approach for text classification that leverages LLMs to annotate a small subset of large, unlabeled databases and uses the results for fine-tuning transformer …

Python 23 6 Updated Nov 2, 2024

NLPOptimize / flash-tokenizer

EFFICIENT AND OPTIMIZED TOKENIZER ENGINE FOR LLM INFERENCE SERVING

C++ 474 7 Updated Sep 19, 2025

VisualAIKHU / SRF

Official Repository for "See, Rank and Filter: Important Word-Aware Clip Filtering via Scene Understanding for Moment Retrieval and Highlight Detection" (AAAI 2026 Oral)

1 Updated Dec 19, 2025

MarkusSagen / Estimating-Uncertainty-in-Deep-Learning---project-2019

Python 1 1 Updated Nov 21, 2022

GauravBh1010tt / RewardRank

Counter-factual reward ranking

Python 5 Updated Oct 22, 2025

MobileTeleSystems / Ambrosia

Ambrosia is a Python library for A/B tests design, split and result measurement

Python 239 19 Updated Oct 24, 2023

VikhrModels / effective_llm_alignment

Effective LLM Alignment Toolkit

Python 151 10 Updated Jun 25, 2025

MobileTeleSystems / logs

A lightweight, high-performance microservice for forwarding browser-side logs to server-side log aggregation systems (ELK, Loki, Splunk, etc.).

TypeScript 7 1 Updated Dec 23, 2025

Zhuofeng-Li / Qwen-Agent

Forked from QwenLM/Qwen-Agent

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 1 Updated Dec 23, 2025

otmhi / fopo

Source code for the paper "Fast Offline Policy Optimization for Large Scale Recommendation" published at the Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI-23).

Python 3 Updated Jun 27, 2023

otmhi / Reward-Optimizing-Reco

Materials for the "Reward Optimising Recommendation using Deep Learning and Fast Maximum Inner Product Search" tutorial delivered at the 28th SIGKDD Conference on Knowledge Discovery and Data Minin…

Jupyter Notebook 6 2 Updated Sep 19, 2022