Skip to content
View BlBana's full-sized avatar
😀
Duang ~
😀
Duang ~

Block or report BlBana

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning

Python 1,276 76 Updated May 16, 2025
Go 141 10 Updated Dec 26, 2025

💫 Toolkit to help you get started with Spec-Driven Development

Python 58,936 5,147 Updated Dec 4, 2025

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 18,134 1,936 Updated Dec 29, 2025

Fully autonomous AI hacker to find actual exploits in your web apps. Shannon has achieved a 96.15% success rate on the hint-free, source-aware XBOW Benchmark.

JavaScript 3,218 446 Updated Dec 22, 2025

SWE-bench: Can Language Models Resolve Real-world Github Issues?

Python 4,041 724 Updated Dec 18, 2025

The official repository of "A Comprehensive Survey on Reinforcement Learning-based Agentic Search: Foundations, Roles, Optimizations, Evaluations, and Applications".

109 3 Updated Dec 26, 2025

[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents

Python 495 90 Updated Dec 30, 2025

A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows

Python 13,616 1,435 Updated Dec 29, 2025

A Claude Skill to give your agent the ability to use a web browser

TypeScript 2,007 120 Updated Dec 23, 2025

The better playwright MCP: works as a browser extension. No context bloat. More capable.

TypeScript 760 31 Updated Jan 1, 2026

Build resilient language agents as graphs.

Python 22,780 4,003 Updated Dec 31, 2025

A Unified Benchmark and Toolbox for Multimodal Jailbreak Attack–Defense Evaluation

Python 47 1 Updated Dec 18, 2025

A security scanner for your LLM agentic workflows

Python 858 106 Updated Nov 27, 2025

腾讯ai渗透黑客松参赛作品(xjtuHunter)

Python 121 15 Updated Dec 4, 2025

The React Framework

JavaScript 136,867 30,167 Updated Jan 1, 2026

CVE-2025-55182 POC

JavaScript 789 208 Updated Dec 8, 2025

一个用于 AI 驱动的渗透测试竞赛的**模型上下文协议 (MCP)** 服务器。该工 具提供了一个完整的 API 接口,使 LLM 能够自主参与 CTF 挑战。

Go 69 6 Updated Dec 3, 2025

基于kimi-cli二次开发的针对CTF竞赛的专用Agent

Python 37 5 Updated Dec 3, 2025

A powerful tool for automated LLM fuzzing. It is designed to help developers and security researchers identify and mitigate potential jailbreaks in their LLM APIs.

Jupyter Notebook 1,097 152 Updated Nov 30, 2025

Postgres Foreign Data Wrapper development framework in Rust.

Rust 788 86 Updated Dec 22, 2025

Collection of specialized AI subagents for Claude Code for personal use (full-stack development).

1,273 212 Updated Aug 15, 2025

NOVA: The Prompt Pattern Matching

Python 61 9 Updated Oct 22, 2025

Actions for running CodeQL analysis

TypeScript 1,450 430 Updated Dec 22, 2025

Kode is one unit agent for every human & computer task

TypeScript 3,900 604 Updated Dec 29, 2025

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 50,244 3,565 Updated Dec 20, 2025

Playwright MCP server

TypeScript 24,969 2,033 Updated Dec 31, 2025

A research prototype of a human-centered web agent

Python 9,532 964 Updated Dec 18, 2025

A lightweight, powerful framework for multi-agent workflows

Python 18,082 3,028 Updated Dec 31, 2025
Next