Stars
Ongoing research training transformer models at scale
An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.
Generate high-definition short videos with one click using AI large language models.
A pipeline parallel training script for diffusion models.
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and codes for the sole cost of electricity. 🔔 Official updates only via twitter @Martin9…
Fully Open Framework for Democratized Multimodal Training
PPOCRLabelv2 is a semi-automatic graphic annotation tool for the OCR field, with a built-in PP-OCR model to automatically detect and re-recognize data.
Demonstration of running a native LLM on an Android device.
Run GroundingDINO inference with TensorRT, for a more than 3x inference speedup!
Effortless data labeling with AI support from Segment Anything and other awesome models.
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
A simple yet powerful agent framework that delivers with open-source models
This is a third-party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models (CVPR 2024 Highlight)
A fast JSON parser/generator for C++ with both SAX/DOM style API
TensorRT-based CRNN with dynamic input batch size; the ONNX model exported from PyTorch 1.6 with opset 11 works directly with TensorRT 7.1.3.4 but does not work with TensorRT 6
GUI for marking bounding boxes of objects in images for training YOLO v3 and v2 neural networks
Convert the CIFAR-10 dataset from bin to PNG or JPG