Stars
BiomedParse: A Foundation Model for Joint Segmentation, Detection, and Recognition of Biomedical Objects Across Nine Modalities
Implementation of the models described in "Cancer detection in breast MRI screening via explainable artificial intelligence anomaly detection"
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
🏆1st place in the PANORAMA challenge (early detection of PDAC on contrast-enhanced CT)
End-to-end Generative Optimization for AI Agents
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
OpenMMLab Detection Toolbox and Benchmark
[MICCAI 2023] MedNeXt is a fully ConvNeXt architecture for 3D medical image segmentation.
[NeurIPS 2024] Touchstone - Benchmarking AI on 5,172 o.o.d. CT volumes and 9 anatomical structures
AI-powered financial analysis and investment recommendation system using multi-agent AI orchestration.
Fully automated end to end framework to extract data from complex charts and other figures in scientific literature.
An open-source project dedicated to build foundational large language model for natural science, mainly in physics, chemistry and material science.
[ICLR 2024 Oral] Supervised Pre-Trained 3D Models for Medical Image Analysis (9,262 CT volumes + 25 annotated classes)
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Training LLMs with QLoRA + FSDP
[npj Digital Medicine] The official repository for "Large-Vocabulary Segmentation for Medical Images with Text Prompts"
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
The largest pre-trained medical image segmentation model (1.4B parameters) based on the largest public dataset (>100k annotations), up until April 2023.
[ICCV 2023] CLIP-Driven Universal Model; Rank first in MSD Competition.
BankNote-Net: Open dataset and encoder model for assistive currency recognition