Skip to content
Change the repository type filter

All

    Repositories list

    • A benchmark framework for Tensorflow
      Python
      633100Updated Jul 18, 2024Jul 18, 2024
    • Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
      Jupyter Notebook
      2.6k000Updated Jun 20, 2024Jun 20, 2024
    • llama3

      Public
      The official Meta Llama 3 GitHub site
      Python
      3.5k000Updated Jun 19, 2024Jun 19, 2024
    • vllm-rocm

      Public
      vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      11k000Updated Jun 19, 2024Jun 19, 2024
    • A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      11k000Updated Jun 19, 2024Jun 19, 2024
    • Stable Diffusion web UI
      Python
      29k000Updated Jun 11, 2024Jun 11, 2024
    • Stable Diffusion web UI
      Python
      29k000Updated Jun 6, 2024Jun 6, 2024
    • llama.cpp

      Public
      LLM inference in C/C++
      C++
      13k000Updated Jun 6, 2024Jun 6, 2024
    • inference

      Public
      Reference implementations of MLPerf™ inference benchmarks
      Python
      581000Updated May 7, 2024May 7, 2024
    • Jupyter Notebook
      23000Updated May 1, 2024May 1, 2024
    • pytorch

      Public
      Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      26k000Updated Apr 16, 2024Apr 16, 2024
    • A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
      Python
      9.8k000Updated Apr 14, 2024Apr 14, 2024
    • llama

      Public
      Inference code for Llama models
      Python
      9.8k000Updated Apr 10, 2024Apr 10, 2024
    • Reference implementations of MLPerf™ training benchmarks
      Python
      584100Updated Apr 5, 2024Apr 5, 2024
    • expect shell for updating mi210 firmware 8 cards
      Shell
      0000Updated Mar 24, 2024Mar 24, 2024
    • ZLUDA

      Public
      CUDA on AMD GPUs
      Rust
      845000Updated Mar 17, 2024Mar 17, 2024
    • tensorflow based ai benchmark
      Python
      6000Updated Mar 15, 2024Mar 15, 2024
    • DirectML

      Public
      DirectML is a high-performance, hardware-accelerated DirectX 12 library for machine learning. DirectML provides GPU acceleration for common machine learning tasks across a broad range of supported hardware and drivers, including all DirectX 12-capable GPUs from vendors such as AMD, Intel, NVIDIA, and Qualcomm.
      Python
      326000Updated Mar 15, 2024Mar 15, 2024
    • 我的超迷你机械臂机器人项目。
      C
      3k000Updated Mar 14, 2024Mar 14, 2024
    • 49000Updated Mar 7, 2024Mar 7, 2024
    • change script for AMD gpus
      Shell
      35000Updated Feb 20, 2024Feb 20, 2024
    • Jupyter Notebook
      1000Updated Jul 31, 2022Jul 31, 2022
    • Efficient binary-decimal and decimal-binary conversion routines for IEEE doubles.
      C++
      301000Updated May 26, 2019May 26, 2019
    • Firmware

      Public
      PX4 Autopilot Software
      C++
      15k100Updated Aug 29, 2018Aug 29, 2018
    • ardupilot

      Public
      APM Plane, APM Copter, APM Rover source
      C++
      20k000Updated Aug 21, 2015Aug 21, 2015