BS::thread_pool: a fast, lightweight, modern, and easy-to-use C++17 / C++20 / C++23 thread pool library

C++ 2,700 294 Updated Dec 20, 2024

Data structures and algorithms implemented in C++

C++ 7 Updated Jul 25, 2023

Assembler for NVIDIA Volta and Turing GPUs

Python 231 40 Updated Jan 13, 2022

GVProf: A Value Profiler for GPU-based Clusters

Python 52 10 Updated Mar 24, 2024

Parses the linkmap file of an iOS project, making it easy to analyze how much of the package size each module takes up

C++ 126 19 Updated May 28, 2021

row-major matmul optimization

C++ 682 94 Updated Aug 20, 2025

Dilated Convolution for Semantic Image Segmentation

Python 785 266 Updated Apr 1, 2018

Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Neural Networks

MATLAB 2,864 989 Updated Oct 11, 2022

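PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (core framework of 『飞桨』 (PaddlePaddle): high-performance single-machine and distributed training and cross-platform deployment for deep learning and machine learning)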

C++ 23,321 5,853 Updated Oct 21, 2025

Caffe: a fast open framework for deep learning.

C++ 4,798 1,670 Updated Apr 21, 2023

Ansible role to install nvidia-docker

6 2 Updated Mar 13, 2017