Skip to content
View ftian1's full-sized avatar
  • Intel
  • Shanghai
  • 09:02 (UTC +08:00)

Organizations

@tianocore

Block or report ftian1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An innovative library for efficient LLM inference via low-bit quantization

C++ 351 39 Updated Aug 30, 2024

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

Python 2,560 290 Updated Dec 29, 2025

Faster R-CNN (Python implementation) -- see https://github.com/ShaoqingRen/faster_rcnn for the official MATLAB version

Python 8,282 4,102 Updated Nov 7, 2019

A python script that automatise the training of a CNN, compress it through tensorflow (or ristretto) plugin, and compares the performance of the two networks

Python 27 8 Updated Dec 8, 2022

Ristretto: Caffe-based approximation of convolutional neural networks.

C++ 289 59 Updated Jul 10, 2019