-
π Iβm currently working on Meituan, focusing on algorithms and engineering related to LLM inference acceleration, familar with model quantization.
-
π» I'm lucky to contribute to some open source projects: SGLang, vLLM, TorchAO, Megatron-DeepSpeed and LightSeq.
-
π I'm proud to build some projects from scratch:
- AutoSmoothQuant: An easy-to-use package for implementing SmoothQuant for LLMs.
- QQQ: QQQ is an innovative and hardware-optimized W4A8 quantization solution for LLMs.
-
π« Contact: [email protected]
-
π Google Scholar: https://scholar.google.com/citations?hl=zh-CN&user=MBR97ZIAAAAJ
-
Notifications
You must be signed in to change notification settings - Fork 0
HandH1998/HandH1998
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Β | Β | |||
Repository files navigation
About
No description, website, or topics provided.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published