- Databricks
- San Francisco
- 05:01 (UTC -12:00)
- in/megha-agarwal95
-
vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
-
sglang Public
Forked from sgl-project/sglang
SGLang is yet another fast serving framework for large language models and vision language models.
Python, Apache License 2.0, Updated Jul 23, 2024
-
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
C++, Apache License 2.0, Updated Mar 27, 2024
-
llm-foundry Public
Forked from mosaicml/llm-foundry
LLM training code for MosaicML foundation models
Python, Apache License 2.0, Updated Feb 23, 2024
-
composer Public
Forked from mosaicml/composer
Train neural networks up to 7x faster
Python, Apache License 2.0, Updated Aug 11, 2023