-
-
-
hugo-theme-arknights Public
Arknights theme for Hugo framework
-
verl Public
Forked from volcengine/verlverl: Volcano Engine Reinforcement Learning for LLMs
-
mbridge Public
Forked from ISEEKYAN/mbridgeBridge Megatron-Core to Hugging Face/Reinforcement Learning
Python Other UpdatedAug 28, 2025 -
-
verl_megatron_practice Public
Forked from ISEEKYAN/verl_megatron_practice(best/better) practices of megatron on veRL and tuning guide
Shell Apache License 2.0 UpdatedJul 30, 2025 -
-
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
Python Other UpdatedMay 13, 2025 -
GPUStressTest Public
Forked from NVIDIA/GPUStressTestGPU Stress Test is a tool to stress the compute engine of NVIDIA Tesla GPU’s by running a BLAS matrix multiply using different data types. It can be compiled and run on both Linux and Windows.
C++ Other UpdatedApr 15, 2025 -
BambooSimulator Public
simulator for bamboo and other distributed machine learning frameworks
-
Hetaceso Public
Forked from microsoft/SuperScalerAceso adapted to heterogeneous environments
-
-
-
bamboo Public
Forked from uclasystem/bambooBamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances.
Python MIT License UpdatedMar 17, 2025 -
TransformerEngine Public
Forked from NVIDIA/TransformerEngineA library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…
Python Apache License 2.0 UpdatedFeb 19, 2025 -
-
nccl Public
Forked from NVIDIA/ncclOptimized primitives for collective multi-GPU communication
C++ Other UpdatedNov 11, 2024 -
-
-
-
neo4j-python-pandas-py2neo-v3 Public
Forked from MazzaWill/neo4j-python-pandas-py2neo-v3利用pandas将excel中数据抽取,以三元组形式加载到neo4j数据库中构建相关知识图谱
Python UpdatedJun 4, 2024 -
kubeflow-manifests Public
Forked from kubeflow/manifestsA repository for Kustomize manifests
YAML Apache License 2.0 UpdatedMay 20, 2024 -
MIT-Distributed-Lab-2024 Public
Lab for MIT Distributed course (6.824 -> 6.5840)
-
-
-
mlvp Public
Multi-language-based chip Verification Methodology (MLVP)
-
-
-