Pinned
- pytorch_tzw (Public, forked from pytorch/pytorch): Tensors and dynamic neural networks in Python with strong GPU acceleration. Language: Python.
- Megatron-DeepSpeed (Public, forked from deepspeedai/Megatron-DeepSpeed): Ongoing research training transformer language models at scale, including BERT and GPT-2. Language: Python.
- xDiT (Public, forked from xdit-project/xDiT): A scalable inference engine for Diffusion Transformers (DiTs) on multi-GPU clusters. Language: Python.
- NVIDIA/Megatron-LM (Public): Ongoing research training transformer models at scale.