Skip to content
View Skunchala's full-sized avatar

Block or report Skunchala

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Transformer related optimization, including BERT, GPT

C++ 1 Updated Jul 8, 2022

TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.

C++ 1 Updated Jun 24, 2022

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 1 Updated Jul 13, 2022

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Python 1 Updated Jul 13, 2022

Tensors and Dynamic neural networks in Python with strong GPU acceleration

C++ 1 Updated Jul 13, 2022

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,870 4,761 Updated Mar 22, 2026