taomiao (TaoMiao)

Pinned Loading

cuBERT cuBERT Public

Forked from zhihu/cuBERT

Fast implementation of BERT inference directly on NVIDIA (CUDA, CUBLAS) and Intel MKL

C++
mini-c mini-c Public

Forked from Fedjmike/mini-c

Dr Strangehack, or: how to write a self-hosting C compiler in 10 hours

C
pytorch/pytorch pytorch/pytorch Public

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 98.6k 27.3k
apache/tvm apache/tvm Public

Open Machine Learning Compiler Framework

Python 13.2k 3.8k
triton-inference-server/server triton-inference-server/server Public

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 10.5k 1.7k
ShannonAI/service-streamer ShannonAI/service-streamer Public

Boosting your Web Services of Deep Learning Applications.

Python 1.2k 187