Skip to content
View taomiao's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report taomiao

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. cuBERT cuBERT Public

    Forked from zhihu/cuBERT

    Fast implementation of BERT inference directly on NVIDIA (CUDA, CUBLAS) and Intel MKL

    C++

  2. mini-c mini-c Public

    Forked from Fedjmike/mini-c

    Dr Strangehack, or: how to write a self-hosting C compiler in 10 hours

    C

  3. pytorch/pytorch pytorch/pytorch Public

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    Python 98.6k 27.3k

  4. apache/tvm apache/tvm Public

    Open Machine Learning Compiler Framework

    Python 13.2k 3.8k

  5. triton-inference-server/server triton-inference-server/server Public

    The Triton Inference Server provides an optimized cloud and edge inferencing solution.

    Python 10.5k 1.7k

  6. ShannonAI/service-streamer ShannonAI/service-streamer Public

    Boosting your Web Services of Deep Learning Applications.

    Python 1.2k 187