Skip to content
View kssteven418's full-sized avatar

Block or report kssteven418

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. SqueezeAILab/LLMCompiler SqueezeAILab/LLMCompiler Public

    [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling

    Python 1.8k 128

  2. SqueezeAILab/SqueezeLLM SqueezeAILab/SqueezeLLM Public

    [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

    Python 715 49

  3. Squeezeformer Squeezeformer Public

    [NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition

    Python 264 19

  4. I-BERT I-BERT Public

    [ICML'21 Oral] I-BERT: Integer-only BERT Quantization

    Python 266 42

  5. LTP LTP Public

    [KDD'22] Learned Token Pruning for Transformers

    Python 98 19

  6. BigLittleDecoder BigLittleDecoder Public

    [NeurIPS'23] Speculative Decoding with Big Little Decoder

    Python 96 12