Skip to content
View jan-karsten-kuhnke's full-sized avatar

Block or report jan-karsten-kuhnke

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. DATASET_CREATION-Evolved_Self_Intruct-evol-dataset DATASET_CREATION-Evolved_Self_Intruct-evol-dataset Public

    Forked from theblackcat102/evol-dataset

    evol augment any dataset online

    Python

  2. QUANTIZATION-AWQ-AutoAWQ QUANTIZATION-AWQ-AutoAWQ Public

    Forked from casper-hansen/AutoAWQ

    AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.

    C++ 1

  3. QUANTIZATION-GGUF-llama.cpp QUANTIZATION-GGUF-llama.cpp Public

    Forked from ggml-org/llama.cpp

    Port of Facebook's LLaMA model in C/C++

    C 1

  4. QUANTIZATION-GPTQ-exllama QUANTIZATION-GPTQ-exllama Public

    Forked from turboderp/exllama

    A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

    Python

  5. QUANTIZATION-GPTQ-AutoGPTQ QUANTIZATION-GPTQ-AutoGPTQ Public

    Forked from AutoGPTQ/AutoGPTQ

    An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

    Python

  6. QUANTIZATION-LORA-QLORA-bitsandbytes QUANTIZATION-LORA-QLORA-bitsandbytes Public

    Forked from bitsandbytes-foundation/bitsandbytes

    8-bit CUDA functions for PyTorch

    Python