MikalaiDrabovich (Mikalai Drabovich)

Pinned

full_train_K2_thinking Public

Training of a thinking model

Python 1
quickreduce Public

Forked from mk1-project/quickreduce

QuickReduce is a performant all-reduce library designed for AMD ROCm that supports inline compression.

C++
TensorScope Public

Easily benchmark training of any model by (op type + parameters) to spot real bottlenecks. Catches more details than native tfprof, especially for RNN/LSTM models.

HTML 1
fast-cityscapes Public

Evaluate a deep neural network on the Cityscapes dataset

Python
DeepSpeed Public

Forked from deepspeedai/DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

Python 1
Kimi-Linear Public

Forked from MoonshotAI/Kimi-Linear

Up to 6x decoding throughput improvement for context as long as 1M tokens, reduction of KV cache by up to 75%

1