Efficient in-memory representation for ONNX, in Python
- Updated
Dec 1, 2025 - Python
Efficient in-memory representation for ONNX, in Python
Code of the ICASSP 2022 paper "Gradient Variance Loss for Structure Enhanced Super-Resolution"
TinyML & Edge AI: On-device inference, model quantization, embedded ML, ultra-low-power AI for microcontrollers and IoT devices.
本仓库包含了完整的深度学习应用开发流程,以经典的手写字符识别为例,基于LeNet网络构建。推理部分使用torch、onnxruntime以及openvino框架💖
Vision-lanugage model example code.
ptdeco is a library for model optimization by matrix decomposition built on top of PyTorch
DA2Lite is an automated model compression toolkit for PyTorch.
Minimal Reproducibility Study of (https://arxiv.org/abs/1911.05248). Experiments with Compression of Deep Neural Networks
compares different pretrained object classification with per-layer and per-channel quantization using pytorch
40x faster AI inference: ONNX to TensorRT optimization with FP16/INT8 quantization, multi-GPU support, and deployment
This project is built to detect spam messages using a Long Short-Term Memory (LSTM) model combined with Word2Vec as the word embedding technique. The model has been optimized using Grid Search, achieving a best accuracy of 95.65%.
Mobile AI: iOS CoreML, Android TFLite, on-device inference, ONNX, TensorRT, and ML deployment for smartphones.
Arbitrary Numbers
Computer vision project that classifies 101 food categories with 80.2% accuracy using fine-tuned EfficientNetB2 and PyTorch. Features interactive Gradio UI, optimized inference (~100ms/image), and strategic training on 20% of Food101 dataset for efficient resource utilization.
Quantization for Object Detection in Tensorflow 2.x
Công cụ giảm kích thước mô hình bằng Quantization, kết hợp AI Agent để tự động chọn mức tối ưu, giúp tăng tốc và tiết kiệm chi phí inference.
Heart disease classification using machine learning algorithms with hyperparameter tuning for optimized model performance. Algorithms include XGBoost, Random Forest, Logistic Regression, and moreto find the best model for accurate heart disease prediction.
YOLOv3-YOLO12 unified pipeline for edge deployment - Detection, segmentation, pose estimation with PyTorch to ONNX/TFLite/CoreML export
ai-zipper offers numerous AI model compression methods, also it is easy to embed into your own source code
Add a description, image, and links to the model-optimization topic page so that developers can more easily learn about it.
To associate your repository with the model-optimization topic, visit your repo's landing page and select "manage topics."