Simbotic Torch

Real-time computer vision pipeline for GStreamer; a PyTorch-powered GStreamer plugin developed with Rust. Includes custom 3D engine, accelerated ML models, accelerated capture and transform pipelines.

LibTorch plugin for GStreamer in Rust

Not only are the networks CUDA-enabled, but the pipeline has also been accelerated with CUDA tensors.

Source for:

Dependencies

SimboticTorch has been tested on Ubuntu 18/20.

Rust

Works with latest stable Rust

CUDA 11.4 + cuDNN 8.2.2

Make sure you have CUDA 11.4 installed in your system with cuDNN 8.2.2.

Download cuDNN v8.2.2 (July 6th, 2021), for CUDA 11.4:

cuDNN Runtime Library for Ubuntu20.04 (Deb)
cuDNN Developer Library for Ubuntu20.04 (Deb)

NOTE: On Ubuntu 20, there might be an issue with missing libnvrtc-builtins.so.11.1. A symlink solves it:

vertex@vx-pc:/usr/local/cuda/targets/x86_64-linux/lib$ ll libnvrtc-builtins* lrwxrwxrwx 1 root root 25 Jul 15 12:10 libnvrtc-builtins.so -> libnvrtc-builtins.so.11.4 lrwxrwxrwx 1 root root 25 Aug 12 00:01 libnvrtc-builtins.so.11.1 -> libnvrtc-builtins.so.11.4 lrwxrwxrwx 1 root root 29 Jul 15 12:10 libnvrtc-builtins.so.11.4 -> libnvrtc-builtins.so.11.4.100 -rw-r--r-- 1 root root 6883208 Jul 15 12:10 libnvrtc-builtins.so.11.4.100 sudo ln -s /usr/local/cuda/targets/x86_64-linux/lib/libnvrtc-builtins.so.11.4 /usr/local/cuda/targets/x86_64-linux/lib/libnvrtc-builtins.so.11.1

LibTorch 1.11.0

Depends on CUDA-enabled (works with CUDA 11.4) LibTorch:

Get libtorch from the
- PyTorch website download section
- Specifically cxxx11 ABI
Set env $LIBTORCH

GStreamer

Depends on GStreamer development libraries:

apt install libgstreamer1.0-dev libgstreamer-plugins-base1.0-dev \ gstreamer1.0-plugins-base gstreamer1.0-plugins-good \ gstreamer1.0-plugins-bad gstreamer1.0-plugins-ugly \ gstreamer1.0-libav libgstrtspserver-1.0-dev

Misc dependencies

SimboticTorch now includes a 3D rendering engine, and has the following dependencies:

apt install glslang-tools

Others:

apt install libssl-dev apt install libx11-dev apt install gnome-video-effects-frei0r

Build

Git LFS

This repo uses Git LFS for models and assets. Make sure git lfs command is properly installed.

Environment variable:

An environment variable needs to be set for all scripts and tools to be able to find this plugin.

export SIMBOTIC_TORCH=/full/path/to/this/repo

Building:

To build the rust gst plugin, just type:

./build.sh

Test Monodepth and Segmentation with any of the following:

./test_dashboard_preview.sh ./test_dashboard_webcam.sh ./test_dashboard_file.sh ./test_monodepth_preview.sh ./test_monodepth_webcam.sh ./test_semseg_preview.sh ./test_semseg_webcam.sh

Test Motion Transfer:

./test_motiontransfer_preview.sh ./test_motiontransfer_webcam.sh ./test_motiontransfer_file.sh

Citations

@article{monodepth2, title = {Digging into Self-Supervised Monocular Depth Prediction}, author = {Cl{\'{e}}ment Godard and Oisin {Mac Aodha} and Michael Firman and Gabriel J. Brostow}, booktitle = {The International Conference on Computer Vision (ICCV)}, month = {October}, year = {2019} }

@inproceedings{semantic_cvpr19, author = {Yi Zhu*, Karan Sapra*, Fitsum A. Reda, Kevin J. Shih, Shawn Newsam, Andrew Tao, Bryan Catanzaro}, title = {Improving Semantic Segmentation via Video Propagation and Label Relaxation}, booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)}, month = {June}, year = {2019}, url = {https://nv-adlr.github.io/publication/2018-Segmentation} } * indicates equal contribution @inproceedings{reda2018sdc, title={SDC-Net: Video prediction using spatially-displaced convolution}, author={Reda, Fitsum A and Liu, Guilin and Shih, Kevin J and Kirby, Robert and Barker, Jon and Tarjan, David and Tao, Andrew and Catanzaro, Bryan}, booktitle={Proceedings of the European Conference on Computer Vision (ECCV)}, pages={718--733}, year={2018} }

@InProceedings{Siarohin_2019_NeurIPS, author={Siarohin, Aliaksandr and Lathuilière, Stéphane and Tulyakov, Sergey and Ricci, Elisa and Sebe, Nicu}, title={First Order Motion Model for Image Animation}, booktitle = {Conference on Neural Information Processing Systems (NeurIPS)}, month = {December}, year = {2019} }

@misc{3ddfa_cleardusk, author = {Guo, Jianzhu and Zhu, Xiangyu and Lei, Zhen}, title = {3DDFA}, howpublished = {\url{https://github.com/cleardusk/3DDFA}}, year = {2018} } @article{zhu2017face, title= {Face alignment in full pose range: A 3d total solution}, author= {Zhu, Xiangyu and Liu, Xiaoming and Lei, Zhen and Li, Stan Z}, journal= {IEEE transactions on pattern analysis and machine intelligence}, year= {2017}, publisher= {IEEE} }

@InProceedings{Qin_2020_PR, title = {U2-Net: Going Deeper with Nested U-Structure for Salient Object Detection}, author = {Qin, Xuebin and Zhang, Zichen and Huang, Chenyang and Dehghan, Masood and Zaiane, Osmar and Jagersand, Martin}, journal = {Pattern Recognition}, volume = {106}, pages = {107404}, year = {2020} }

tch-rs - Rust bindings for PyTorch
monodepth2 - Monocular depth estimation from a single image
semantic-segmentation - Improving Semantic Segmentation via Video Propagation and Label Relaxation
First Order Model for Image Animation - First Order Motion Model for Image Animation
3DDFA - Face Alignment in Full Pose Range: A 3D Total Solution
U^2-Net - Object detection based on a visual attention.

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
assets		assets
gst-plugin-version-helper		gst-plugin-version-helper
models		models
src		src
tools		tools
.gitattributes		.gitattributes
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
README.md		README.md
build.rs		build.rs
build.sh		build.sh
test.sh		test.sh
test_dashboard_file.sh		test_dashboard_file.sh
test_dashboard_preview.sh		test_dashboard_preview.sh
test_dashboard_webcam.sh		test_dashboard_webcam.sh
test_facepose_preview.sh		test_facepose_preview.sh
test_facepose_webcam.sh		test_facepose_webcam.sh
test_monodepth_ndi.sh		test_monodepth_ndi.sh
test_monodepth_preview.sh		test_monodepth_preview.sh
test_monodepth_webcam.sh		test_monodepth_webcam.sh
test_motiontransfer_file.sh		test_motiontransfer_file.sh
test_motiontransfer_preview.sh		test_motiontransfer_preview.sh
test_motiontransfer_webcam.sh		test_motiontransfer_webcam.sh
test_obs.sh		test_obs.sh
test_salientobject_preview.sh		test_salientobject_preview.sh
test_semseg_preview.sh		test_semseg_preview.sh
test_semseg_webcam.sh		test_semseg_webcam.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Simbotic Torch

LibTorch plugin for GStreamer in Rust

Dependencies

Rust

CUDA 11.4 + cuDNN 8.2.2

LibTorch 1.11.0

GStreamer

Misc dependencies

Build

Git LFS

Environment variable:

Building:

Test Monodepth and Segmentation with any of the following:

Test Motion Transfer:

Citations

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Simbotic Torch

LibTorch plugin for GStreamer in Rust

Dependencies

Rust

CUDA 11.4 + cuDNN 8.2.2

LibTorch 1.11.0

GStreamer

Misc dependencies

Build

Git LFS

Environment variable:

Building:

Test Monodepth and Segmentation with any of the following:

Test Motion Transfer:

Citations

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages