Name	Name	Last commit message	Last commit date
Latest commit History 21 Commits
data_collected_samples	data_collected_samples
images	images
trained_model	trained_model
.DS_Store	.DS_Store
README.md	README.md

Robotic Inference

abstract

working on a supplied data from a camera fixed above a conveyor belt and a collected data using a mobile camera, first using the supplied data with Nvidia DIGITS workflow to train a model by tuning the hyperparameters and choosing the best network to achieve a specific inference time and accuracy, second using the collected data for an inference idea and training a model using Nvidia DIGITS to deploy this model on an embedded system like Jetson TX2 board.

Introduction

the robotic kitchen is now an interesting part of robotics which full of challenges, due to the routine of life nowadays we are very busy because of the tight schedules, so it's difficult to take care of our food, people tend to approach fast-food instead of preparing healthy meals at home which leads to severe diseases.

nowadays there are some cooking robots which enable us to cook the food.

they are fully automated robots, so they have to capture the world around them using several kinds of sensors and cameras. in turn classifying images around the robot and determining each kitchenware is a core element of the robot perception process, using Nvidia DIGITS workflow with a collected photos of spoons or forks or even no thing to train a network for the classification processes.

Background / Formulation

several types of DNN’s have been developed on the ImageNet benchmark dataset like AlexNet, VGGNet, ResNet, Inception, GoogleNet and their many variations.

The increased accuracy is the result of breakthroughs in design and optimization, but comes at a cost when computation resources are considered.

The following table provides a sampling of the results (values are approximated from graphs in the paper), including a derived metric called information density. The information density is a measure of the efficiency of the network, or how much accuracy is provided for every one million parameters that the network requires.

Note that only the results based on a batch size of one are included. In most cases, the batch size provides a speedup in inference time but maintains the same relative performance among architectures. However, an exception is AlexNet, which sees a 3x speedup when going from 1 to 64 images per batch due to weak optimization of its fully connected layer.

for the supplied data collected using Jetson TX2 camera above a conveyor belt AlexNet gave a good results in this case however the other DNN’s may give more accurate results.

for the collected data GoogleNet gaved more accurate results than AlexNet using 0.001 learning rate.

Data Acquisition

first the Supplied Data

for the supplied data there are 7570 image for the following 3 classes:

Bottle
Candy Box
Nothing

the data looks like this photo below :

first by opening the DIGITS workspace we should see something like this

second the Collected Data

Supplied Data Model

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Robotic Inference

abstract

Introduction

Background / Formulation

Data Acquisition

first the Supplied Data

second the Collected Data

About

Uh oh!

Releases

Packages

mohamedsayedantar/Robotic-Inference

Folders and files

Latest commit

History

Repository files navigation

Robotic Inference

abstract

Introduction

Background / Formulation

Data Acquisition

first the Supplied Data

second the Collected Data

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages