Skip to main content

Questions tagged [action-recognition]

For questions regarding action recognition. This should be used when asking about what could be implemented that complements or harms this.

0 votes
0 answers
35 views

I'm designing a computer vision system to detect suspicious customer behavior in a store, for example: unusual body movements near a cashier or shelf sudden hiding motions, loitering for too long in ...
Aalzubair's user avatar
2 votes
1 answer
62 views

I am working on a video processing pipeline where the frames have a width that is much larger than the height (wide aspect ratio). My main goal is to apply action recognition on human-object ...
Renat Abdrakhmanov's user avatar
0 votes
1 answer
56 views

I am working on an action recognition task involving human-object interactions using an I3D (3D CNN-based) model. The model was trained on pre-recorded videos, and it performed well during evaluation. ...
Renat Abdrakhmanov's user avatar
0 votes
1 answer
179 views

I'm currently doing some researches on video recognition. What I'm trying to do is like this paper. The idea is that: for processing a specific input video clip (shape: [T, C, H, W]), it needs ...
Henry's user avatar
  • 1
0 votes
1 answer
155 views

There are 8 distinct action classes and around 50+ videos per class. I was wondering if flipping videos from the training set can be a good option to generate additional data. Is it?
Uday Sai's user avatar
2 votes
1 answer
63 views

Let's say that you want to detect if a man is running, walking, or dancing instead of just detecting a man still. What type of neural networks will you use for this purpose?
S. Feunmajer's user avatar
1 vote
0 answers
192 views

I aim to do action recognition in videos on a private dataset. To compare with the existing state-of-the-art implementations, other guys published their code on Github, like the one here (for the ...
krishna chaitanya's user avatar
1 vote
0 answers
312 views

For action recognition or similar tasks, one can either use 3D CNN or combine 2D CNN with optical flow. See this paper for details. Can someone tell the pros/cons of each, in terms of accuracy, cost ...
user984260's user avatar
3 votes
1 answer
227 views

I need to solve a video classification problem. While looking for solutions, I only found solutions that transform this problem into a series of simpler image classification tasks. However, this ...
אבנר יעקב's user avatar
1 vote
1 answer
457 views

I need some explanation about the following paragraph (page 3) from the paper A Novel Approach for Robust Multi Human Action Detection and Recognition based on 3-Dimentional Convolutional Neural ...
Ahmed Hanan's user avatar
2 votes
2 answers
187 views

The Planning Domain Definition Language (PDDL) is known for its capabilities of symbolic planning in the state space. A solver will find a sequence of steps to bring the system from a start state to ...
user avatar
3 votes
1 answer
156 views

I am going to train a deep learning model to classify hand gestures in video. Since the person will be taking up nearly the entire width/height of the video and I will be classifying what hand gesture ...
Amaan's user avatar
  • 131
4 votes
1 answer
154 views

The ability to recognize an object with particular identifying features from single or multiple camera shoots with the temporal dimension digitized as frames has been shown. The proof is that the ...
Douglas Daseeco's user avatar
3 votes
2 answers
802 views

If we look at state of the art accuracy on the UCF101 data set, it is around 93% whereas for the HMDB51 data set it is around 66%. I looked at both the data sets and both contain videos of similar ...
ksh's user avatar
  • 41
4 votes
1 answer
1k views

Inspired by this discussion about recognizing human actions, I have found the Fall-Detection project which detects humans falling on the ground from a CCTV camera feed, and which can consider alerting ...
kenorb's user avatar
  • 10.5k

15 30 50 per page