Skip to main content

Questions tagged [recurrent-neural-network]

A recurrent neural network (RNN) is a class of artificial neural network where connections between units form a directed cycle.

1 vote
0 answers
24 views

I really want to play around with RNNs. Trying to build an AI assistant with RNNs to run on my machine as I'm always obsessed with RNNs model... To make the performance good, I think I need to do some ...
jupyter's user avatar
  • 111
0 votes
0 answers
49 views

The key equation for an RNN gives us a recurrence relation for the $\frac{\partial h_t}{\partial w_\textrm{h}}$ which are used later to compute the derivative of the loss through a simple chain rule. $...
Link's user avatar
  • 63
1 vote
0 answers
45 views

I do not understand this simple equality $$\frac{\partial h_t}{\partial w_\textrm{h}}= \frac{\partial f(x_{t},h_{t-1},w_\textrm{h})}{\partial w_\textrm{h}} +\frac{\partial f(x_{t},h_{t-1},w_\textrm{h})...
Link's user avatar
  • 63
3 votes
2 answers
125 views

I will use the answer here as an example: https://stats.stackexchange.com/a/370732/78063 It says "which means that you choose a number of time steps $N$, and unroll your network so that it ...
Baron Yugovich's user avatar
1 vote
1 answer
59 views

I have an LSTM model to predict a variable by considering multiple variables. (Say the target variable is river discharge and the predictors are rainfall, temperature, evapotranspiration etc.) There ...
DWijesena's user avatar
4 votes
1 answer
217 views

In standard machine learning settings with cross-sectional data, it's common to assume that data points are independently and identically distributed (i.i.d.) from some fixed data-generating process (...
spie227's user avatar
  • 242
1 vote
0 answers
48 views

I'm reading Murphy's PML and struggling with the derivation of this formula: $$p(\mathbf{y}_{1:T}|\mathbf{x}) = \sum_{\mathbf{h}_{1:T}} p(\mathbf{y}_{1:T}, \mathbf{h}_{1:T}|\mathbf{x}) = \sum_{\mathbf{...
Huseyin Okan Demir's user avatar
0 votes
0 answers
33 views

I have read somewhere (sorry, couldn't find the reference anymore), that $C_t$ (check figure below) represents the "long term" in LSMT. The reason was because "The information goes ...
Murilo's user avatar
  • 179
0 votes
0 answers
132 views

I am working on a time series prediction problem using an LSTM model. My dataset consists of 27 different items, each with unique IDs, and roughly the same number of samples per item. There are around ...
Rai's user avatar
  • 43
0 votes
0 answers
89 views

Consider the following simple gated RNN: \begin{aligned} c_{t} &= \sigma\bigl(W_{c}\,x_{t} + W_{z}\,z_{t-1}\bigr) \\[6pt] z_{t} &= c_{t} \,\odot\, z_{t-1} \;\;+\;\; (1 - c_{t}) \,\odot\,\...
kuzzooroo's user avatar
  • 181
1 vote
1 answer
80 views

I find it hard to understand the differences of the two. From what my understanding is that each hidden neuron in the FCRN influences all other neuron and itself while in the Hopfield it influences ...
IKNv99's user avatar
  • 111
0 votes
0 answers
78 views

I am working on a project where I generate synthetic data which is the sum of 5 random sine functions sampled every 0.01s (and I add mean reverting brownian motion noise to the data). ...
Arnav Tapadia's user avatar
0 votes
0 answers
97 views

I have a bunch of time series with known real-world events (one event per t/s). An event is not necessarily visible as a change in the t/s, but is know to have occurred in the real world. I wonder if ...
Mihail's user avatar
  • 582
19 votes
2 answers
6k views

On October 9, 2024, the Nobel Foundation announced the Nobel Prize in Physics 2024 with the following statement of merit: for foundational discoveries and inventions that enable machine learning with ...
1 vote
1 answer
117 views

I have a dataset of numerous years of buoy wave height measurements including features such as measured significant wave height, numerical model predictions, peak wave period, mean wave period, and ...
Donald M.'s user avatar

15 30 50 per page
1
2 3 4 5
43