Questions tagged [recurrent-neural-network]
A recurrent neural network (RNN) is a class of artificial neural network where connections between units form a directed cycle.
634 questions
1 vote
0 answers
24 views
Is there a work on trying to pretrain RNNs model? [closed]
I really want to play around with RNNs. Trying to build an AI assistant with RNNs to run on my machine as I'm always obsessed with RNNs model... To make the performance good, I think I need to do some ...
0 votes
0 answers
49 views
Is expanding the recurrence relation for RNNs useful in practice?
The key equation for an RNN gives us a recurrence relation for the $\frac{\partial h_t}{\partial w_\textrm{h}}$ which are used later to compute the derivative of the loss through a simple chain rule. $...
1 vote
0 answers
45 views
Simple questions about RNNs
I do not understand this simple equality $$\frac{\partial h_t}{\partial w_\textrm{h}}= \frac{\partial f(x_{t},h_{t-1},w_\textrm{h})}{\partial w_\textrm{h}} +\frac{\partial f(x_{t},h_{t-1},w_\textrm{h})...
3 votes
2 answers
125 views
Question on RNNs lookback window when unrolling
I will use the answer here as an example: https://stats.stackexchange.com/a/370732/78063 It says "which means that you choose a number of time steps $N$, and unroll your network so that it ...
1 vote
1 answer
59 views
Is it possible to ignore the past values of the response variable in an LSTM model with multiple predictor variables?
I have an LSTM model to predict a variable by considering multiple variables. (Say the target variable is river discharge and the predictors are rainfall, temperature, evapotranspiration etc.) There ...
4 votes
1 answer
217 views
Do i.i.d. assumptions extend to datasets of independently generated sequences in modern sequence models (e.g., RNNs)?
In standard machine learning settings with cross-sectional data, it's common to assume that data points are independently and identically distributed (i.i.d.) from some fixed data-generating process (...
1 vote
0 answers
48 views
Vec2Seq Model Formula
I'm reading Murphy's PML and struggling with the derivation of this formula: $$p(\mathbf{y}_{1:T}|\mathbf{x}) = \sum_{\mathbf{h}_{1:T}} p(\mathbf{y}_{1:T}, \mathbf{h}_{1:T}|\mathbf{x}) = \sum_{\mathbf{...
0 votes
0 answers
33 views
Short and Long terms in LSTM
I have read somewhere (sorry, couldn't find the reference anymore), that $C_t$ (check figure below) represents the "long term" in LSMT. The reason was because "The information goes ...
0 votes
0 answers
132 views
How to properly split train/val sets for time series LSTM prediction with multiple unique items?
I am working on a time series prediction problem using an LSTM model. My dataset consists of 27 different items, each with unique IDs, and roughly the same number of samples per item. There are around ...
0 votes
0 answers
89 views
Analytically solving backpropagation through time for a simple gated RNN
Consider the following simple gated RNN: \begin{aligned} c_{t} &= \sigma\bigl(W_{c}\,x_{t} + W_{z}\,z_{t-1}\bigr) \\[6pt] z_{t} &= c_{t} \,\odot\, z_{t-1} \;\;+\;\; (1 - c_{t}) \,\odot\,\...
1 vote
1 answer
80 views
What are the differences between Fully recurrent neural networks Hopfield networks and Elman networks?
I find it hard to understand the differences of the two. From what my understanding is that each hidden neuron in the FCRN influences all other neuron and itself while in the Hopfield it influences ...
0 votes
0 answers
78 views
LSTM Limitations for Time Series Forecasting
I am working on a project where I generate synthetic data which is the sum of 5 random sine functions sampled every 0.01s (and I add mean reverting brownian motion noise to the data). ...
0 votes
0 answers
97 views
What type of model/analysis to use to predict events in time series data
I have a bunch of time series with known real-world events (one event per t/s). An event is not necessarily visible as a change in the t/s, but is know to have occurred in the real world. I wonder if ...
19 votes
2 answers
6k views
What are the works of Hopfield and Hinton that enable machine learning with neural networks, as noted in the physics Nobel award statement?
On October 9, 2024, the Nobel Foundation announced the Nobel Prize in Physics 2024 with the following statement of merit: for foundational discoveries and inventions that enable machine learning with ...
1 vote
1 answer
117 views
Should classical/traditional ML techniques such as polynomial regression/decision trees/random forests SIGNIFICANTLY outperform RNN in timeseries? [closed]
I have a dataset of numerous years of buoy wave height measurements including features such as measured significant wave height, numerical model predictions, peak wave period, mean wave period, and ...