BI - Recurrent Networks

Jul 31, 2024 · 2 min read

Questions

What are Recurrent Networks?
- ==Recurrent networks are a type of neural network designed to process sequential data, such as time series or natural language.
  Unlike feedforward neural networks, which process each input independently, recurrent networks maintain an internal memory of previous inputs and use it to inform the processing of the current input==.
- The key feature of recurrent networks is the use of recurrent connections, which allow information to flow from one time step to the next.
  Specifically, at each time step, the network receives an input, produces an output, and updates its internal state based on the current input and the previous state. The updated state then informs the processing of the next input.
- There are several types of recurrent networks, including Elman networks, Jordan networks, and more advanced architectures such as Long Short-Term Memory (LSTM) networks and Gated Recurrent Units (GRUs).
  These networks differ in the way they use recurrent connections to maintain and update their internal memory, and in the way they process the current input based on this memory.
- Recurrent networks have been successful in a variety of applications, including speech recognition, natural language processing, and time series prediction.
  However, they can be harder to train than feedforward networks, because gradients propagated through the recurrent connections over long sequences can vanish or explode.
  Techniques such as gradient clipping, weight regularization, and truncated backpropagation through time are often used to address these issues.
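
The state update and gradient clipping described above can be sketched in a few lines of NumPy. This is a minimal illustration and not code from the slides: it assumes a simple Elman-style layer with a `tanh` activation, and the names `rnn_forward`, `clip_by_global_norm`, `W_xh`, `W_hh`, and `b_h` are hypothetical.

```python
import numpy as np

def rnn_forward(inputs, W_xh, W_hh, b_h, h0=None):
    """Run a simple (Elman-style) recurrent layer over a sequence.

    inputs: array of shape (T, input_dim), one row per time step.
    Returns the hidden states, shape (T, hidden_dim).
    """
    hidden_dim = W_hh.shape[0]
    h = np.zeros(hidden_dim) if h0 is None else h0
    states = []
    for x_t in inputs:
        # The new state depends on the current input AND the previous state.
        h = np.tanh(W_xh @ x_t + W_hh @ h + b_h)
        states.append(h)
    return np.stack(states)


def clip_by_global_norm(grads, max_norm):
    """Rescale a list of gradient arrays so their joint L2 norm is at most max_norm."""
    total = np.sqrt(sum(float(np.sum(g ** 2)) for g in grads))
    scale = min(1.0, max_norm / (total + 1e-12))
    return [g * scale for g in grads]


# Assumed toy sizes and random weights, for illustration only.
rng = np.random.default_rng(0)
input_dim, hidden_dim = 3, 4
W_xh = rng.normal(scale=0.5, size=(hidden_dim, input_dim))
W_hh = rng.normal(scale=0.5, size=(hidden_dim, hidden_dim))
b_h = np.zeros(hidden_dim)

x = np.ones(input_dim)
seq_a = np.stack([np.zeros(input_dim), x])  # history: zeros, then x
seq_b = np.stack([x, x])                    # history: x, then x

states_a = rnn_forward(seq_a, W_xh, W_hh, b_h)
states_b = rnn_forward(seq_b, W_xh, W_hh, b_h)

# Same final input, different history: the final states differ.
same_final_state = np.allclose(states_a[-1], states_b[-1])

# A gradient with norm 5 is scaled down to norm (approximately) 1.
clipped = clip_by_global_norm([np.array([3.0, 4.0])], max_norm=1.0)
```

Both sequences end with the same input, yet the final hidden states differ because the state carries the earlier input forward. A feedforward layer applied to the last input alone would give identical results for both.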

—————————————————————

Slides with Notes

RNN: Recurrent Neural Network

MLP: Multilayer Perceptron (a simple neural network with $0$ to $n$ hidden layers)
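
As a rough sketch of how the two terms differ, one hidden layer of each can be written as follows. The symbols are illustrative assumptions and are not taken from the slides: $f$ is a nonlinearity, $W$, $W_x$, $W_h$ are weight matrices, and $b$ is a bias.

```latex
$$
\text{MLP layer:}\quad h = f(W x + b)
$$

$$
\text{RNN step:}\quad h_t = f(W_x x_t + W_h h_{t-1} + b)
$$
```

The only structural difference is the extra term $W_h h_{t-1}$, which feeds the previous hidden state back into the current step.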
