Questions tagged [layers]

For questions about the layers of neural networks. Use this tag if your question involves input, output, hidden, feedforward, recurrent, convolutional, pooling (also known as subsampling), upsampling, or other kinds of layers.

7 questions
33
votes
4 answers

How to find the optimal number of neurons per layer?

When you're writing your algorithm, how do you know how many neurons you need per single layer? Are there any methods for finding the optimal number of them, or is it a rule of thumb?
5
votes
3 answers

Can you confirm that the transformer works strictly deterministically and there is no randomness inside or between the attention layers?

At a high level, temperature and randomness affect the output of a generative language model: Lower temperature: Produces more focused, conservative, and consistent responses. Moderate temperature: Strikes a balance between creativity and…
Hans-Peter Stricker
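To make the temperature effect described in this excerpt concrete, here is a minimal sketch of temperature-scaled softmax, the usual way temperature is applied to a model's output logits before sampling. The function name and the example logits are hypothetical, chosen only for illustration; this is not taken from the question itself.

```python
import math

def softmax_with_temperature(logits, temperature=1.0):
    """Turn raw scores into probabilities after dividing by a temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    # Subtract the maximum for numerical stability; the result is unchanged.
    m = max(scaled)
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical logits for three candidate tokens.
logits = [2.0, 1.0, 0.1]
low = softmax_with_temperature(logits, temperature=0.5)   # sharper distribution
high = softmax_with_temperature(logits, temperature=2.0)  # flatter distribution
```

With the lower temperature the most likely token takes a larger share of the probability mass, so sampling becomes more consistent; with the higher temperature the distribution flattens and sampling becomes more varied.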
5
votes
2 answers

In a neural network, by how much does the number of neurons typically vary from layer to layer?

In a neural network, by how much does the number of neurons typically vary from layer to layer? Note that I am NOT asking how to find the optimal number of neurons per layer. As a hardware design engineer with no practical experience programming…
2
votes
1 answer

What are the major layers in a Vision Transformer?

Currently, I am studying deepfake detection using deep learning methods. Convolutional neural networks, recurrent neural networks, long short-term memory networks, and vision transformers are well-known deep learning-based methods that are used in…
2
votes
0 answers

Has the idea of using different learning rates for different layers been explored in the literature?

I wonder whether there are heuristic rules for the optimal selection of learning rates for different layers. I expect that there is no general recipe, but probably there are some choices that may be beneficial. The common strategy uses the same…
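As an illustration of what "different learning rates for different layers" means in practice, here is a minimal sketch of a plain gradient descent step with one learning rate per layer. The layer names, parameter values, gradients, and learning rates are all hypothetical; this shows the mechanism only and is not a recommendation for how to pick the rates.

```python
def sgd_step(params, grads, lrs):
    """Apply one gradient descent step, using a separate learning rate per layer."""
    return {
        name: [w - lrs[name] * g for w, g in zip(params[name], grads[name])]
        for name in params
    }

# Hypothetical two-layer model with flattened weights.
params = {"layer1": [1.0, 2.0], "layer2": [3.0]}
grads = {"layer1": [0.5, 0.5], "layer2": [1.0]}
# Assumed choice: a smaller rate for the first layer than for the second.
lrs = {"layer1": 0.01, "layer2": 0.1}

updated = sgd_step(params, grads, lrs)
```

Setting every entry of `lrs` to the same value recovers the common single-learning-rate strategy.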
1
vote
1 answer

Formally, what are the layers in an Artificial Neural Network?

You may not believe it, but I am an ANN expert. Perhaps, for that reason, I am unable to grasp completely what the layers are in a Deep Forward Artificial Neural Network (DFANN). According to the Deep Learning "bible", p. 164 "the model is…
0
votes
0 answers

Layer Questions regarding Bidirectional VAE (D3VAE)

I am currently trying to figure out how D3VAE works, but I can't seem to understand the network architecture given. The paper can be found here: https://openreview.net/pdf?id=rG0jm74xtx The things I don't understand in the picture can be…