For questions related to the max-pooling operation, where the maximum of a set of numbers is chosen as the "representative" of that set of numbers, which is commonly used in convolutional neural networks.
Questions tagged [max-pooling]
13 questions
5
votes
1 answer
How many weights does the max-pooling layer have?
How many weights does the max-pooling layer have?
For example, if there are 10 inputs, a pooling filter of size 2, stride 2, how many weights, including bias, does a max-pooling layer have?

Tibby
- 53
- 5
5
votes
1 answer
What are the benefits of using max-pooling in convolutional neural networks?
I am reading Francois Chollet's Deep learning with Python, and I came across a section about max-pooling that's really giving me trouble.
I am unable to copy-paste the content, so I've included screenshots of the paragraph that's troubling me.
I…

An Ignorant Wanderer
- 191
- 1
- 4
4
votes
1 answer
Is a non-linear activation function needed if we perform max-pooling after the convolution layer?
Is there any need to use a non-linear activation function (ReLU, LeakyReLU, Sigmoid, etc.) if the result of the convolution layer is passed through the sliding window max function, like max-pooling, which is non-linear itself? What about the average…

Kasia
- 173
- 1
- 6
4
votes
1 answer
Is it effective to concatenate the results of mean-pooling and max-pooling?
Is it popular or effective to concatenate the results of mean-pooling and max-pooling, to get the invariance of the latter and the expressivity of the former?

user3180
- 598
- 3
- 14
4
votes
2 answers
Is max-pooling really bad?
Hinton doesn't believe in the pooling operation (video). I also heard that many max-pooling layers have been replaced by convolutional layers in recent years, is that true?

user559678
- 101
- 4
3
votes
0 answers
What is the fundamental difference between max pooling and adaptive max pooling used in PyTorch
PyTorch provides max pooling and adaptive max pooling.
Both, max pooling and adaptive max pooling, is defined in three dimensions: 1d, 2d and 3d. For simplicity, I am discussing about 1d in this question.
For max pooling in one dimension, the…

hanugm
- 3,571
- 3
- 18
- 50
3
votes
3 answers
Does a fully convolutional network share the same translation invariance properties we get from networks that use max-pooling?
Does a fully convolutional network share the same translation invariance properties we get from networks that use max-pooling?
If not, why do they perform as well as networks which use max-pooling?

FourierFlux
- 783
- 1
- 4
- 14
2
votes
2 answers
How translation invariance is achieved in CNNs?
I am trying to understand how translation invariance is achieved in CNNs. For example, consider the following simple binary classification problem: predicting whether the letter that appears on an image is A or B.
We want our network to be…

ado sar
- 150
- 4
2
votes
1 answer
How is the max function differentiable wrt multiple arguments?
I recently came across an answer on StackOverflow that mentioned the max function being differentiable with respect to its values.
From my current understanding of mathematics, I'm struggling to comprehend how this is possible.
Could someone help…

Peyman
- 534
- 3
- 10
2
votes
1 answer
Is there any reason behind bias towards max pooling over avg pooling?
Consider the following excerpt taken from the chapter named Using convolutions to generalize from the textbook titled Deep Learning with PyTorch by Eli Stevens et al.
Downsampling could in principle occur in different ways. Scaling an
image by half…

hanugm
- 3,571
- 3
- 18
- 50
2
votes
2 answers
How can we compute the gradient of max pooling with overlapping regions?
While studying backpropagation in CNNs, I can't understand how can we compute the gradient of max pooling with overlapping regions.
That's also a question from this quiz and can be also found on this book.

estamos
- 157
- 1
- 12
1
vote
1 answer
How can max-pooling be applied to find features in words?
I'm reading about max-pooling in a dynamic CNN paper. I can see how it can help find features in images, given that the pixel with the highest density gets pooled, but how does it help to find features in words?

user8714896
- 717
- 1
- 4
- 21
0
votes
0 answers
How to interpret CNN output image? CNN expressive enough for particle positions?
Let us suppose we have a squared grid and some particles (active vertices) that are distributed on it.
We can construct a 2D image, where basically the input image is a matrix with 0s (no particle) and 1s (particle present). This corresponds to a…

relaxon
- 31
- 4