I'm having trouble understanding how bias is added to the feature extraction convolution. I've seen people either refer to the bias as a single number that changes per filter or the whole matrix that is the size of the output. Here is what I mean:
- $I$ is the input single-channel image.
- $F$ is the filter.
- $b$ is the bias.
- "Izhod" means "output".
Which is actually the correct bias used in CNN?