I've done an MNIST digit recognition neural network.
When you put images in that are completely unlike its training data, it still tries to classify them as digits. Sometimes it strongly classifies nonsense data as being a specific digit.
I am interested in the problem of rejecting nonsense inputs generally, and want a solution that is effective for MNIST in particular. I think the most basic approach is just to have a confidence threshold for the standard network. I have also heard about a Bayesian approach.
I have read that in high dimensional vectors like these bitmap images "almost all data is extrapolation", I think this makes rejection of out of distribution a difficult problem, but I feel like it is an important problem too so I will appreciate information on this. Thank you.