What is the simplest classification problem which cannot be solved by a perceptron (that is a single-layered feed-forward neural network, with no hidden layers and step activation function), but it can be solved by the same network if the activation function is swapped out to a differentiable activation function (e.g. sigmoid, tanh)?
In the first case, the training would be done with the perceptron training rule, in the second case with the delta rule.
Note that regression problems cannot be solved by perceptrons, so I'm interested in classification only.