I am doing some experimentation on neural networks, and for that I am trying to program a plain OCR task. I have learned CNNs are the best choice ,but for the time being and due to my inexperience, I wanna go step by step and start with feedforward nets.
So my training data is a set of roughly 400 16*16 images extracted from a script that draws every alphabet char in a tiny image for a small set of fonts registered in my computer.
Then the test data set is extracted from the same procedure, but for all fonts in my computer.
Well, results are quite bad. Get an accuracy of aprox. 45-50%, which is very poor... but that's not my question.
The point is that I can't get the MSE below 0.0049, no matter what hidden layer distribution I apply to the net. I have tried with several architectures and all comes down to this figure. Does that mean the net cannot learn any further given the data?
This MSE value however throws this poor results too.
I am using Tensorflow API directly, no keras or estimators and for a list of 62 recognizable characters these are examples of the architectures I have used: [256,1860,62] [256,130,62] [256,256, 128 ,62] [256,3600,62] ....
But never get the MSE below 0.0049, and still results are not over 50%.
Any hints are greatly appreciated.