Questions tagged [learning-curve]

6 questions
4
votes
0 answers

Is there a mathematical formula that describes the learning curve in neural networks?

In training a neural network, you often see the curve showing how fast the neural network is getting better. It usually grows very fast then slows down to almost horizontal. Is there a mathematical formula that matches these curves? Some similar…
zooby
  • 2,196
  • 1
  • 11
  • 21
3
votes
0 answers

How to interpret the training loss curves in Soft-Actor-Critic (SAC)?

I am using stable-baseline3 implementation of the Soft-Actor-Critic (SAC) algorithm. The plotted training curves look promising. However, I am not fully sure how to interpret the actor and critic losses. The entropy coefficient $\alpha$ is…
2
votes
1 answer

Spikes in of Train and Test error

I learn a DNN for image recognition. During each epoch, I calculate mean loss in the training set. After each epoch, I calculate loss and number of errors over both training and test set. The problem is, training and test error go to (almost) zero,…
user31264
  • 203
  • 1
  • 8
1
vote
0 answers

Bias-variance tradeoff and learning curves for non-deep learning models

I am following a course on machine learning and am confused about the bias-variance trade-off relationship to learning curves in classification. I am seeing some conflicting information online on this. The scikit-learn learning curve looks like the…
1
vote
1 answer

LSTM text classifier shows unexpected cyclical pattern in loss

I'm training a text classifier in PyTorch and I'm experiencing an unexplainable cyclical pattern in the loss curve. The loss drops drastically at the beginning of each epoch and then starts rising slowly. However, the global convergence pattern…
0
votes
1 answer

How to explain that a same DNN model have radically different behaviours with each new initialization and training?

I'm trying to predict the continuous values of a variable $y$ using a Fully Connected Neural Network while providing it with data from a $(3300, 13)$ matrix $X$ where $X[i, :]=[0,...,1,...,0,x_{i}]$. So the first $12$ elements of a data vector are…