For questions about the hinge loss, which is typically used for "maximum-margin" classification, most notably for support vector machines (SVMs).
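As a rough illustration of the loss this tag covers, here is a minimal sketch of the standard binary hinge loss, $\max(0, 1 - y \cdot f(x))$, assuming labels are encoded as $+1$ or $-1$ and that `score` is the raw (unthresholded) model output. The function names `hinge_loss` and `mean_hinge_loss` are illustrative, not taken from any particular library.

```python
def hinge_loss(y_true, score):
    """Hinge loss for a single example.

    Assumes y_true is +1 or -1 and score is the raw model output f(x).
    The loss is zero once the example is on the correct side with margin >= 1.
    """
    return max(0.0, 1.0 - y_true * score)


def mean_hinge_loss(labels, scores):
    """Average hinge loss over a batch of (label, score) pairs."""
    return sum(hinge_loss(y, s) for y, s in zip(labels, scores)) / len(labels)


if __name__ == "__main__":
    labels = [1, 1, -1]
    scores = [2.0, 0.5, 0.5]
    # Per-example losses: correct with margin, correct but inside margin, wrong side
    print([hinge_loss(y, s) for y, s in zip(labels, scores)])
    print(mean_hinge_loss(labels, scores))
```

Note that a correctly classified example can still incur a positive loss when its score falls inside the margin, which is what distinguishes the hinge loss from a plain misclassification count.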
Questions tagged [hinge-loss]
4 questions
7 votes, 2 answers
How should we interpret this figure that relates the perceptron criterion and the hinge loss?
I am currently studying the textbook Neural Networks and Deep Learning by Charu C. Aggarwal. Section 1.2.1.2, "Relationship with Support Vector Machines", says the following:
The perceptron criterion is a shifted version of the hinge-loss used in…

The Pointer
2 votes, 1 answer
What is the definition of the hinge loss function?
I came across the hinge loss function for training a neural network model, but I do not know its analytical form.
I can write the mean squared error loss function (which is more often used for regression) as
$$\sum\limits_{i=1}^{N}(y_i…

hanugm
1 vote, 1 answer
What is the definition of the "cost" function in the SVM's objective function?
In a course that I am attending, the cost function of a support vector machine is given by
$$J(\theta)=\sum_{i=1}^{m} y^{(i)} \operatorname{cost}_{1}\left(\theta^{T} x^{(i)}\right)+\left(1-y^{(i)}\right) \operatorname{cost}_{0}\left(\theta^{T}…

jr123456jr987654321
0 votes, 0 answers
Choosing an appropriate loss function for sparse label proportion estimation
I'm working on the task of estimating sparse label proportions, where the target is a probability distribution $\textbf{q} \in \Delta^{K-1}$ and $\Delta^{K-1} := \{\textbf{p} \in \mathbb{R}^K \, | \, p_1 + \dots + p_K = 1 \}$ and the support is…

James Arten