Questions tagged [multiclass-classification]

In machine learning, multiclass or multinomial classification is the problem of classifying instances into one of three or more classes (classifying instances into one of two classes is called binary classification).

The existing multi-class classification techniques can be categorized into (1) transformation to binary (2) extension from binary and (3) hierarchical classification:

  1. Transformation to binary: This section discusses strategies for reducing the problem of multiclass classification to multiple binary classification problems. It can be categorized into one vs rest and one vs one. The techniques developed based on reducing the multi-class problem into multiple binary problems can also be called problem transformation techniques
  2. Extension from binary: This section discusses strategies of extending the existing binary classifiers to solve multi-class classification problems. Several algorithms have been developed based on neural networks, decision trees, k-nearest neighbors, naive Bayes, support vector machines and extreme learning machines to address multi-class classification problems. These types of techniques can also be called algorithm adaptation techniques.
  3. Hierarchical classification: Hierarchical classification tackles the multi-class classification problem by dividing the output space i.e. into a tree. Each parent node is divided into multiple child nodes and the process is continued until each child node represents only one class. Several methods have been proposed based on hierarchical classification.
29 questions
19
votes
2 answers

How to implement an "unknown" class in multi-class classification with neural networks?

For example, I need to detect classes for MNIST data. But I want to have not 10 classes for digits, but also I want to have 11th class "not a digit", so that any letter, any other type of image, or random noise would be classified as "not a digit".…
4
votes
0 answers

When computing the ROC-AUC score for multi-class classification problems, when should we use One-vs-Rest and One-vs-One?

The sklearn's documentation of the method roc_auc_score states that the parameter multi_class can take the value 'OvR' (which stands for One-vs-Rest) or 'OvO' (which stands for One-vs-One). These values are only applicable for multi-class…
3
votes
2 answers

What is the general procedure to use and train neural networks for multi-class classification?

I am very new to machine learning. I am following the course offered by Andrew Ng. I am very confused about how we train our neural network for multi-class classification. Let's say we have $K$ classes. For $K$ classes, we will be training $K$…
2
votes
1 answer

Image classification problem with multiple right classes

I have a use case where the model needs to detect fabricdefects. There are 15+ different kinds of defects. In one image there can be multiple defects present. The straight forward solution for this should be a multilabel model from my understanding.…
2
votes
0 answers

Which algorithm for hierarchical and ordered classification?

I have developed the "Pyrates" application which is a serious game to learn programming in Python. In each level of the game, you have to pick up a key and open a chest. To do this, you need to use the control functions available (walk, jump, etc.).…
2
votes
1 answer

Why do we use the softmax instead of no activation function?

Why do we use the softmax activation function on the last layer? Suppose $i$ is the index that has the highest value (in the case when we don't use softmax at all). If we use softmax and take $i$th value, it would be the highest value because $e$ is…
2
votes
1 answer

Is Mask R-CNN suited to solve a multi-class classification problem where the classes are related?

I want to create a model to solve a multi-class classification problem. Here are more details about my problem. Every picture contains only one object The background is very simple All objects belong to the same family of objects (for example,…
1
vote
0 answers

Out of distribution detection (OOD) in the context of regression problems

I'm working in a regression setting to predict a scalar value $y$ from an input $\textbf{x} \in \mathbb{R}^D$ and I'm interested in understanding whenever my model is fed with something that it is outside the (unknown) training distribution…
1
vote
0 answers

Use soft-max post-training for a ReLU trained network?

For a project, I've trained multiple networks for multiclass classification all ending with a ReLU activation at the output. Now the output logits are not probabilities. Is it valid to get the probability of each class by applying a softmax function…
1
vote
0 answers

How to deal with images that do not contain any object of interest?

I'm currently working on an iOS App where I want to detect if there is a table, chair or bench in the current camera input. My idea was to take the MobileNetV2 model and get it to classify these three categories with transfer learning in TensorFlow.…
1
vote
3 answers

What is the difference (if any) between semantic segmentation and multi-class, mutually exclusive classification?

Multi-class classification is simply assigning all data points into one of up to any finite number of mutually exclusive labels. I am new to the field(s) of AI/ML and I keep hearing people use the term "semantic segmentation." I want to "translate"…
1
vote
1 answer

How do I calculate the probabilities of the BERT model prediction logits?

I might be getting this completely wrong, but please let me first try to explain what I need, and then what's wrong. I have a classification task. The training data has 50 different labels. The customer wants to differentiate the low probability…
1
vote
0 answers

Using one-class classification first to find anomalies then apply multi-class classification

I'm new to machine learning and trying to apply it for fault detection, an idea came to mind which is using only anomaly detection after which if the results after a while come up as positive, a multi-class classification algorithm (using 7…
mak
  • 11
  • 1
1
vote
1 answer

Multi class text classification when having only one sample for classes

I have a dataset of texts, each text was identified with an ID number. I would like to do a prediction by finding the best match ID number for upcoming new texts. To use multi text classification, I am not sure if this is the right approach since…
1
vote
1 answer

Is it possible to combine k-fold cross-validation and oversampling for a multi-class text classification task with imbalanced data?

I am dealing with an intent classification task on an Italian customer service data set. I've more or less 1.5k sentences and 29 classes (imbalanced). According to the literature, a good choice is to generate synthetic data, oversampling, or…
1
2