Questions tagged [feature-selection]

For questions related to the concept of feature selection (also known as variable selection or attribute selection), which is the process of selecting a subset of relevant features (a.k.a. variables or predictors) for use in model construction.

For more info, see e.g. https://en.wikipedia.org/wiki/Feature_selection.

52 questions
10
votes
4 answers

How do I select the relevant features of the data?

Recently I was working on a problem to do some cost analysis of my expenditure for some particular resource. I usually make some manual decisions from the analysis and plan accordingly. I have a big data set in Excel format with hundreds of…
7
votes
1 answer

Why can the addition of features decrease the performance of a neural network?

I have a Remaining Useful Life (RUL) prediction problem that I want to solve. When I added two or more features as inputs to my ANN, its accuracy decreased. More precisely, I've added features like RMS or kurtosis (or both). I was…
5
votes
2 answers

How much can the addition of new features improve the performance?

How much can the addition of new features improve the performance of the model during the optimization process? Let's say I have a total of 10 features. Suppose I start the optimisation process using only 3 features. Can the addition of the 7…
4
votes
1 answer

How does uniform offset tiling work with function approximation?

I get the fundamental idea of how tilings work, but in Sutton and Barto's book, Reinforcement Learning: An Introduction (2nd edition), a diagram on page 219 (figure 9.11) showing the variations of uniform offset tiling has confused me. I don't…
4
votes
2 answers

How should I select the features for predicting diseases (in particular when patients specify their health issues)?

My aim is to train a model for predicting diseases. Now, according to this Wikipedia article, diseases are classified based on the following criteria in general: Causes (of the disease) Pathogenesis (the mechanism by which the disease…
3
votes
2 answers

Problem extracting features from convolutional layer where the dimensions are big for feature maps

I have trained a convolutional neural network on images to detect emotions. Now I need to use the same network to extract features from the images and use them to train an LSTM. The problem is: the dimensions of the top layers are: [None, 4, 4, 512]…
3
votes
1 answer

What is a good descriptor for similar objects?

I am developing an image search engine. The engine is meant to retrieve wrist watches based on the input of the user. I am using SIFT descriptors to index the elements in the database and applying Euclidean distance to get the most similar watches.…
3
votes
1 answer

Is automated feature engineering a path to general AI?

I recently came across the featuretools package, which facilitates automated feature engineering. Here's an explanation of the package: https://towardsdatascience.com/automated-feature-engineering-in-python-99baf11cc219 Automated feature…
3
votes
1 answer

What is the $\ell_{2, 1}$ norm?

I'm reading this paper and it says: In this paper, we present a multi-class embedded feature selection method called as sparse optimal scoring with adjustment (SOSA), which is capable of addressing the data heterogeneity issue. We propose to…
Gyntonic
3
votes
0 answers

Feature Selection using Monte Carlo Tree Search

I'm trying to tackle the problem of feature selection as an RL problem, inspired by the paper Feature Selection as a One-Player Game. I know Monte-Carlo tree search (MCTS) is hardly RL. So, I used MCTS for this problem, where nodes are subsets of…
2
votes
2 answers

How to recognize non-circular radial symmetry in images?

This is a question about pattern recognition and feature extraction. I am familiar with Hough transforms, the Fast Radial Transform and variants (e.g., GFRS), but these highlight circles, spheres, etc. I need an image filter that will highlight the…
2
votes
2 answers

How can I minimize the number of answers that are relevant to a machine learning model?

Problem: We have a fairly big database that is built up by our own users. The data is entered by asking the users about 30 questions, each with around 12 possible answers (x, a, A, B, C, ..., H). The letters stand for values that we can later…
2
votes
2 answers

Which correlated feature should be eliminated from a model?

BACKGROUND: There is a lot of information online about the problem of multicollinearity as it relates to machine learning and how to identify correlated features. However, I am still unclear on which variables to eliminate once a correlated subset…
Snehal Patel
2
votes
0 answers

How to decide which column has more weightage to output

As the image shows, the value of Column_A is directly proportional to the output, while a change in the value of Column_B has no effect on the output. So basically I want to know whether there is any algorithm that gives me the weight of the columns which are affecting more…
2
votes
3 answers

Why are my classification results correlated with the proportions of my data?

I'm facing a problem. I'm working on a mixed-data model with a NN (MLP & word embedding). My results are not very good, and I observed that the proportions of my data are correlated with my classification results. Let me explain: As you can see, I have…