Questions tagged [dimensionality-reduction]

For questions related to AI methods of dimensionality reduction (e.g. PCA or autoencoders).

22 questions
12
votes
4 answers

What are the purposes of autoencoders?

Autoencoders are neural networks that learn a compressed representation of the input in order to later reconstruct it, so they can be used for dimensionality reduction. They are composed of an encoder and a decoder (which can be separate neural…
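As a rough illustration of the encoder/decoder idea in the excerpt above, here is a minimal sketch of a linear autoencoder written with NumPy only. The synthetic data, the 8-to-3 bottleneck, the learning rate, and the names `W_enc` and `W_dec` are all assumptions chosen for the example; they are not taken from the question or its answers.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical data: 200 samples in 8-D that lie close to a 3-D subspace.
latent = rng.normal(size=(200, 3))
mixing = rng.normal(size=(3, 8))
X = latent @ mixing + 0.01 * rng.normal(size=(200, 8))
X -= X.mean(axis=0)

# Linear encoder (8 -> 3) and decoder (3 -> 8), no biases or activations.
W_enc = 0.1 * rng.normal(size=(8, 3))
W_dec = 0.1 * rng.normal(size=(3, 8))


def reconstruction_loss(X, W_enc, W_dec):
    """Mean squared error between the input and its reconstruction."""
    return np.mean((X @ W_enc @ W_dec - X) ** 2)


initial_loss = reconstruction_loss(X, W_enc, W_dec)

lr = 0.005  # assumed step size, small enough for this toy problem
for _ in range(3000):
    Z = X @ W_enc              # encode: compressed representation
    err = Z @ W_dec - X        # decode and compare with the input
    # Gradients of the squared error, up to a constant factor.
    grad_dec = Z.T @ err / len(X)
    grad_enc = X.T @ (err @ W_dec.T) / len(X)
    W_dec -= lr * grad_dec
    W_enc -= lr * grad_enc

final_loss = reconstruction_loss(X, W_enc, W_dec)
codes = X @ W_enc              # the reduced, 3-dimensional representation
```

After training, `codes` holds the low-dimensional representation, and comparing `final_loss` with `initial_loss` gives a quick check that the bottleneck still allows a reasonable reconstruction. A practical autoencoder would normally add nonlinearities and use a deep learning framework.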
5
votes
2 answers

What are examples of approaches to dimensionality reduction of feature vectors?

Given a pre-trained CNN model, I extract feature vectors, each with several thousand elements, from the images in a reference dataset and a query dataset. I would like to apply some augmentation techniques to reduce the feature vector dimension to speed up cosine…
3
votes
1 answer

When using PCA for dimensionality reduction of the feature vectors to speed up learning, how do I know that I'm not letting the model overfit?

I'm following Andrew Ng's course for Machine Learning and I just don't quite understand the following: (1) using PCA to speed up learning; (2) using PCA to reduce the number of features, thus lowering the chances of overfitting. Looking at these two…
3
votes
0 answers

What is meant by subspace clustering in MFA?

The basic idea of MFA is to perform subspace clustering by assuming a covariance structure for each component of the form $\Sigma_i = \Lambda_i \Lambda_i^T + \Psi_i$, where $\Lambda_i \in \mathbb{R}^{D\times d}$ is the factor loadings matrix…
stoic-santiago
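The covariance structure quoted in the MFA excerpt can be made concrete with a small numerical sketch. The dimensions `D = 5` and `d = 2`, the random loadings, and the choice of a diagonal $\Psi_i$ are assumptions made for illustration (a diagonal noise term is the usual factor-analysis convention, but the excerpt is cut off before it says so).

```python
import numpy as np

rng = np.random.default_rng(0)

D, d = 5, 2  # hypothetical observed and latent dimensions

# Factor loadings matrix Lambda_i (D x d) for a single mixture component.
Lambda = rng.normal(size=(D, d))

# Assumed diagonal noise covariance Psi_i with strictly positive entries.
Psi = np.diag(rng.uniform(0.1, 1.0, size=D))

# Component covariance: a rank-d part plus the diagonal noise term.
Sigma = Lambda @ Lambda.T + Psi

low_rank = np.linalg.matrix_rank(Lambda @ Lambda.T)  # rank of the subspace part
eigenvalues = np.linalg.eigvalsh(Sigma)              # all positive if Sigma is valid
```

Under these assumptions, each component's covariance is described by `D * d + D` numbers instead of the `D * (D + 1) / 2` entries of an unconstrained symmetric matrix, and the columns of `Lambda` span the low-dimensional subspace associated with that component.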
3
votes
1 answer

How do AI researchers imagine higher dimensions?

We can visualize one, two, and three dimensions using websites or imagination. In the context of AI and, in particular, machine learning, AI researchers often have to deal with multi-dimensional random vectors. Suppose we consider a dataset of…
hanugm
2
votes
0 answers

How to reduce the dimensionality of the actions in RL

I have a single-agent RL model in which the dimension of the action space is $70$. This action space is too big and the deep RL agent is not learning properly. The boundaries of the action space are $-1$ and $1$. My question is, how…
2
votes
1 answer

Perform clustering on high dimensional data

Recently I trained a BYOL model on a set of images to learn an embedding space where similar vectors are close by. The performance was fantastic when I performed approximate K-nearest neighbours search. Now the next task, where I am facing a problem…
2
votes
1 answer

Why does PCA of the vertices of a hexagon result in principal components of equal length?

I do PCA on the data points placed in the corners of a hexagon, and get the following principal components: The PCA variance is $0.6$ and is the same for each component. Why is that? Shouldn't it be greater in the horizontal direction than in the…
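The situation in this excerpt can be reproduced with a short NumPy sketch. It assumes a regular hexagon with unit circumradius centred at the origin and the sample covariance (division by $n - 1$); neither detail is stated in the excerpt, although this setup does yield a variance of $0.6$ along both components.

```python
import numpy as np

# Vertices of a regular hexagon with circumradius 1 (assumed), centred at the origin.
angles = np.arange(6) * np.pi / 3
points = np.column_stack([np.cos(angles), np.sin(angles)])

# Sample covariance of the 2-D points (np.cov divides by n - 1 by default).
cov = np.cov(points, rowvar=False)

# PCA variances are the eigenvalues of the covariance matrix.
variances = np.linalg.eigvalsh(cov)
```

Under these assumptions `cov` is $0.6$ times the identity matrix: the squared coordinates sum to $3$ along each axis and the cross terms cancel, so no direction carries more variance than any other and the principal axes are not uniquely determined.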
2
votes
0 answers

Estimating the dimensions to which the input image size can be reduced in CNNs

Considering input images to a CNN that have a large dimension (e.g. 256X256), what are some possible methods to estimate the exact dimensions (e.g. 16X16 or 32X32) to which it can be condensed in the final pooling layer within the CNN network such…
2
votes
0 answers

Compressing Parameters of a Response System

I have an input-output system, which is fully determined by 256 parameters, of which I know a significant number are of less importance to the input-output pattern. The data I have consist of input-parameter-output matches (64k in total). My goal is to…
2
votes
2 answers

How can I use autoencoders to analyze patterns and classify them?

I generated a bunch of simulation data from a complex physical simulation that spits out patterns. I am trying to apply unsupervised learning to analyze the patterns and ideally classify them into whatever categories the learning technique…
2
votes
0 answers

Clustering of very high dimensional data and large number of examples without losing info in dimensions

I'm trying to get a grasp on scalability of clustering algorithms, and have a toy example in mind. Let's say I have around a million or so songs from $50$ genres. Each song has characteristics - some of which are common across all or most genres,…
Shirish Kulhari
1
vote
0 answers

Dimensionality Limitations

I just started learning about AI and have been reading a book called "Foundations of Machine Learning" by Mehryar Mohri so that I can try to create my own. I had a question come up recently: Can I create a machine learning algorithm that can…
1
vote
1 answer

Is it generally advisable to have a low dimensional action space in Reinforcement Learning?

In supervised or unsupervised learning, it is advised to reduce the dimensionality due to the curse of dimensionality in general. Is this also generally advisable for the action space of reinforcement learning? As far as I understand (and inspired…
1
vote
0 answers

How do I select the number of neurons for each layer in an auto-encoder for dimensionality reduction?

I am trying to apply an auto-encoder for dimensionality reduction. I wonder how it will be applied on a large dataset. I have tried this code below. I have a total of 8 features in my data and I want to reduce it to 3. from keras.models import…