Questions tagged [training]

For questions about training networks, rule-based systems, or other AI system components.

462 questions
23
votes
3 answers

How do I choose the optimal batch size?

Batch size is a term used in machine learning and refers to the number of training examples utilised in one iteration. The batch size can be one of three options: batch mode, where the batch size is equal to the total dataset, thus making the…
16
votes
5 answers

Why are the initial weights of neural networks randomly initialised?

This might sound silly to someone who has plenty of experience with neural networks but it bothers me... Random initial weights might give you better results that would be somewhat closer to what a trained neural network should look like, but it…
14
votes
4 answers

Can someone help me understand this paragraph from Nvidia's progressive GAN paper?

In the paper Progressive Growing of GANs for Improved Quality, Stability, and Variation (ICLR, 2018) by Nvidia researchers, the authors write: Furthermore, we observe that mode collapses traditionally plaguing GANs tend to happen very quickly, over…
13
votes
2 answers

Which layer in a CNN consumes more training time: convolution layers or fully connected layers?

In a convolutional neural network, which layer consumes more training time: convolution layers or fully connected layers? We can take the AlexNet architecture as an example. I want to see the time breakdown of the training process. I want a relative…
13
votes
3 answers

Is it possible to train a neural network to estimate a vehicle's length?

I have a large dataset (over 100k samples) of vehicles with the ground truth of their lengths. Is it possible to train a deep network to measure/estimate vehicle length? I haven't seen any papers related to estimating object size using a deep neural…
13
votes
3 answers

How to train a neural network for a round-based board game?

I'm wondering how to train a neural network for a round-based board game like tic-tac-toe, chess, Risk or any other round-based game. Getting the next move by inference seems to be pretty straightforward, by feeding the game state as input and…
soriak
13
votes
2 answers

How are generative adversarial networks trained?

I am reading about generative adversarial networks (GANs) and I have some doubts about them. So far, I understand that in a GAN there are two different types of neural networks: one is generative ($G$) and the other discriminative ($D$). The…
12
votes
1 answer

What are the best known gradient-free training methods for deep learning?

As far as I know, the current state-of-the-art methods for training deep learning networks are variants of gradient descent or stochastic gradient descent. What are the best known gradient-free training methods for deep learning (mostly in visual tasks…
11
votes
3 answers

What size of neural networks can be trained on current consumer-grade GPUs (1060, 1070, 1080)?

Is it possible to give a rule of thumb estimate about the size of neural networks that are trainable on common consumer-grade GPUs? For example, the Emergence of Locomotion (Reinforcement) paper trains a network using tanh activation of the neurons.…
10
votes
2 answers

How can I encode angle data to train neural networks?

I am training a neural network where the target data is a vector of angles in radians (between $0$ and $2\pi$). I am looking for study material on how to encode this data. Can you supply me with a book or research paper that covers this topic…
9
votes
7 answers

Why does training an SVM take so long? How can I speed it up?

I'm trying to create and test non-linear SVMs with various kernels (RBF, Sigmoid, Polynomial) in scikit-learn, to create a model which can classify anomalies and benign behaviors. My dataset includes 692703 records and I use a 75/25%…
Panagiotis
9
votes
3 answers

Is a GPU always faster than a CPU for training neural networks?

Currently, I am working on a few projects that use feedforward neural networks for regression and classification of simple tabular data. I have noticed that training a neural network using TensorFlow-GPU is often slower than training the same…
GKozinski
8
votes
2 answers

Can LSTM neural networks be sped up by a GPU?

I am training LSTM neural networks with Keras on a small mobile GPU. The speed on the GPU is slower than on the CPU. I found some articles that say that it is hard to train LSTMs (and, in general, RNNs) on GPUs because the training cannot be…
Dieshe
8
votes
3 answers

Is it okay to use publicly available Instagram videos to train an AI?

Since I haven't found any good training data for my university project, I want to use pictures and videos from public Instagram profiles. Am I allowed to do that?
8
votes
2 answers

What is the name of a human-inspired machine learning approach?

I once came across a neural network being trained without back-propagation or genetic algorithms (or using any kind of data sets). It was based on how the human brain learns and adjusts its connections between neurons. What is the name of such a…