Highest Voted 'training' Questions - Artificial Intelligence Stack Exchange

23

votes

3 answers

How do I choose the optimal batch size?

Batch size is a machine learning term for the number of training examples used in one iteration. The batch size can be one of three options: batch mode, where the batch size is equal to the total dataset, thus making the…

asked Oct 21 '18 at 17:09

Sebastian Nielsen

363
1
2
10
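To make the batch-size excerpt above concrete, here is a minimal sketch (not taken from the question or its answers) of how batch size relates to the number of iterations per pass over the data. The helper names `iterations_per_epoch` and `iter_batches` are hypothetical, and the ten-element list stands in for a real dataset.

```python
import math


def iterations_per_epoch(num_examples: int, batch_size: int) -> int:
    """Iterations needed to visit every example once (hypothetical helper)."""
    if num_examples < 1 or batch_size < 1:
        raise ValueError("num_examples and batch_size must be positive")
    return math.ceil(num_examples / batch_size)


def iter_batches(examples, batch_size):
    """Yield consecutive slices of at most batch_size examples."""
    for start in range(0, len(examples), batch_size):
        yield examples[start:start + batch_size]


data = list(range(10))  # toy stand-in for a training set

# Batch mode as described in the excerpt: batch size equals the dataset size.
full = list(iter_batches(data, len(data)))

# A smaller batch size gives more iterations; the last batch may be short.
small = list(iter_batches(data, 4))
```

With ten examples, a batch size of ten gives a single iteration per pass, while a batch size of four gives three, the last one covering only the two leftover examples.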

16

votes

5 answers

Why are the initial weights of neural networks randomly initialised?

This might sound silly to someone who has plenty of experience with neural networks, but it bothers me... Random initial weights might give you better results that would be somewhat closer to what a trained neural network should look like, but it…

neural-networks training weights weights-initialization

asked Oct 21 '17 at 06:52

Matas Vaitkevicius

271
5
12

14

votes

4 answers

Can someone help me understand this paragraph from Nvidia's progressive GAN paper?

In the paper "Progressive Growing of GANs for Improved Quality, Stability, and Variation" (ICLR 2018) by Nvidia researchers, the authors write: Furthermore, we observe that mode collapses traditionally plaguing GANs tend to happen very quickly, over…

deep-learning training papers generative-adversarial-networks discriminator

asked Jun 06 '18 at 23:27

Inkplay_

411
4
8

13

votes

2 answers

Which layer in a CNN consumes more training time: convolution layers or fully connected layers?

In a convolutional neural network, which layer consumes more training time: convolution layers or fully connected layers? We can take the AlexNet architecture as an example. I want to see the time breakdown of the training process. I want a relative…

neural-networks deep-learning convolutional-neural-networks training

asked Sep 06 '18 at 23:27

Ruchit Dalwadi

325
3
11

13

votes

3 answers

Is it possible to train a neural network to estimate a vehicle's length?

I have a large dataset (over 100k samples) of vehicles with the ground truth of their lengths. Is it possible to train a deep network to measure/estimate vehicle length? I haven't seen any papers related to estimating object size using a deep neural…

machine-learning deep-learning computer-vision training reference-request

asked Oct 16 '17 at 18:10

Naji

139
1
1
3

13

votes

3 answers

How to train a neural network for a round-based board game?

I'm wondering how to train a neural network for a round-based board game like tic-tac-toe, chess, Risk, or any other round-based game. Getting the next move by inference seems to be pretty straightforward, by feeding the game state as input and…

training tensorflow game-ai

asked May 19 '17 at 18:38

soriak

239
1
2
3

13

votes

2 answers

How are generative adversarial networks trained?

I am reading about generative adversarial networks (GANs) and I have some doubts about them. So far, I understand that in a GAN there are two different types of neural networks: one is generative ($G$) and the other discriminative ($D$). The…

neural-networks deep-learning training generative-adversarial-networks self-supervised-learning

asked Oct 24 '16 at 11:42

Eka

1,036
8
23

12

votes

1 answer

What are the best known gradient-free training methods for deep learning?

As far as I know, the current state-of-the-art methods for training deep learning networks are variants of gradient descent or stochastic gradient descent. What are the best known gradient-free training methods for deep learning (mostly in visual tasks…

deep-learning reference-request training algorithm-request

asked Aug 24 '17 at 12:42

rkellerm

334
1
9

11

votes

3 answers

What size of neural networks can be trained on current consumer-grade GPUs? (1060, 1070, 1080)

Is it possible to give a rule-of-thumb estimate of the size of neural networks that are trainable on common consumer-grade GPUs? For example, the Emergence of Locomotion (Reinforcement) paper trains a network using tanh activation of the neurons.…

neural-networks training performance hardware-evaluation

asked Dec 09 '17 at 10:20

pascalwhoop

305
1
8

10

votes

2 answers

How can I encode angle data to train neural networks?

I am training a neural network where the target data is a vector of angles in radians (between $0$ and $2\pi$). I am looking for study material on how to encode this data. Can you supply me with a book or research paper that covers this topic…

neural-networks reference-request training datasets data-preprocessing

asked Nov 27 '22 at 03:47

user366312

351
1
12
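As an illustration for the angle-encoding question above, one commonly used option (an assumption here, not something stated in the question or confirmed by its answers) is to represent each angle as a point on the unit circle, so that angles just above $0$ and just below $2\pi$ end up close together. The function names `encode_angle` and `decode_angle` are hypothetical.

```python
import math


def encode_angle(theta: float) -> tuple[float, float]:
    """Map an angle in radians to (cos, sin) on the unit circle."""
    return (math.cos(theta), math.sin(theta))


def decode_angle(c: float, s: float) -> float:
    """Recover an angle in [0, 2*pi) from a (cos, sin) pair."""
    return math.atan2(s, c) % (2 * math.pi)


# Two angles on opposite sides of the 0 / 2*pi wrap-around point.
near_zero = encode_angle(0.01)
near_full_turn = encode_angle(2 * math.pi - 0.01)

# Euclidean distance between the two encodings stays small, even though
# the raw angle values differ by almost 2*pi.
wrap_distance = math.dist(near_zero, near_full_turn)
```

Under this sketch a network would predict two values per angle instead of one, and `decode_angle` would convert a prediction back to radians.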

9

votes

7 answers

Why does training an SVM take so long? How can I speed it up?

I'm trying to create and test non-linear SVMs with various kernels (RBF, sigmoid, polynomial) in scikit-learn, to create a model which can classify anomalies and benign behaviors. My dataset includes 692,703 records and I use a 75/25%…

machine-learning training support-vector-machine

asked Jul 19 '18 at 11:01

Panagiotis

191
1
1
2

9

votes

3 answers

Is a GPU always faster than a CPU for training neural networks?

Currently, I am working on a few projects that use feedforward neural networks for regression and classification of simple tabular data. I have noticed that training a neural network using TensorFlow-GPU is often slower than training the same…

neural-networks training tensorflow gpu

asked Aug 24 '19 at 13:28

GKozinski

1,240
8
19

8

votes

2 answers

Can LSTM neural networks be sped up by a GPU?

I am training LSTM neural networks with Keras on a small mobile GPU. The speed on the GPU is slower than on the CPU. I found some articles that say that it is hard to train LSTMs (and, in general, RNNs) on GPUs because the training cannot be…

training tensorflow keras long-short-term-memory gpu

asked Jul 09 '18 at 04:55

Dieshe

279
1
2
6

8

votes

3 answers

Is it okay to use publicly available Instagram videos to train an AI?

Since I haven't found any good training data for my university project, I want to use pictures and videos from public Instagram profiles. Am I allowed to do that?

computer-vision training datasets research image-processing

asked Sep 22 '21 at 12:04

Bert Gayus

545
3
12

8

votes

2 answers

What is the name of a human-inspired machine learning approach?

I once came across a neural network being trained without back-propagation or genetic algorithms (or using any kind of data sets). It was based on how the human brain learns and adjusts its connections between neurons. What is the name of such a…

neural-networks machine-learning training terminology hebbian-learning

asked Mar 31 '17 at 21:40

Philogy

201
1
6

Questions tagged [training]