Highest Voted Questions - Artificial Intelligence Stack Exchange

5

votes

1 answer

Why didn't the Go champion manage to win the last game against AlphaGo after winning the 4th one?

In the documentary about the match, it is said that after losing the 4th game, AlphaGo came back stronger, started to play in a weird (not human-like) way, and was nearly impossible to beat. Why and how did that happen?

reinforcement-learning game-ai monte-carlo-tree-search alphago

asked Mar 26 '19 at 20:43

Jay Critch

343
1
7

5

votes

2 answers

How can we recognise musical notes in low-resolution or blurry images?

I was looking for an approach to recognise musical notes from photos. I found this repository https://github.com/mpralat/notesRecognizer. However, it doesn't seem good enough. If you look into the bad folder, you can see that just tiny variations of…

image-recognition optical-character-recognition

asked Mar 24 '19 at 23:20

Toskan

151
1
4

5

votes

2 answers

How do I improve accuracy and know when to stop training?

I am training a modified VGG-16 to classify crowd density (empty, low, moderate, high). Two dropout layers were added at the end of the network, one after each of the last two FC layers. Network settings: the training data contains 4381 images…

convolutional-neural-networks classification performance vgg

asked Mar 24 '19 at 08:40

norahik

125
4

5

votes

1 answer

Why is Q2 a more or less independent estimate in Twin Delayed DDPG (TD3)?

Twin Delayed Deep Deterministic (TD3) policy gradient is inspired by both double Q-learning and double DQN. In double Q-learning, I understand that Q1 and Q2 are independent because they are trained on different samples. In double DQN, I understand…

reinforcement-learning q-learning dqn deep-rl ddpg

asked Mar 24 '19 at 05:26

Luke Guye

61
2

5

votes

2 answers

Are on-line backpropagation iterations perpendicular to the constraint?

Raul Rojas' Neural Networks: A Systematic Introduction, section 8.1.2, relates off-line backpropagation and on-line backpropagation with Gauss-Jacobi and Gauss-Seidel methods for finding the intersection of two lines. What I can't understand is how…

backpropagation math gradient-descent

asked Mar 23 '19 at 16:03

EmmanuelMess

207
3
14

5

votes

1 answer

Why should we study causation in artificial intelligence?

Judea Pearl won the 2011 Turing Award "for fundamental contributions to artificial intelligence through the development of a calculus for probabilistic and causal reasoning". He is credited with the invention of Bayesian networks and a framework for…

applications causation

asked Mar 22 '19 at 15:11

nbro

39,006
12
98
176

5

votes

1 answer

How can we use linear programming to solve an MDP?

Apparently, we can solve an MDP (that is, we can find the optimal policy for a given MDP) using a linear programming formulation. What's the basic idea behind this approach? I think you should start by explaining the basic idea behind a linear…

reinforcement-learning optimization markov-decision-process linear-programming

asked Mar 14 '19 at 21:12

nbro

39,006
12
98
176

5

votes

1 answer

What would be a good comprehensive source about the different forms of classical learning in mammals?

I am looking for a source that discusses the classic rules of learning in depth: classical conditioning, operant conditioning, imitation learning... I have found countless books that supposedly discuss these topics, but have…

reinforcement-learning reference-request biology

asked Mar 13 '19 at 11:20

Gurkenkönig

101
5

5

votes

1 answer

What is a graph neural network?

What is a graph neural network (GNN)? Here are some sub-questions: How is a GNN different from a NN? How exactly is a GNN related to graphs? What are the components of a GNN? What are the inputs and outputs of GNNs? How can GNNs be trained? Can we…

deep-learning definitions geometric-deep-learning graph-neural-networks

asked Mar 12 '19 at 10:46

nbro

39,006
12
98
176

5

votes

1 answer

Which neural network can count the number of objects in an image?

I'm looking for a neural network architecture that excels at counting objects. For example, a CNN that can output the number of balls (or any other object) in a given image. I already found articles about crowd counting. I'm looking for articles about…

deep-learning convolutional-neural-networks object-detection model-request

asked Mar 10 '19 at 17:29

ron653

83
1
8

5

votes

2 answers

What should the range of the output layer be when performing classification?

I am working on an MLP neural network, using supervised learning (2-class and multi-class classification problems). For the hidden layers, I am using $\tanh$ (which produces an output in the range $(-1, 1)$) and for the output layer a softmax…

neural-networks classification activation-functions supervised-learning

asked Mar 08 '19 at 12:52

LVoltz

121
1
5
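As a rough illustration of the setup described in the question above, here is a minimal sketch in plain Python of the output ranges involved. The input values and variable names are made up for the example and do not come from the asker's network: $\tanh$ activations stay strictly between -1 and 1, while a softmax output lies strictly between 0 and 1 and sums to 1.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of real-valued logits."""
    largest = max(logits)  # subtract the max so exp() cannot overflow
    exps = [math.exp(z - largest) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical pre-activations of a hidden layer (illustrative values only).
pre_activations = [-2.0, 0.0, 3.0]

# tanh squashes each value into the open interval (-1, 1).
hidden = [math.tanh(x) for x in pre_activations]

# Feeding those values to softmax yields a probability vector.
probs = softmax(hidden)

print(hidden)
print(probs, sum(probs))
```

Under these assumptions, the output layer's range is fixed by the softmax itself, whatever the range of the hidden activations.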

5

votes

1 answer

Is there a neural network with a varying number of neurons?

Is there some type of neural network that changes the number of neurons while training? Using this idea, the network can increase or decrease the number of neurons when the complexity of the inputs increases or decreases.

neural-networks neat neuroevolution

asked Mar 03 '19 at 13:05

Aura Lee

239
1
7

5

votes

1 answer

Why would someone use NEAT over other machine learning algorithms?

Why would someone use a neuroevolution algorithm, such as NEAT, over other machine learning algorithms? What situation would only apply to an algorithm such as NEAT, but no other machine learning algorithm?

machine-learning neat comparison neuroevolution

asked Mar 02 '19 at 10:17

Sebastian Dixon

97
1
6

5

votes

2 answers

Is there an alternative to the use of a target network?

In the context of deep Q-networks (DQN), a target network is usually used. The target network is a slowly changing network whose rate of change is a hyperparameter. This includes both a replacement update every $N$ iterations and a slow update every…

reinforcement-learning deep-rl

asked Feb 27 '19 at 09:25

Phizaz

510
3
13
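The two target-network update styles mentioned in the question above can be sketched as follows. This is a minimal illustration in plain Python, not the asker's code: weights are stored as dictionaries of lists, and the names `hard_update`, `soft_update`, `tau`, and `n` are hypothetical. The slow update is assumed here to be the commonly used Polyak averaging, target ← tau · online + (1 − tau) · target.

```python
def hard_update(target, online):
    """Replacement update: copy every online weight into the target."""
    for name, values in online.items():
        target[name] = list(values)


def soft_update(target, online, tau):
    """Slow (Polyak) update: move the target a fraction tau toward the online weights."""
    for name, values in online.items():
        target[name] = [
            tau * w + (1.0 - tau) * t for w, t in zip(values, target[name])
        ]


# Toy weights (illustrative values only).
online = {"w": [1.0, 2.0]}
target = {"w": [0.0, 0.0]}

# One slow update with an assumed rate tau = 0.1.
soft_update(target, online, tau=0.1)
after_soft = list(target["w"])

# A replacement update, applied whenever step % n == 0 (n is assumed).
n = 4
for step in range(1, 9):
    if step % n == 0:
        hard_update(target, online)
after_hard = list(target["w"])

print(after_soft, after_hard)
```

In both variants the rate of change is the hyperparameter: `n` controls how often the target is replaced, and `tau` controls how far it moves per update.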

5

votes

2 answers

What is the difference between DQN and AlphaGo Zero?

I have already implemented a relatively simple DQN on Pacman. Now I would like to clearly understand the difference between a DQN and the techniques used by AlphaGo Zero/AlphaZero, and I couldn't find a place where the features of both approaches are…

reinforcement-learning dqn alphazero deep-rl alphago-zero

asked Feb 27 '19 at 06:17

FenryrMKIII

153
2
6
