Highest Voted Questions - Artificial Intelligence Stack Exchange

14

votes

1 answer

What is the relation between online (or offline) learning and on-policy (or off-policy) algorithms?

In the context of RL, there is the notion of on-policy and off-policy algorithms. I understand the difference between on-policy and off-policy algorithms. Moreover, in RL, there's also the notion of online and offline learning. What is the relation…

reinforcement-learning off-policy-methods on-policy-methods online-learning offline-reinforcement-learning

asked Feb 09 '19 at 14:48

nbro

39,006
12
98
176

14

votes

3 answers

Why does it make sense to normalize rewards per episode in reinforcement learning?

In OpenAI's actor-critic and in OpenAI's REINFORCE, the rewards are normalized like so: rewards = (rewards - rewards.mean()) / (rewards.std() + eps), on every episode individually. This is probably the baseline reduction, but I'm not entirely…

reinforcement-learning policy-gradients variance-reduction reward-normalization

asked Jan 24 '19 at 13:56

Gulzar

729
1
8
23
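
The per-episode normalization quoted in the question above, rewards = (rewards - rewards.mean()) / (rewards.std() + eps), can be sketched in plain Python as follows. This is only an illustration: the function name `normalize_returns`, the default `eps`, and the sample values are assumptions, and the population standard deviation is used here, whereas a tensor library's `.std()` may divide by n - 1 instead.

```python
import math


def normalize_returns(returns, eps=1e-8):
    """Shift one episode's returns to zero mean and scale them by their std."""
    n = len(returns)
    mean = sum(returns) / n
    # Population standard deviation (assumption; some libraries use n - 1).
    std = math.sqrt(sum((r - mean) ** 2 for r in returns) / n)
    # eps guards against division by zero when all returns are equal.
    return [(r - mean) / (std + eps) for r in returns]


# Hypothetical returns collected over a single episode.
episode_returns = [1.0, 2.0, 3.0, 4.0]
normalized = normalize_returns(episode_returns)
```

After normalization, returns below the episode mean become negative and those above it become positive, so roughly half of the actions in an episode are pushed down and half are pushed up regardless of the raw reward scale.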

13

votes

4 answers

Is the singularity something to be taken seriously?

The term Singularity is often used in mainstream media for describing visionary technology. It was introduced by Ray Kurzweil in a popular book The Singularity Is Near: When Humans Transcend Biology (2005). In his book, Kurzweil gives an outlook to…

philosophy social superintelligence singularity mythology-of-ai

asked Sep 07 '18 at 13:31

Bucky Rogerson

131
4

13

votes

2 answers

Which layer in a CNN consumes more training time: convolution layers or fully connected layers?

In a convolutional neural network, which layer consumes more training time: convolution layers or fully connected layers? We can take the AlexNet architecture as an example. I want to see the time breakdown of the training process. I want a relative…

neural-networks deep-learning convolutional-neural-networks training

asked Sep 06 '18 at 23:27

Ruchit Dalwadi

325
3
11

13

votes

1 answer

Why is A* optimal if the heuristic function is admissible?

A heuristic is admissible if it never overestimates the true cost to reach the goal node from $n$. If a heuristic is consistent, then the heuristic value of $n$ is never greater than the cost of reaching its successor, $n'$, plus the successor's heuristic…

search proofs heuristics a-star admissible-heuristic

asked Apr 13 '18 at 02:25

Wizard

303
1
2
6
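
The two definitions in the question above (admissible: h(n) never exceeds the true cost from n to the goal; consistent: h(n) is at most the step cost to a successor n' plus h(n')) can be checked mechanically on a small graph. The sketch below is illustrative only: the graph, the heuristic tables, and the helper names are all hypothetical, and every node is assumed to appear as a key in `graph`.

```python
import heapq


def true_costs_to_goal(graph, goal):
    """Dijkstra on the reversed graph: cheapest cost from each node to goal."""
    reverse = {node: [] for node in graph}
    for node, edges in graph.items():
        for succ, cost in edges:
            reverse[succ].append((node, cost))
    dist = {goal: 0}
    queue = [(0, goal)]
    while queue:
        d, node = heapq.heappop(queue)
        if d > dist.get(node, float("inf")):
            continue  # stale queue entry
        for pred, cost in reverse[node]:
            nd = d + cost
            if nd < dist.get(pred, float("inf")):
                dist[pred] = nd
                heapq.heappush(queue, (nd, pred))
    return dist


def is_admissible(graph, h, goal):
    """h(n) <= true cost from n to goal, for every node that can reach goal."""
    costs = true_costs_to_goal(graph, goal)
    return all(h[n] <= costs[n] for n in costs)


def is_consistent(graph, h, goal):
    """h(goal) == 0 and h(n) <= c(n, n') + h(n') for every edge n -> n'."""
    return h[goal] == 0 and all(
        h[n] <= cost + h[succ]
        for n, edges in graph.items()
        for succ, cost in edges
    )


# Hypothetical directed graph: node -> list of (successor, step cost).
graph = {
    "S": [("A", 1), ("B", 4)],
    "A": [("B", 1), ("G", 5)],
    "B": [("G", 1)],
    "G": [],
}
h_consistent = {"S": 3, "A": 2, "B": 1, "G": 0}
h_admissible_only = {"S": 3, "A": 0, "B": 1, "G": 0}
```

In this example `h_admissible_only` never overestimates a true cost, yet it violates the consistency inequality on the edge S -> A, which illustrates that admissibility does not imply consistency.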

13

votes

1 answer

How exactly can ReLUs approximate non-linear and curved functions?

Currently, the most commonly used activation functions are ReLUs. So I answered the question "What is the purpose of an activation function in neural networks?" and, while writing the answer, it struck me: how exactly can ReLUs approximate a…

neural-networks machine-learning activation-functions relu

asked Mar 09 '18 at 10:45

user9947

13

votes

5 answers

What is the fundamental difference between CNN and RNN?

What is the fundamental difference between convolutional neural networks and recurrent neural networks? Where are they applied?

neural-networks convolutional-neural-networks recurrent-neural-networks comparison

asked Dec 08 '17 at 14:48

Pradeep BV

151
1
7

13

votes

3 answers

Is it possible to train a neural network to estimate a vehicle's length?

I have a large dataset (over 100k samples) of vehicles with the ground truth of their lengths. Is it possible to train a deep network to measure/estimate vehicle length? I haven't seen any papers related to estimating object size using a deep neural…

machine-learning deep-learning computer-vision training reference-request

asked Oct 16 '17 at 18:10

Naji

139
1
1
3

13

votes

4 answers

What is the "dropout" technique?

What purpose does the "dropout" method serve and how does it improve the overall performance of the neural network?

deep-neural-networks overfitting performance

asked Aug 02 '16 at 16:08

kenorb

10,423
3
43
91

13

votes

2 answers

Why does ChatGPT not give the answer text all at once?

When ChatGPT is generating an answer to my question, it generates it word by word. So I actually have to wait until I get the final answer. Is this just for show? Or is it really generating the answer word by word in real time, not yet knowing what the…

natural-language-processing language-model chatgpt

asked Jan 27 '23 at 15:18

Sander van den Oord

231
1
5

13

votes

5 answers

Is there a rigorous proof that AGI is possible, at least, in theory?

It is often implicitly assumed in computer science that the human mind, or at least some mechanical calculations that humans perform (see the Church-Turing thesis), can be replicated with a Turing machine, therefore Artificial General Intelligence…

philosophy agi proofs neuroscience computational-theory-of-mind

asked Jul 03 '17 at 16:55

yters

387
2
10

13

votes

2 answers

Input/output encoding for a neural network to learn a grid-based game

I am writing a simple toy game with the intent of training a deep neural network on top of it. The game's rules are roughly the following: The game has a board made up of hexagonal cells. Both players have the same collection of pieces that they can…

neural-networks deep-learning game-ai

asked Jun 09 '17 at 11:21

Totem

381
2
6

13

votes

3 answers

How to train a neural network for a round-based board game?

I'm wondering how to train a neural network for a round-based board game like tic-tac-toe, chess, Risk or any other round-based game. Getting the next move by inference seems to be pretty straightforward, by feeding the game state as input and…

training tensorflow game-ai

asked May 19 '17 at 18:38

soriak

239
1
2
3

13

votes

6 answers

What are good alternatives to the expression "Artificial Intelligence"?

I read a really interesting article titled "Stop Calling it Artificial Intelligence" that made a compelling critique of the name "Artificial Intelligence". The word intelligence is so broad that it's hard to say whether "Artificial Intelligence" is…

philosophy terminology

asked Apr 17 '17 at 20:00

user6698

13

votes

2 answers

Are the shortcomings of neural networks diminishing?

Having worked with neural networks for about half a year, I have experienced first-hand what are often claimed as their main disadvantages, i.e. overfitting and getting stuck in local minima. However, through hyperparameter optimization and some…

neural-networks machine-learning overfitting support-vector-machine no-free-lunch-theorems

asked Jan 10 '17 at 08:38

user4747
