Most Popular

1500 questions
14
votes
1 answer

What is the relation between online (or offline) learning and on-policy (or off-policy) algorithms?

In the context of RL, there is the notion of on-policy and off-policy algorithms. I understand the difference between on-policy and off-policy algorithms. Moreover, in RL, there's also the notion of online and offline learning. What is the relation…
14
votes
3 answers

Why does is make sense to normalize rewards per episode in reinforcement learning?

In Open AI's actor-critic and in Open AI's REINFORCE, the rewards are being normalized like so rewards = (rewards - rewards.mean()) / (rewards.std() + eps) on every episode individually. This is probably the baseline reduction, but I'm not entirely…
13
votes
4 answers

Is the singularity something to be taken seriously?

The term Singularity is often used in mainstream media for describing visionary technology. It was introduced by Ray Kurzweil in a popular book The Singularity Is Near: When Humans Transcend Biology (2005). In his book, Kurzweil gives an outlook to…
13
votes
2 answers

Which layer in a CNN consumes more training time: convolution layers or fully connected layers?

In a convolutional neural network, which layer consumes more training time: convolution layers or fully connected layers? We can take AlexNet architecture to understand this. I want to see the time breakup of the training process. I want a relative…
13
votes
1 answer

Why is A* optimal if the heuristic function is admissible?

A heuristic is admissible if it never overestimates the true cost to reach the goal node from $n$. If a heuristic is consistent, then the heuristic value of $n$ is never greater than the cost of its successor, $n'$, plus the successor's heuristic…
Wizard
  • 303
  • 1
  • 2
  • 6
13
votes
1 answer

How exactly can ReLUs approximate non-linear and curved functions?

Currently, the most commonly used activation functions are ReLUs. So I answered this question What is the purpose of an activation function in neural networks? and, while writing the answer, it struck me, how exactly can ReLUs approximate a…
user9947
13
votes
5 answers

What is the fundamental difference between CNN and RNN?

What is the fundamental difference between convolutional neural networks and recurrent neural networks? Where are they applied?
13
votes
3 answers

Is it possible to train a neural network to estimate a vehicle's length?

I have a large dataset (over 100k samples) of vehicles with the ground truth of their lengths. Is it possible to train a deep network to measure/estimate vehicle length? I haven't seen any papers related to estimating object size using a deep neural…
13
votes
4 answers

What is the "dropout" technique?

What purpose does the "dropout" method serve and how does it improve the overall performance of the neural network?
kenorb
  • 10,423
  • 3
  • 43
  • 91
13
votes
2 answers

Why does ChatGPT not give the answer text all at once?

When ChatGPT is generating an answer to my question, it generates it word by word. So I actually have to wait until I get the final answer. Is this just for show? Or is it really real-time generating the answer word by word not knowing yet what the…
13
votes
5 answers

Is there a rigorous proof that AGI is possible, at least, in theory?

It is often implicitly assumed in computer science that the human mind, or at least some mechanical calculations that humans perform (see the Church-Turing thesis), can be replicated with a Turing machine, therefore Artificial General Intelligence…
13
votes
2 answers

Input/output encoding for a neural network to learn a grid-based game

I am writing a simple toy game with the intent of training a deep neural network on top of it. The games rules are roughly the following: The game has a board made up of hexagonal cells. Both players have the same collection of pieces that they can…
Totem
  • 381
  • 2
  • 6
13
votes
3 answers

How to train a neural network for a round based board game?

I'm wondering how to train a neural network for a round based board game like, tic-tac-toe, chess, risk or any other round based game. Getting the next move by inference seems to be pretty straight forward, by feeding the game state as input and…
soriak
  • 239
  • 1
  • 2
  • 3
13
votes
6 answers

What are good alternatives to the expression "Artificial Intelligence"?

I read a really interesting article titled "Stop Calling it Artificial Intelligence" that made a compelling critique of the name "Artificial Intelligence". The word intelligence is so broad that it's hard to say whether "Artificial Intelligence" is…
user6698
13
votes
2 answers

Are the shortcomings of neural networks diminishing?

Having worked with neural networks for about half a year, I have experienced first-hand what are often claimed as their main disadvantages, i.e. overfitting and getting stuck in local minima. However, through hyperparameter optimization and some…