Most Popular
1500 questions
24
votes
4 answers
How could emotional intelligence be implemented?
I've seen emotional intelligence defined as the capacity to be aware of, control, and express one's emotions, and to handle interpersonal relationships judiciously and empathetically.
What are some strategies for artificial intelligence to begin…

WilliamKF
- 2,493
- 1
- 24
- 31
24
votes
3 answers
Why do most deep learning papers not include an implementation?
I'm a novice researcher, and as I started to read papers in the area of deep learning I noticed that the implementation is normally not added and is needed to be searched elsewhere, and my question is how come that's the case? The paper's authors…

Gilad Deutsch
- 629
- 5
- 12
24
votes
2 answers
Are there any ongoing projects which use the Stack Exchange for machine learning?
Are there any ongoing AI projects which use the Stack Exchange for machine learning?

Techidiot
- 349
- 1
- 7
24
votes
3 answers
Why can't OCR be perceived as a good example of AI?
On the Wikipedia page about AI, we can read:
Optical character recognition is no longer perceived as an exemplar of "artificial intelligence" having become a routine technology.
On the other hand, the MNIST database of handwritten digits is…

kenorb
- 10,423
- 3
- 43
- 91
24
votes
5 answers
What is non-Euclidean data?
What is non-Euclidean data?
Here are some sub-questions
Where does this type of data arise? I have come across this term in the context of geometric deep learning and graph neural networks.
Apparently, graphs and manifolds are non-Euclidean data.…

nbro
- 39,006
- 12
- 98
- 176
23
votes
3 answers
How do I choose the optimal batch size?
Batch size is a term used in machine learning and refers to the number of training examples utilised in one iteration. The batch size
can be one of three options:
batch mode: where the batch size is equal to the total dataset thus making the…

Sebastian Nielsen
- 363
- 1
- 2
- 10
23
votes
4 answers
How does ChatGPT know math?
ChatGPT is a language model. As far as I know and If I'm not wrong, it gets text as tokens and word embeddings. So, how can it do math? For example, I asked:
ME: Which one is bigger 5 or 9.
ChatGPT: In this case, 9 is larger than 5.
One can say,…

Peyman
- 534
- 3
- 10
23
votes
2 answers
What are the flaws in Jeff Hawkins's AI framework?
In 2004 Jeff Hawkins, inventor of the palm pilot, published a very interesting book called On Intelligence, in which he details a theory how the human neocortex works.
This theory is called Memory-Prediction framework and it has some striking…

BlindKungFuMaster
- 4,185
- 11
- 23
23
votes
5 answers
What is the difference between machine learning and deep learning?
Can someone explain to me the difference between machine learning and deep learning? Is it possible to learn deep learning without knowing machine learning?

Addis
- 333
- 5
- 9
23
votes
3 answers
Can the decoder in a transformer model be parallelized like the encoder?
Can the decoder in a transformer model be parallelized like the encoder?
As far as I understand, the encoder has all the tokens in the sequence to compute the self-attention scores. But for a decoder, this is not possible (in both training and…

shiredude95
- 333
- 2
- 6
23
votes
1 answer
When should I use Reinforcement Learning vs PID Control?
When designing solutions to problems such as the Lunar Lander on OpenAIGym, Reinforcement Learning is a tempting means of giving the agent adequate action control so as to successfully land.
But what are the instances in which control system…

SeeDerekEngineer
- 521
- 4
- 11
22
votes
1 answer
What are the advantages of ReLU vs Leaky ReLU and Parametric ReLU (if any)?
I think that the advantage of using Leaky ReLU instead of ReLU is that in this way we cannot have vanishing gradient. Parametric ReLU has the same advantage with the only difference that the slope of the output for negative inputs is a learnable…

gvgramazio
- 696
- 2
- 7
- 19
22
votes
4 answers
Why does ChatGPT fail in playing "20 questions"?
IBM Watson's success in playing "Jeopardy!" was a landmark in the history of artificial intelligence. In the seemingly simpler game of "Twenty questions" where player B has to guess a word that player A thinks of by asking questions to be answered…

Hans-Peter Stricker
- 811
- 1
- 8
- 20
22
votes
2 answers
Why would you implement the position-wise feed-forward network of the transformer with convolution layers?
The Transformer model introduced in "Attention is all you need" by Vaswani et al. incorporates a so-called position-wise feed-forward network (FFN):
In addition to attention sub-layers, each of the layers in our encoder
and decoder contains a…

Eli Korvigo
- 321
- 2
- 6
22
votes
1 answer
Has the Lovelace Test 2.0 been successfully used in an academic setting?
In October 2014, Dr. Mark Riedl published an approach to testing AI intelligence, called the "Lovelace Test 2.0", after being inspired by the original Lovelace Test (published in 2001). Mark believed that the original Lovelace Test would be…

Left SE On 10_6_19
- 1,660
- 9
- 23