For questions about the convolutional neural network known as AlexNet, which was designed by Alex Krizhevsky (hence the name) and published with Ilya Sutskever and Krizhevsky's doctoral advisor, Geoffrey Hinton.
Questions tagged [alexnet]
8 questions
6
votes
1 answer
How should the racing agent take into account the velocity of the vehicle, given the images with a speedometer?
I'm developing a game AI that tries to master racing simulations. I have already trained a CNN (AlexNet) on in-game footage of me playing the game, with the pressed keys as the target. As the CNN is only making predictions on a frame-to-frame basis, and…

TheJD
4
votes
2 answers
Why do we need 10 bits to represent the 1000 classes in AlexNet?
I'm reading the AlexNet paper. In section 4, where the authors explain how they prevent overfitting, they mention
"Although the 1000 classes of ILSVRC make each training example impose 10 bits of constraint on the mapping from image to label".…

harupy
3
votes
0 answers
How can I calculate the number of matrix additions, multiplications and divisions in AlexNet?
I'm a first-year student in machine learning, and I have only recently started to immerse myself in it.
I need to calculate the number of:
matrix additions
matrix multiplications
matrix divisions
which are processed in the well-known convolutional neural…

Ivan Talalaev
2
votes
1 answer
Can the (sparse) categorical cross-entropy be greater than one?
I am using an AlexNet CNN to classify my dataset, which contains 10 classes and 1000 samples per class, with a 60-30-10 split for training, validation, and testing. I used different batch sizes, learning rates, activation functions, and initializers. I'm…

SahaTib
2
votes
1 answer
What could cause a big fluctuation of the loss in the last epochs of training an AlexNet?
I am training an AlexNet neural network with about 12000 images, of which 80% are for training, 10% for validation, and the remaining 10% for testing.
I have a problem with my plots. There is a big fluctuation at epoch 47.
How can I get a smooth plot?…

SahaTib
2
votes
1 answer
Why am I getting a difference between training accuracy and accuracy calculated with Keras' predict_classes on a subset of the training data?
I'm trying to solve a binary classification problem with AlexNet. I split the original dataset into training and validation datasets using a 70/30 ratio. I have trained my neural network with a dataset of 11200 images, and I obtained a training…

Saha
2
votes
1 answer
How is a neural network where the majority of inputs are 0 trained?
Consider AlexNet, which has 1000 output nodes, each of which classifies an image:
The problem I have been having with training a neural network of similar proportions is that it does what any reasonable network would do: it finds the easiest way…

Recessive
1
vote
0 answers
What is the reason for different learned features in upper and lower half in AlexNet?
I was reading the AlexNet paper, and the authors state that
the kernels on one GPU were "largely color agnostic," whereas the kernels on the other GPU were largely "color-specific."
The upper GPU operates on the filters on the top, and the lower GPU deals…

sak18