I have seen several architectures, such as CNN-LSTM with and without an attention model, the use of GloVe vectors, self-critical models, etc. I am overwhelmed by the number of different notebooks and architectures, so I came here for guidance. I am looking to build a personal project on image annotation. Also, if I wanted to use this deep learning model together with a TFX pipeline, what would be the best type of architecture to go with?
1 Answer
Here are a couple of Kaggle Kernels, Notebooks and Tutorials for Image Captioning.

Saurav Maheshkar