Let's say I have designed an ML model that takes a video of a dog running around as input and outputs the dog's breed. However, I do not want to wait for the video to finish before feeding it to my model. I want something like the following to happen:
I am casually filming my backyard when, midway through, a dog runs past the camera. Immediately, my model should identify that (a) a dog has appeared in view and (b) the dog is a Labrador Retriever.
To achieve this, I have the following questions:
- Do I need to train a new model that detects when a dog has appeared within view?
- How can I set up my model so that it accepts continuous input and keeps running, producing output as soon as each new frame arrives?
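To make the two questions concrete, here is a rough sketch of the behaviour I am after, not a working solution. It assumes frames arrive one at a time from some iterable, that a cheap per-frame check decides whether a dog is in view, and that the breed classifier runs on a short sliding window of recent frames. The names `stream_predictions`, `dog_present`, `classify_breed`, and `window_size` are hypothetical, and the two stub functions stand in for real models.

```python
from collections import deque
from typing import Callable, Iterable, Iterator, Optional, Sequence, Tuple

# Placeholder frame type: a real frame would be an image array.
Frame = Sequence[float]


def stream_predictions(
    frames: Iterable[Frame],
    dog_present: Callable[[Frame], bool],
    classify_breed: Callable[[Sequence[Frame]], str],
    window_size: int = 8,
) -> Iterator[Tuple[int, Optional[str]]]:
    """Yield (frame_index, breed or None) for each frame as soon as it arrives."""
    # Sliding window of the most recent frames that contained a dog.
    window: deque = deque(maxlen=window_size)
    for index, frame in enumerate(frames):
        if dog_present(frame):
            # Question (a): a dog is in view, so buffer the frame...
            window.append(frame)
            # ...and question (b): classify using only the frames seen so far.
            yield index, classify_breed(list(window))
        else:
            # No dog in view: drop stale frames and report nothing.
            window.clear()
            yield index, None


# Stub "models" for illustration only (assumptions, not real detectors).
def fake_dog_present(frame: Frame) -> bool:
    return frame[0] > 0.5


def fake_classify_breed(window: Sequence[Frame]) -> str:
    return "Labrador Retriever"


# Five fake frames: the "dog" appears in frames 2 and 3.
fake_frames = [[0.0], [0.0], [0.9], [0.8], [0.1]]
results = list(stream_predictions(fake_frames, fake_dog_present, fake_classify_breed))
print(results)
```

Because `stream_predictions` is a generator, a result is produced for each frame without waiting for the video to end. Whether the "dog in view" check has to be a separately trained model, or can be derived from the existing classifier, is exactly what my first question is asking.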
Note: This question was originally posted on Stack Overflow, where it was closed as unsuitable for that site, so I am reposting it here. Link to the original post here.