I am working on a computer vision project that uses face detection to record how long a person spends in an office.
The idea is to detect a person's face with camera 1 (at the entrance), store the detected face temporarily, and compute the time elapsed until the same person leaves and their face is detected by camera 2. We don't have a customer database, so faces can only be matched against those seen earlier by camera 1.
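As a rough sketch of that flow: assuming each detected face has already been converted into an embedding vector by some face-recognition model (not shown here), the matching and timing step could look something like the following. The names `DwellTracker`, `on_entry`, and `on_exit`, and the similarity threshold of 0.6, are hypothetical and chosen only for illustration.

```python
import math
import time


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class DwellTracker:
    """Temporarily stores entry embeddings and matches them at the exit."""

    def __init__(self, threshold=0.6):
        # Assumed threshold; a real value depends on the embedding model.
        self.threshold = threshold
        self.pending = []  # list of (embedding, entry_timestamp)

    def on_entry(self, embedding, timestamp=None):
        """Called when camera 1 sees a face."""
        ts = time.time() if timestamp is None else timestamp
        self.pending.append((list(embedding), ts))

    def on_exit(self, embedding, timestamp=None):
        """Called when camera 2 sees a face.

        Returns the time spent in seconds, or None if no stored face
        is similar enough to the one seen at the exit.
        """
        now = time.time() if timestamp is None else timestamp
        best_idx, best_sim = None, self.threshold
        for i, (stored, _) in enumerate(self.pending):
            sim = cosine_similarity(stored, embedding)
            if sim >= best_sim:
                best_idx, best_sim = i, sim
        if best_idx is None:
            return None
        # Remove the matched face so it is not kept longer than needed.
        _, entered = self.pending.pop(best_idx)
        return now - entered
```

With this sketch, calling `on_entry` for each face seen by camera 1 and `on_exit` for each face seen by camera 2 returns the elapsed time for the best match above the threshold, and the stored face is discarded once matched.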
Is there a better approach to follow? I would also appreciate articles to read on the topic.