Now embedding compared one-to-one. But more robust approach: 1. Select embeddings for each object on N last frames. 2. Compare all embeddings {E_i} on N frames with new embedding for detect - D. 3. Final result: M = cout( { cosine_dist(E_i, D) > t } ), M/N > T, where 0 < T < 1