Video and image annotation is the process of attaching metadata to unlabeled videos and images so that they can be used to develop and train machine learning algorithms, which makes it vital to the development of artificial intelligence. The metadata attached to the images and videos is called labels or tags, and it can be applied in various ways, such as annotating individual pixels with semantic meaning.
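To make the difference between image-level tags and pixel-level labels concrete, here is a minimal sketch in plain Python. The class names, class ids, file name, and the tiny 4x6 "image" are all assumptions made up for illustration; real annotation tools and datasets define their own formats.

```python
# Hypothetical class ids and names, chosen only for this illustration.
CLASS_NAMES = {0: "background", 1: "road", 2: "car"}

# Image-level annotation: tags that describe the image as a whole.
# The file name is a placeholder, not a real file.
image_tags = {"file": "frame_0001.png", "tags": ["car", "road"]}

# Pixel-level (semantic) annotation: one class id per pixel.
# Each inner list is one row of a toy 4x6 image.
mask = [
    [0, 0, 0, 0, 0, 0],
    [0, 0, 2, 2, 0, 0],
    [1, 1, 2, 2, 1, 1],
    [1, 1, 1, 1, 1, 1],
]


def pixel_counts(mask, class_names):
    """Count how many pixels carry each class label."""
    counts = {name: 0 for name in class_names.values()}
    for row in mask:
        for class_id in row:
            counts[class_names[class_id]] += 1
    return counts


def tags_from_mask(mask, class_names, ignore=("background",)):
    """Derive image-level tags from the classes present in the mask."""
    present = {class_names[class_id] for row in mask for class_id in row}
    return sorted(present - set(ignore))


if __name__ == "__main__":
    print(pixel_counts(mask, CLASS_NAMES))
    print(tags_from_mask(mask, CLASS_NAMES))
```

The pixel-level mask carries strictly more information than the tags: the tags can be derived from the mask, but not the other way around, which is why pixel-level annotation is typically more laborious to produce.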