Frame annotation is one of the basic forms of video annotation. It splits the video into frame-by-frame static images, and annotates information such as targets, categories and attributes for each frame image (such as bounding box annotation of "pedestrians" and "cars" in each frame of the video), providing frame-level data support for video temporal analysis and target tracking.