Trained weights using official coco dataset files, video detection in about 20 frames, but with their training sets out the weight of files, to run on only around 7, 8 frames,
After the network structure have been identified, I don't think weight file will influence the speed will only affect the accuracy of detection, and what are the factors that may influence the speed of detection?