World Scientific
  • Search
  •   
Skip main navigation

Cookies Notification

We use cookies on this site to enhance your user experience. By continuing to browse the site, you consent to the use of our cookies. Learn More
×
Our website is made possible by displaying certain online content using javascript.
In order to view the full content, please disable your ad blocker or whitelist our website www.worldscientific.com.

System Upgrade on Tue, Oct 25th, 2022 at 2am (EDT)

Existing users will be able to log into the site and access content. However, E-commerce and registration of new users may not be available for up to 12 hours.
For online purchase, please visit us again. Contact us at [email protected] for any enquiries.

Object Detection Via Flexible Anchor Generation

    https://doi.org/10.1142/S0218001421550120Cited by:1 (Source: Crossref)

    This paper designs a method that can generate anchors of various shapes for the object detection framework. This method has the characteristics of novelty and flexibility. Different from the previous anchors generated by a pre-defined manner, our anchors are generated dynamically by an anchor generator. Specially, the anchor generator is not fixed but learned from the hand-designed anchors, which means that our anchor generator is able to work well in various scenes. In the inference time, the weights of anchor generator are estimated by a simple network where the input is some hand-designed anchor. In addition, in order to make the difference between the number of positive and negative samples smaller, we use an adaptive IOU threshold related to the object size to solve this problem. At the same time, we proved that our proposed method is effective and conducted a lot of experiments on the COCO dataset. Experimental results show that after replacing the anchor generation method in the previous object detectors (such as SSD, mask RCNN, and Retinanet) with our proposed method, the detection performance of the model has been greatly improved compared to before the replacement, which proves our method is effective.

    References

    • 1. Z. Cai and N. Vasconcelos , Cascade r-cnn: Delving into high quality object detection, in Proc. IEEE Conf. Computer Vision and Pattern Recognition (2018), pp. 6154–6162. CrossrefGoogle Scholar
    • 2. X. Chen and A. Gupta, Spatial memory for context reasoning in object detection (2017), arXiv:1704.04224. Google Scholar
    • 3. X. Chen, L.-J. Li, L. Fei-Fei and A. Gupta, Iterative visual reasoning beyond convolutions (2018), arXiv:1803.11189. Google Scholar
    • 4. J. Dai, Y. Li, K. He and J. Sun , R-fcn: Object detection via region-based fully convolutional networks, in Advances in Neural Information Processing Systems (2016), pp. 379–387. Google Scholar
    • 5. J. Dai, H. Qi, Y. Xiong, Y. Li, G. Zhang, H. Hu and Y. Wei , Deformable convolutional networks, CoRR 1(2) (2017) 3. abs/1703.06211. Google Scholar
    • 6. M. Everingham, L. Van Gool, C. K. Williams, J. Winn and A. Zisserman , The pascal visual object classes (voc) challenge, Int. J. Comput. Vis. 88(2) (2010) 303–338. Crossref, ISIGoogle Scholar
    • 7. C.-Y. Fu, W. Liu, A. Ranga, A. Tyagi and A. C. Berg, Dssd: Deconvolutional single shot detector (2017), arXiv:1701.06659. Google Scholar
    • 8. S. Gidaris and N. Komodakis, Attend refine repeat: Active box proposal generation via in-out localization (2016), arXiv:1606.04446. Google Scholar
    • 9. K. He, G. Gkioxari, P. Dollár and R. Girshick , Mask r-cnn, Computer Vision (ICCV), 2017 IEEE Int. Conf. (2017), pp. 2980–2988. CrossrefGoogle Scholar
    • 10. K. He, X. Zhang, S. Ren and J. Sun , Identity mappings in deep residual networks, European Conf. Computer Vision (2016), pp. 630–645. CrossrefGoogle Scholar
    • 11. R. Jin and D. Lin , Adaptive anchor for fast object detection in aerial image, IEEE Geosci. Remote Sens. Lett. 17(5) (2019) 839–843. Crossref, ISIGoogle Scholar
    • 12. H. Law and J. Deng , Cornernet: Detecting objects as paired keypoints, in Proc. European Conf. Computer Vision (ECCV) (2018), pp. 734–750. CrossrefGoogle Scholar
    • 13. J. Leng, Y. Ren, W. Jiang, X. Sun and Y. Wang , Realize your surroundings: Exploiting context information for small object detection, Neurocomputing 433 (2021) 287–299. Crossref, ISIGoogle Scholar
    • 14. H. Li, Y. Liu, W. Ouyang and X. Wang , Zoom out-and-in network with map attention decision for region proposal and object detection, Int. J. Comput. Vis. 127 (2019) 225–238. Crossref, ISIGoogle Scholar
    • 15. T.-Y. Lin, P. Dollár, R. B. Girshick, K. He, B. Hariharan and S. J. Belongie , Feature pyramid networks for object detection, CVPR, Vol. 1(2) (2017), p. 4. CrossrefGoogle Scholar
    • 16. T.-Y. Lin, P. Goyal, R. Girshick, K. He and P. Dollár , Focal loss for dense object detection, in Proc. IEEE Int. Conf. Computer Vision (2017), pp. 2980–2988. CrossrefGoogle Scholar
    • 17. T.-Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollár and C. L. Zitnick , Microsoft coco: Common objects in context, European Conf. Computer Vision (2014), pp. 740–755. CrossrefGoogle Scholar
    • 18. W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y. Fu and A. C. Berg , Ssd: Single shot multibox detector, European Conf. Computer Vision (2016), pp. 21–37. CrossrefGoogle Scholar
    • 19. Y. Liu, P. Sun, N. Wergeles and Y. Shang , A survey and performance evaluation of deep learning methods for small object detection, Expert Syst. Appl. 172 (2021) 114602. Crossref, ISIGoogle Scholar
    • 20. H.-F. Lu, X. Du and P.-L. Chang , Toward scale-invariance and position-sensitive region proposal networks, in Proc. European Conf. Computer Vision (ECCV) (2018), pp. 168–183. CrossrefGoogle Scholar
    • 21. W. Ma, T. Tian, H. Xu, Y. Huang and Z. Li , Aabo: Adaptive anchor box optimization for object detection via bayesian sub-sampling, European Conf. Computer Vision (2020), pp. 560–575. CrossrefGoogle Scholar
    • 22. P. O. Pinheiro, T.-Y. Lin, R. Collobert and P. Dollár , Learning to refine object segments, European Conf. Computer Vision (2016), pp. 75–91. CrossrefGoogle Scholar
    • 23. J. Redmon and A. Farhadi , Yolo9000: Better, faster, stronger, in Proc. IEEE Conf. Computer Vision and Pattern Recognition (2017), pp. 7263–7271. CrossrefGoogle Scholar
    • 24. J. Redmon and A. Farhadi, Yolov3: An incremental improvement (2018), arXiv:1804.02767. Google Scholar
    • 25. S. Ren, K. He, R. Girshick and J. Sun , Faster r-cnn: Towards real-time object detection with region proposal networks, IEEE Trans. Pattern Anal. Mach. Intell. 39(6) (2017) 1137–1149. Crossref, ISIGoogle Scholar
    • 26. R. Solovyev, W. Wang and T. Gabruseva , Weighted boxes fusion: Ensembling boxes from different object detection models, Image Vis. Comput. 107 (2021) 104117. Crossref, ISIGoogle Scholar
    • 27. D. Zhang, J. Li, X. Li, Z. Du, L. Xiong and M. Ye , Local–global attentive adaptation for object detection, Eng. Appl. Artif. Intell. 100 (2021) 104208. Crossref, ISIGoogle Scholar
    • 28. M. Zhang, Y. Chen, X. Liu, B. Lv and J. Wang , Adaptive anchor networks for multi-scale object detection in remote sensing images, IEEE Access 8 (2020) 57552–57565. Crossref, ISIGoogle Scholar
    • 29. S. Zhang, L. Wen, X. Bian, Z. Lei and S. Z. Li , Single-shot refinement neural network for object detection, in Proc. IEEE Conf. Computer Vision and Pattern Recognition (2018), pp. 4203–4212. CrossrefGoogle Scholar
    • 30. X. Zhou, J. Zhuo and P. Krahenbuhl , Bottom-up object detection by grouping extreme and center points, in Proc. IEEE/CVF Conf. Computer Vision and Pattern Recognition (2019), pp. 850–859. CrossrefGoogle Scholar