3v324v23's picture
Add files
c9019cd
|
raw
history blame
2.97 kB

Grid R-CNN

Introduction

@inproceedings{lu2019grid,
  title={Grid r-cnn},
  author={Lu, Xin and Li, Buyu and Yue, Yuxin and Li, Quanquan and Yan, Junjie},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  year={2019}
}

@article{lu2019grid,
  title={Grid R-CNN Plus: Faster and Better},
  author={Lu, Xin and Li, Buyu and Yue, Yuxin and Li, Quanquan and Yan, Junjie},
  journal={arXiv preprint arXiv:1906.05688},
  year={2019}
}

Results and Models

Backbone Lr schd Mem (GB) Inf time (fps) box AP Config Download
R-50 2x 5.1 15.0 40.4 config model | log
R-101 2x 7.0 12.6 41.5 config model | log
X-101-32x4d 2x 8.3 10.8 42.9 config model | log
X-101-64x4d 2x 11.3 7.7 43.0 config model | log

Notes:

  • All models are trained with 8 GPUs instead of 32 GPUs in the original paper.
  • The warming up lasts for 1 epoch and 2x here indicates 25 epochs.