Segmentation As Selective Search for Object Recognition. van de Sande et al. ICCV 2011
RCNN
Rich feature hierarchies for accurate object detection and semantic segmentation. Girshick et al. CVPR 2014.
https://people.eecs.berkeley.edu/~rbg/papers/r-cnn-cvpr.pdf
Fast-RCNN
https://arxiv.org/abs/1504.08083 Idea: No need to recompute features for every box independently; compute them once per image and pool them per box. Regress refined bounding-box coordinates.
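A minimal sketch of both ideas in plain Python. It assumes the feature map is a 2-D list computed once per image, that a box (RoI) is given in feature-map cells with inclusive corners, and that the regression uses the (dx, dy, dw, dh) parameterization from the R-CNN paper. The names `roi_max_pool` and `apply_box_deltas` are illustrative and do not come from any library.

```python
import math


def roi_max_pool(fmap, roi, out_h=2, out_w=2):
    """Max-pool one RoI of a shared feature map into an out_h x out_w grid.

    fmap: 2-D list of numbers, computed once for the whole image.
    roi:  (x0, y0, x1, y1), inclusive corners in feature-map cells.
    """
    x0, y0, x1, y1 = roi
    h, w = y1 - y0 + 1, x1 - x0 + 1
    pooled = []
    for i in range(out_h):
        # Bin boundaries: floor for the start, ceil for the end,
        # so every bin covers at least one cell.
        ys = y0 + (i * h) // out_h
        ye = y0 + -(-((i + 1) * h) // out_h)
        row = []
        for j in range(out_w):
            xs = x0 + (j * w) // out_w
            xe = x0 + -(-((j + 1) * w) // out_w)
            row.append(max(fmap[y][x] for y in range(ys, ye) for x in range(xs, xe)))
        pooled.append(row)
    return pooled


def apply_box_deltas(box, deltas):
    """Refine a box (cx, cy, w, h) with predicted deltas (dx, dy, dw, dh)."""
    cx, cy, w, h = box
    dx, dy, dw, dh = deltas
    # Center shifts are scaled by box size; width/height change in log space.
    return (cx + w * dx, cy + h * dy, w * math.exp(dw), h * math.exp(dh))


# One feature map, reused for every box.
fmap = [[r * 4 + c for c in range(4)] for r in range(4)]
pooled_full = roi_max_pool(fmap, (0, 0, 3, 3))
pooled_corner = roi_max_pool(fmap, (2, 2, 3, 3))
refined = apply_box_deltas((50.0, 50.0, 20.0, 10.0), (0.1, 0.0, math.log(2), 0.0))
```

Pooling every box to the same fixed size is what lets a single fully connected head score all boxes from one shared feature map.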
https://arxiv.org/abs/1506.02640 Idea: No bounding box proposals. Predict a class and a box for every location in a grid.
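A minimal sketch of the grid idea: without proposals, each object is assigned to the grid location that contains its center, and that location is responsible for predicting it. The function name `responsible_cell` and the default grid size `S` are assumptions for illustration (the YOLO paper uses S = 7).

```python
def responsible_cell(cx, cy, img_w, img_h, S=7):
    """Return (row, col) of the S x S grid cell containing the point (cx, cy).

    cx, cy are the object's center in pixels; img_w, img_h is the image size.
    """
    # Clamp so a center lying exactly on the right/bottom edge stays in the grid.
    col = min(int(cx * S / img_w), S - 1)
    row = min(int(cy * S / img_h), S - 1)
    return row, col


center_cell = responsible_cell(224, 224, 448, 448)
edge_cell = responsible_cell(448, 0, 448, 448)
```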
YOLO: You Only Look Once
https://arxiv.org/abs/1506.02640 Divide the image into 7x7 cells.
Each cell acts as a detector, responsible for objects whose center falls inside it.
Each detector predicts one class distribution for its cell.
Each detector has 2 bounding-box predictors, each predicting a bounding box and a confidence score.
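A minimal sketch of what this implies for the network output, in plain Python. The 7x7 grid and the 2 box predictors come from the notes above; the class count C = 20 (PASCAL VOC, as in the YOLO paper) and the per-cell layout `[box 1, box 2, class probabilities]` are assumptions, and `output_size` and `decode_cell` are illustrative names.

```python
def output_size(S=7, B=2, C=20):
    """Number of values predicted per image: each cell has B boxes of
    (x, y, w, h, confidence) plus C class probabilities."""
    return S * S * (B * 5 + C)


def decode_cell(vec, B=2, C=20):
    """Pick the most confident box and the most likely class for one cell.

    vec is assumed to be laid out as B * (x, y, w, h, conf) followed by C class
    probabilities. Returns (class_id, score, (x, y, w, h)), where score is the
    class probability times the box confidence.
    """
    assert len(vec) == B * 5 + C
    boxes = [vec[5 * b:5 * b + 5] for b in range(B)]
    class_probs = vec[5 * B:]
    best_class = max(range(C), key=lambda c: class_probs[c])
    x, y, w, h, conf = max(boxes, key=lambda b: b[4])
    return best_class, class_probs[best_class] * conf, (x, y, w, h)


n_outputs = output_size()

# Toy cell with 3 classes to keep the vector short.
cell = [0.5, 0.5, 0.2, 0.2, 0.9,   # box 1: x, y, w, h, confidence
        0.4, 0.4, 0.1, 0.1, 0.3,   # box 2
        0.1, 0.7, 0.2]             # class probabilities
class_id, score, box = decode_cell(cell, B=2, C=3)
```

The class distribution is shared by both boxes in a cell, so a cell can report at most one class regardless of how many boxes it predicts.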