1 minute read

Tags: , ,

### Fine-grained object recognition

  • CNN-­based computer vision applicatons
    • Find-grained object recognition
    • Object detection
    • Semantic segmentation
    • Image super resolution
    • Image style transfer
    • Action and gesture recognition
    • Image matching and co-segementation
  • Introduction

    • Generic and fine-grained visual recognition

      • A large class number
      • Large intra-class
      • Subtle inter-class variations

      Imgur

  • Part-based Method

Imgur

Imgur

  • Idea

    • Neuron: part detector
    • Feature map: the spatial occurrence of certain part

    Imgur

We introduce the co-occurrence laye!r to encode the interaction between object parts.

Apporach (Co-occurrence layer)

Imgur

Imgur

Imgur

Experiments: Dataset

Imgur

  • experiments: setting

    Imgur

  • Visualization

Imgur

Imgur


Object Detection

  • CNN-­based computer vision applicatons
    • Find-grained object recognition
    • Object detection
    • Semantic segmentation
    • Image super resolution
    • Image style transfer
    • Action and gesture recognition
    • Image matching and co-segementation
  • Object Detection

    • Goal: Detecting instances of semantic objects of certain classes
    • Critical to high-level vision tasks such as surveillance, self-driving car, and image retrieval

    Imgur

R-CNN: Regions with CNN Features

Object Detection + CNN ===> R-CNN 第一個用 CNN 來做 Object Detection 的論文!

Imgur

  • Proposal extraction: Using selective search [Uijlin et al.,IJCV’13]
  • Compute CNN features in the layer ‘fc7’ of Caffe CNN
  • Region classification: linear SVMs or a softmax classifer
  • Regression-based bounding box refinement

Fast R-CNN

Imgur

  • Apply fully concolutional networks to the whole image
  • Rol pooling: each proposal is pooled into a fix-size feature map
  • Classification with a softmax layer
  • Regression-based bounding box refinement

Experimental Results

Imgur

Faster R-CNN

Imgur

R-CNN vs. Fast-R-CNN vs. Faster R-CNN

Imgur

YOLO9000 [Redmon & Farhadi, CVPR’17]

Imgur