๐Ÿ“‘Paper Review

[paepr review]Deep One-Class Classification

date
Mar 8, 2023
slug
paper-review-deep-one-class-classification
author
status
Public
tags
paper
DeepLearning
summary
type
Post
thumbnail
category
๐Ÿ“‘Paper Review
updatedAt
Aug 26, 2024 03:53 PM

Abstract

์ด์ƒํƒ์ง€๋ฅผ ์œ„ํ•œ ๋”ฅ๋Ÿฌ๋‹ ๋ฐฉ๋ฒ•๋ก  ๋ถ€์กฑ
ย 

Introduction

  1. supervised anomaly detection
    1. train set : normal / abnormal
  1. semi-supervised anomaly detection
    1. train set : unlabeled normal data (๋‹ค๋Ÿ‰) |or| normal/abnomal (์†Œ๋Ÿ‰)
  1. Unsupervised anomaly detection
    1. trainset : unlabeled data
  1. Deep SVDD (Support Vector Data Description)๋Š” feature extractor ๊ฐ€ anomaly detection task์— ๋งž์ถฐ ํ•™์Šต๋˜๋Š” End - to - End Training์ด ๊ฐ€๋Šฅ
    1. ย 

Related Work

ย 
  1. Kernel-based One-Class Classification
    1. OC-SVM : input space์˜ ๋ฐ์ดํ„ฐ๋ฅผ kernel function์„ ํ†ตํ•ด feature space๋กœ ํˆฌ์˜ํ•˜๊ณ , ๊ทธ ์•ˆ์—์„œ ์ •์ƒ๋ฐ์ดํ„ฐ๋ฅผ ์ตœ๋Œ€ํ•œ ์›์ ์—์„œ ๋ฉ€์–ด์ง€๋„๋กํ•˜๋Š” ํ•˜์ดํผํ”Œ๋ž˜์ธ์„ ์ฐพ๊ณ , ํ•ด๋‹น ๊ฒฝ๊ณ„๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ์ด์ƒ์น˜ ํƒ์ง€
      1. notion image
        ย 
    2. SVDD : input space์˜ ๋ฐ์ดํ„ฐ๋ฅผ kernel function์„ ํ†ตํ•ด feature space๋กœ ํˆฌ์˜ํ•˜๊ณ , ๊ทธ ์•ˆ์—์„œ ์ •์ƒ ๋ฐ์ดํ„ฐ๋ฅผ ๋‘˜๋ ค์‹ธ๋Š” ๊ฐ€์žฅ ์ž‘์€ ๊ตฌ (ํ•˜์ดํผ์Šคํ”ผ์–ด)๋ฅผ ์ฐพ๊ณ , ํ•ด๋‹น ๊ฒฝ๊ณ„๋ฉด์„ ๊ธฐ๋ฐ˜์œผ๋กœ ์ด์ƒ์น˜๋ฅผ ํƒ์ง€
      1. notion image
        feature space์—์„œ ๋Œ€๋ถ€๋ถ„์˜ ์ •์ƒ ๋ฐ์ดํ„ฐ( ํฌ์‚ฌ์ด i )๋ฅผ ๋‘˜๋Ÿฌ์‹ธ๋Š” ์ค‘์‹ฌ์ด c์ธ ๊ฐ€์žฅ ์ž‘์€ ๊ตฌ ๊ฒฝ๊ณ„(R์ œ๊ณฑ)์„ ์ฐพ๋Š” ๋ชฉ์  ํ•จ์ˆ˜
      ย 
      notion image
      ย 
  1. Deep Approaches to Anomaly Detection : ์ด์ƒ์น˜ํƒ์ง€ ๋ชฉ์ ํ•จ์ˆ˜๋ฅผ ๊ฐ–๋Š” ๋”ฅ๋Ÿฌ๋‹ ๋ฐฉ๋ฒ•๋ก ์œผ๋กœ, ์•„๋ž˜์™€ ๊ฐ™์€ ๊ตฌ์กฐ๊ฐ€ ์‚ฌ์šฉ๋จ
    1. AE : ์ž…๋ ฅ์„ ์••์ถ• ํ›„ ๋ณต์›ํ•˜๋Š” ์‹ ๊ฒฝ๋ง ์••์ถ• ๊ณผ์ •์—์„œ ๋ณต์›์— ํ•„์š”ํ•œ ์ •๋ณด๋ฅผ ํฌํ•จํ•˜๋„๋ก ํ•™์Šต
    2. VAE : ์ž ์žฌ ์ฝ”๋“œ ๊ฐ’์ด ํ‰๊ท ๊ณผ ๋ถ„์‚ฐ์œผ๋กœ ํ‘œํ˜„ (๊ฐ€์šฐ์‹œ์•ˆ ํ™•๋ฅ  ๋ถ„ํฌ์— ๊ธฐ๋ฐ˜ํ•œ ํ™•๋ฅ ๊ฐ’)
    3. AnoGAN : DCGAN ๊ณผ์ •๊ณผ ๋น„์Šทํ•˜๋‚˜, ์ •์ƒ ๋ฐ์ดํ„ฐ๋งŒ์„ ํ•™์Šต์‹œํ‚ค๊ณ , ๋ฐ์ดํ„ฐ๋ฅผ ์ž…๋ ฅํ–ˆ์„ ๋•Œ, ํ•ด๋‹น ๋ชจ๋ธ์˜ ์ถœ๋ ฅ์ด ์ž…๋ ฅ ์ด๋ฏธ์ง€์™€ ๋‹ค๋ฅธ ์ •๋„๋ฅผ Anomaly Score๋กœ ๋„์ถœ, ์ ์ ˆํ•œ Anomaly Score๋ฅผ threchold๋กœ ์ทจํ•˜์—ฌ ์ด์ƒ์น˜ ๋ฐ์ดํ„ฐ ํƒ์ง€
    4. ย 

Deep SVDD

  1. The Deep SVDD Objective
    1. notion image
      ๋Œ€๋ถ€๋ถ„์˜ ์ •์ƒ ๋ฐ์ดํ„ฐ๋ฅผ ์ตœ์†Œ ๋ถ€ํ”ผ๋ฅผ ๊ฐ€์ง„ ๊ตฌ ์•ˆ์œผ๋กœ mapping ํ•˜๋Š” R๊ณผ W๋ฅผ ํ•™์Šต
      parameters = R : ๋ฐ˜์ง€๋ฆ„, W: weights
      Hyperparameters = c : center, v : ์ด์ƒ์น˜ ๋น„์œจ
    2. term 1 : ๊ตฌ์˜ ๋ถ€ํ”ผ๋ฅผ ์ตœ์†Œํ™”
    3. term 2 : ๊ตฌ ์™ธ๋ถ€์˜ ๋ฐ์ดํ„ฐ point์— ๋Œ€ํ•ด penalty ๋ถ€์—ฌ
    4. term 3 : weight decay regularizer
      1. ๋Œ€๋ถ€๋ถ„์˜ ๋ฐ์ดํ„ฐ๊ฐ€ normal์ด๋ผ๊ณ  ๊ฐ€์ •ํ•˜๋Š” one-class deep SVDD์—์„œ๋Š” ์•„๋ž˜์™€ ๊ฐ™์ด ๊ตฌ์˜ ์ค‘์‹ฌ๊ณผ ๋ฐ์ดํ„ฐ point ์‚ฌ์ด์˜ ๊ฑฐ๋ฆฌ๋ฅผ ์ตœ์†Œํ™”ํ•˜๋Š” W๋งŒ์„ ์ตœ์ ํ™” ํ•˜๋„๋ก ํ•™์Šต
        notion image
  1. Optimization of Deep SVDD
    1. ์œ„์˜ objective function์„ ์ตœ์ ํ™”ํ•˜์—ฌ ์ •์ƒ ๋ฐ์ดํ„ฐ์˜ ๊ณตํ†ต์ ์ธ ์š”์†Œ๋ฅผ ์ถ”์ถœํ•˜๊ณ , ๊ฐ ๋ฐ์ดํ„ฐ point๋ฅผ ๊ตฌ์˜ ์ค‘์‹ฌ C์— ๊ฐ€๊น๊ฒŒ mappingํ•˜๋„๋ก ํ•™์Šต, W์™€ R์„ ๊ต๋Œ€๋กœ ์ตœ์ ํ™”
    2. optimiztion of W : R์„ ๊ณ ์ •ํ•˜๊ณ  k epoch ๋™์•ˆ backpropagation์„ ํ†ตํ•ด w๋ฅผ ์—…๋ฐ์ดํŠธ
    3. Optimization of R : latest update W๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ line search๋ฅผ ํ†ตํ•ด R์„ ์—…๋ฐ์ดํŠธ
      1. ย 
  1. Properties of Deep SVDD
    1. Proposition 1
      1. W, R, C๊ฐ€ ๋ชจ๋‘ 0์ด ๋˜๋„๋ก ํ•™์Šต๋˜๋ฉด, ์–ด๋–ค input๋„ 0์œผ๋กœ ์ถœ๋ ฅ๋˜๋Š” ๋ฌธ์ œ๊ฐ€ ์ƒ๊น€
        ๋•Œ๋ฌธ์— C๋ฅผ representations of initial forward pass์˜ ํ‰๊ท ์œผ๋กœ ๊ณ ์ •
        ย 
        representations of initial forward pass์˜ ํ‰๊ท ์„ ๊ตฌํ•˜๋Š” ๊ณผ์ •
      2. C๋Š” ์ •์ƒ๋ฐ์ดํ„ฐ์˜ ๋Œ€ํ‘œ feature๋ฅผ ํฌํ•จํ•˜๊ณ  ์žˆ์–ด์•ผ ํ•จ
      3. pre-trained ๋œ CAE์˜ encoder๋ฅผ Deep SVDD์˜ initial weight๋กœ ์‚ฌ์šฉ
      4. ํ•™์Šต๋˜์ง€ ์•Š์€ ์ดˆ๊ธฐ ๋ชจ๋ธ์— ๋ชจ๋“  ๋ฐ์ดํ„ฐ๋ฅผ ์ž…๋ ฅํ•˜์—ฌ feature๋ฅผ ์ถœ๋ ฅํ•œ ํ›„ feature๋“ค์˜ ํ‰๊ท ์„ ๊ตฌํ•˜์—ฌ C๋กœ ์‚ฌ์šฉ
        1. ย 
    2. Proposition 2
      1. bias term์„ constant๋กœ ์ˆ˜๋ ดํ•˜๋Š” ์ƒํ™ฉ์„ ๋ฐฉ์ง€ํ•˜๊ธฐ ์œ„ํ•ด bias term์„ ์‚ฌ์šฉํ•˜์ง€ ์•Š์Œ
      ย 

Experiments

  1. Competing methods
    1. Shallow baselines : OC-SVM/ SVDD (kernel based) , KDE, IF (95% ์ด์ƒ ๋ถ„์‚ฐ ๋ณด์กด)
    2. Deep baselines : DCAE, AnoGAN
  1. One-class classification on MNIST and CIFAR-10
    1. Setup
      1. MNIST
        1. Train set : 6,000๊ฐœ์˜ normal ์ด๋ฏธ์ง€
        2. Test set : 10,000๊ฐœ์˜ normal / abnormal ์ด๋ฏธ์ง€
      2. CIFAR-10
        1. Train set : 5,000๊ฐœ์˜ normal ์ด๋ฏธ์ง€
        2. Test set : 10,000๊ฐœ์˜ normal / abnormal ์ด๋ฏธ์ง€
    2. Network architectures : LeNet ์‚ฌ์šฉ
    3. Results
      1. notion image
        MNIST์™€ CIFAR-10์—์„œ one-class SVDD ๊ฐ€ ์ „๋ฐ˜์ ์œผ๋กœ ์ข‹์€ ์„ฑ๋Šฅ์„ ๋„์ถœ
  1. Adversarial attacks on GTSRB stop signs (๋…์ผ ๊ตํ†ต ์‹ ํ˜ธ ์ธ์‹ ๋ฒค์น˜๋งˆํฌ ๋ฐ์ดํ„ฐ)
    1. Setup
      1. Trainset : 780๊ฐœ์˜ ์ •์ง€ ์‹ ํ˜ธ ํด๋ž˜์Šค
      2. Testset : 270๊ฐœ์˜ ์ •๊ทœ ์˜ˆ์ œ์™€ 20๊ฐœ์˜ ์ ๋Œ€์  ์˜ˆ์ œ
    2. Network architectures : LeNet ์‚ฌ์šฉ
    3. Results
      1. notion image
        AnoGAN์€ ๋ฐ์ดํ„ฐ์…‹์ด ๋„ˆ๋ฌด ์ž‘์•„ ์ˆ˜๋ ดํ•˜์ง€ ๋ชปํ–ˆ๊ณ , ONE CLASS DEEP SVDD ๊ฐ€ ๊ฐ€์žฅ ์ข‹์€ ์„ฑ๋Šฅ์„ ๋„์ถœํ•œ๋‹ค๊ณ  ์ฃผ์žฅ
        ย 
        ย