๐Ÿ“‘Paper Review

[paper review] Generative Adversarial Nets

date
Jun 27, 2023
slug
Generative-Adversarial-Nets
author
status
Public
tags
paper
DeepLearning
summary
type
Post
thumbnail
์บก์ฒ˜.PNG
category
๐Ÿ“‘Paper Review
updatedAt
Sep 7, 2024 03:12 AM
  • ์ง„์งœ์™€ ๋™์ผํ•ด ๋ณด์ด๋Š” ์ด๋ฏธ์ง€๋ฅผ ์ƒ์„ฑํ•˜๋Š” model
notion image
  • ๊ธฐ์กด ์ง€๋„ํ•™์Šต์˜ ํ•œ๊ณ„๋ฅผ ๋›ฐ์–ด๋„˜์€ model
  • ๊ธฐ์กด์˜ ์ง€๋„ํ•™์Šต์˜ ๊ฒฝ์šฐ dataset์ด ํ•„์ˆ˜. ์ด๋Ÿฌํ•œ dataset์„ ๋งŒ๋“œ๋Š” ๊ณผ์ •์— ๋งŽ์€ ๋น„์šฉ ๋ฐœ์ƒ
  • GAN์€ ๋น„์ง€๋„ํ•™์Šต์— ์†ํ•˜๋ฉฐ, data๋ฅผ ์ง์ ‘ ์ƒ์„ฑํ•˜๋Š” ํฐ ์žฅ์ ์„ ๊ฐ€์ง
notion image

Abstract

  • ๋‘ ๊ฐœ์˜ ๋ถ„๋ฆฌ๋œ ๋ชจ๋ธ
  • Generative model(์ƒ์„ฑ๊ธฐ,G) : data์˜ ๋ถ„ํฌ๋ฅผ ํฌ์ฐฉ
  • Discriminative model(ํŒ๋ณ„๊ธฐ,D) : ํ•œ sample์ด ์ƒ์„ฑ๊ธฐ๊ฐ€ ์•„๋‹Œ ์‹ค์ œ training data๋กœ๋ถ€ํ„ฐ ์™”์„ ๊ฒƒ์ด๋ผ๋Š” ํ™•๋ฅ ์„ ์ถ”์ •ํ•˜๋Š” model
  • ์œ„๋ฅผ ๋™์‹œ์— ํ•™์Šตํ•˜๋Š” adversarial process ๊ณผ์ •
  • G์˜ ํ•™์Šต์ ˆ์ฐจ๋Š” D๊ฐ€ ์ž˜๋ชป๋œ ๊ฒฐ์ •์„ ํ•˜๊ฒŒ ๋งŒ๋“ค ํ™•๋ฅ ์„ ์ตœ๋Œ€ํ™”ํ•˜๋Š” ๊ฒƒ

Image data์— ๋Œ€ํ•œ ํ™•๋ฅ ๋ถ„ํฌ

  • Image data๋Š” ๋‹ค์ฐจ์› ํŠน์ง• ๊ณต๊ฐ„์˜ ํ•œ ์ ์œผ๋กœ ํ‘œํ˜„. Image ๋ถ„ํฌ๋ฅผ ๊ทผ์‚ฌํ•˜๋Š” model์„ ํ•™์Šตํ•  ์ˆ˜ ์žˆ์Œ
  • ex) ์‚ฌ๋žŒ์˜ ์–ผ๊ตด์—๋Š” ํ†ต๊ณ„์ ์ธ ํ‰๊ท ์น˜๊ฐ€ ์กด์žฌ -> model์€ ์ด๋ฅผ ์ˆ˜์น˜์ ์œผ๋กœ ํ‘œํ˜„
  • Image์—์„œ๋Š” ๋‹ค์–‘ํ•œ ํŠน์ง•๋“ค์ด ๊ฐ๊ฐ์˜ ํ™•๋ฅ  ๋ณ€์ˆ˜๊ฐ€ ๋˜๋Š” ๋ถ„ํฌ. ๋‹ค๋ณ€์ˆ˜ ํ™•๋ฅ ๋ถ„ํฌ(Multivariate probability distribution)
notion image

Genrative Models

  • ์‹ค์กดํ•˜์ง€ ์•Š์ง€๋งŒ ์žˆ์„ ๋ฒ•ํ•œ data๋ฅผ ์ƒ์„ฑํ•  ์ˆ˜ ์žˆ๋Š” model
  • ์—ฌ๋Ÿฌ ๊ฐœ์˜ ๋ณ€์ˆ˜์— ๋Œ€ํ•œ ๊ฒฐํ•ฉ ํ™•๋ฅ  ๋ถ„ํฌ์˜ ํ†ต๊ณ„์  ๋ชจํ˜•

1. Introduction

  • ์ ๋Œ€์  ์‹ ๊ฒฝ๋ง(adversarial nets) : G๊ฐ€ D๋ฅผ ์†์ด๋„๋ก ํ•˜๊ณ , D๋Š” ์–ด๋–ค sample์ด G๊ฐ€ modelingํ•œ ๋ถ„ํฌ๋กœ๋ถ€ํ„ฐ ๋‚˜์˜จ ๊ฒƒ์ธ์ง€ ์‹ค์ œ data ๋ถ„ํฌ๋กœ๋ถ€ํ„ฐ ๋‚˜์˜จ๊ฒƒ์ธ์ง€๋ฅผ ๊ฒฐ์ •ํ•˜๋Š” ๋ฒ•์„ ํ•™์Šต. ์ด์™€ ๊ฐ™์€ ๊ฒฝ์Ÿ๊ตฌ๋„๋Š” ๋‘ model์ด ๋ชจ๋‘ ๊ฐ๊ฐ์˜ ๋ชฉ์ ์„ ๋‹ฌ์„ฑ์‹œํ‚ค๊ธฐ ์œ„ํ•ด ์Šค์Šค๋กœ๋ฅผ ๊ฐœ์„ ํ•˜๋„๋ก ํ•จ
  • (G๋Š” D๋ฅผ ๋” ์ž˜ ์†์ด๋„๋ก ์›๋ณธ data๋ฅผ ๋” ์ž˜ ๋ชจ๋ฐฉํ•œ ๋ถ„ํฌ๋ฅผ ํ•™์Šต, D๋Š” ์ง„์งœ/๊ฐ€์งœ data๋ฅผ ๋” ์ž˜ ๊ฐ„ํŒŒํ•˜๋„๋ก data์˜ ํŠน์ง•์„ ๋” ์ž˜ ํŒŒ์•…ํ•˜๋„๋ก ํ•™์Šต)
  • ์‹ค์ œ data ๋ถ„ํฌ์™€ model์ด ์ƒ์„ฑํ•œ data์˜ ๋ถ„ํฌ ๊ฐ„์˜ ์ฐจ์ด๋ฅผ ์ค„์ด๋Š” ๊ฒƒ
  • G์— random noise๋ฅผ ๋”ํ•จ์œผ๋กœ์จ data๋ฅผ ์ƒ์„ฑํ•˜๋ฉฐ ์ด๋ฅผ ํŒ๋ณ„ํ•˜๊ธฐ ์œ„ํ•ด MLP model ์‚ฌ์šฉ = adversarial nets
  • Generator Network : random noise vector๋ฅผ ์ž…๋ ฅ๋ฐ›์•„ image๋ฅผ ๋งŒ๋“œ๋Š” upsampling ์ง„ํ–‰
  • Discriminator Network : Network์— ์ „๋‹ฌ๋œ image๊ฐ€ ์‹ค์ œ์ธ์ง€ ๊ฐ€์งœ์ธ์ง€๋ฅผ ํŒ๋ณ„

2. Adversarial Nets

notion image
  • x : sample image์˜ data,ย ย : image data๋“ค์˜ ํ™•๋ฅ ๋ถ„ํฌ, z : generator์— ์ž…๋ ฅ๋˜๋Š” noise ์˜์—ญ,ย ย : noise์˜ ํ™•๋ฅ  ๋ถ„ํฌ, G(z) : new data instance, D(x) : training data๋กœ๋ถ€ํ„ฐ ๋‚˜์™”๋Š”์ง€์— ๋Œ€ํ•œ ํ™•๋ฅ (์ง„์งœ : 1, ๊ฐ€์งœ : 0)
    • pdata(x)
      pz(z)
notion image
  • Generator์˜ ๋ชฉํ‘œ : Discriminator๊ฐ€ ๊ตฌ๋ถ„ํ•˜์ง€ ๋ชปํ•  ์ •๋„๋กœ ์‹ค์ œ data์™€ ์œ ์‚ฌํ•œ data๋ฅผ ๋งŒ๋“œ๋Š” ๊ฒƒ
  • Discriminator์˜ ๋ชฉํ‘œ : Generator๊ฐ€ ๋งŒ๋“  ๊ฒƒ๊ณผ ์‹ค์ œ data๋ฅผ ์ž˜ ๊ตฌ๋ถ„ํ•ด ๋‚ด๋Š” ๊ฒƒ
notion image
  • ์ดˆ๋ก์ƒ‰ ๋ถ„ํฌ : generative model์˜ ๋ถ„ํฌ, ๊ฒ€์ •์ƒ‰ ๋ถ„ํฌ : ์›๋ณธ data์˜ ๋ถ„ํฌ, ํŒŒ๋ž€์ƒ‰ ๋ถ„ํฌ : discriminator์˜ ๋ถ„ํฌ
  • ์‹œ๊ฐ„์ด ์ง€๋‚˜๋ฉด์„œ generative model G๊ฐ€ ์›๋ณธ data์˜ ๋ถ„ํฌ๋ฅผ ํ•™์Šต
  • ํ•™์Šต์ด ์ž˜ ๋˜์—ˆ๋‹ค๋ฉด ํ†ต๊ณ„์ ์œผ๋กœ ํ‰๊ท ์ ์ธ ํŠน์ง•์„ ๊ฐ€์ง€๋Š” data๋ฅผ ์‰ฝ๊ฒŒ ์ƒ์„ฑ ๊ฐ€๋Šฅ

GAN์˜ ์ˆ˜๋ ด ๊ณผ์ •

  • ์ƒ์„ฑ์ž์˜ ๋ถ„ํฌ๊ฐ€ ์›๋ณธ data์˜ ๋ถ„ํฌ์— ์ˆ˜๋ ดํ•˜๋„๋ก ๋งŒ๋“œ๋Š” ๊ฒƒ.ย ย ->ย , D(G(z))->1/2(ํ•™์Šต์ด ๋‹ค ์ด๋ฃจ์–ด์ง„ ํ›„์—๋Š” ์ง„์งœ์™€ ๊ฐ€์งœ๋ฅผ ๊ตฌ๋ณ„ํ•  ์ˆ˜ ์—†๊ธฐ ๋•Œ๋ฌธ์— 50% ํ™•๋ฅ ๊ฐ’์„ ๊ฐ€์ง€๊ฒŒ ๋จ = ํŒ๋ณ„์ž๊ฐ€ ๋” ์ด์ƒ ๊ตฌ๋ณ„์„ ๋ชปํ•จ
    • Pg
      Pdata

Global Optimality

ํŒ๋ณ„๊ธฐ์˜ global optimum point

  • G๊ฐ€ ๊ณ ์ •๋˜์–ด ์žˆ์„ ๋•Œ
notion image
notion image
notion image

์ƒ์„ฑ๊ธฐ์˜ global optimum point

  • Global optimum point is
    • pg=pdata
notion image

KL divergence(์ฟจ๋ฐฑ-๋ผ์ด๋ธ”๋Ÿฌ ๋ฐœ์‚ฐ)

  • ๋‘ ํ™•๋ฅ ๋ถ„ํฌ์˜ ์ฐจ์ด๋ฅผ ๋ณด์—ฌ์ฃผ๋Š” ์‹
notion image

Jensen-Shannon Divergence

  • KL divergence์—์„œ ๋‚˜์˜จ ๊ฐœ๋…์„ distance metrics๋กœ ์‚ฌ์šฉํ•˜๊ธฐ ์œ„ํ•œ ๋ณ€ํ™˜์‹
  • ๋‘ ๋ถ„ํฌ๊ฐ€ ๋™์ผํ•  ๋•Œ๋Š” JSD ๊ฐ’์€ 0
notion image
  • ์œ„์˜ ์ฆ๋ช…๋“ค์€ model ํ•™์Šต์ด ์ž˜ ๋˜์—ˆ์„ ๋•Œ ์œ„์™€ ๊ฐ™์€ ๊ฐ’์„ ๊ฐ€์ ธ์•ผํ•œ๋‹ค๋Š” ๊ฒƒ์„ ์˜๋ฏธ.
  • ํ•™์Šต์ด ์ˆ˜๋ ดํ•˜๋Š” ๋ฐฉ๋ฒ•์— ๊ด€ํ•œ ์‹ X
  • ์—ญ์ „ํŒŒ ์ˆ˜ํ–‰

GAN ์•Œ๊ณ ๋ฆฌ์ฆ˜

  • ํŒ๋ณ„๊ธฐ ํ•™์Šต ํ›„ -> ์ƒ์„ฑ๊ธฐ ํ•™์Šต
  • ์ƒ์„ฑ๊ธฐ ํ•™์Šต ํ›„ -> ํŒ๋ณ„๊ธฐ ํ•™์Šต
ย