Generative Models as Distributions of Functions
Understood together with PR12
Jaejun Yoo
(current) Postdoc. @EPFL
(from July) Assistant Prof., @UNIST
PR-312, 11th April, 2021
Today’s contents
“For all datasets, we use an MLP with 3
hidden layers of size 128 … and an MLP
with 2 hidden layers of size 256 and 512”
“We performed all training on a single
2080Ti GPU with 11GB of RAM.”
Motivation and Main Problem
“Conventional signal representations are usually discrete.”
However, Mother Nature is continuous!
(well… up to the Planck constant…?)
[Figure panels: 2D images, audio, 3D shapes]
Motivation and Main Problem
Of course, these functions are usually not analytically tractable: it is impossible to "write down"
the function that parameterizes a natural image as a mathematical formula.
Continuous representation?
Why hard?
Motivation and Main Problem
Why important?
• independent of spatial resolution (infinite resolution)
• Geometric transformation of images: zoom, rotation, super-resolution.
• Derivatives are well-defined.
Motivation and Main Problem
Why important?
[Figure panels: piecewise constant, bilinear, cubic spline]
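To make the comparison of interpolation schemes concrete, here is a minimal 1D sketch (an illustration, not taken from the slides) of piecewise-constant versus linear interpolation of a sampled signal. The sample values and the helper names `nearest` and `linear` are made up for this example. Both turn discrete samples into a function that can be queried at any real-valued coordinate, but the piecewise-constant version is flat between jumps, so its derivative carries no information.

```python
import math

# Hypothetical samples of a signal at integer grid positions 0, 1, 2, 3.
samples = [0.0, 1.0, 4.0, 9.0]

def nearest(x):
    """Piecewise-constant interpolation: return the closest sample."""
    i = min(max(int(math.floor(x + 0.5)), 0), len(samples) - 1)
    return samples[i]

def linear(x):
    """Linear interpolation between the two neighbouring samples."""
    x = min(max(x, 0.0), len(samples) - 1.0)
    i = min(int(math.floor(x)), len(samples) - 2)
    t = x - i
    return (1.0 - t) * samples[i] + t * samples[i + 1]

# Query both interpolants at on-grid and off-grid positions.
for x in (1.0, 1.25, 2.5):
    print(x, nearest(x), linear(x))
```

A cubic spline would go one step further and also make the first and second derivatives continuous across the sample positions.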
Motivation and Main Problem
Spline: Variational properties
Explicit derivative!
Continuous representation?
• DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation (Park et al. 2019)
• Occupancy Networks: Learning 3D Reconstruction in Function Space (Mescheder et al. 2019)
• IM-Net: Learning Implicit Fields for Generative Shape Modeling (Chen et al. 2018)
• … NeRF (PR-302)…
“Implicit Neural Representations approximate this function via a neural network!”
Motivation and Main Problem
Implicit Neural Representation!
Implicit Neural Representation
- Remarkably, the representation f_θ is independent of
the number of pixels. The representation f_θ therefore,
unlike most image representations, does not depend
on the resolution of the image.
- The core property of these representations is that
they scale with signal complexity and not with
signal resolution.
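As a rough illustration of the resolution-independence point above, the sketch below builds a tiny coordinate MLP and evaluates it on two grids of different size. The layer sizes, the ReLU activation, the random initialisation, and the helper names (`init_mlp`, `f_theta`, `render`) are assumptions for this example, not the paper's setup. The parameter count stays the same whichever grid is used.

```python
import math
import random

def init_mlp(sizes, rng):
    """Create [(W, b), ...] for consecutive layer sizes, e.g. [2, 16, 16, 3]."""
    layers = []
    for n_in, n_out in zip(sizes[:-1], sizes[1:]):
        W = [[rng.gauss(0.0, 1.0 / math.sqrt(n_in)) for _ in range(n_in)]
             for _ in range(n_out)]
        b = [0.0] * n_out
        layers.append((W, b))
    return layers

def f_theta(layers, x):
    """Evaluate the MLP at one coordinate x (a list of floats)."""
    h = x
    for k, (W, b) in enumerate(layers):
        h = [sum(w * v for w, v in zip(row, h)) + bi for row, bi in zip(W, b)]
        if k < len(layers) - 1:          # ReLU on hidden layers only
            h = [max(0.0, v) for v in h]
    return h

def render(layers, res):
    """Evaluate f_theta on a res x res grid of coordinates in [0, 1]^2."""
    coords = [(i / (res - 1), j / (res - 1))
              for i in range(res) for j in range(res)]
    return [f_theta(layers, list(c)) for c in coords]

rng = random.Random(0)
theta = init_mlp([2, 16, 16, 3], rng)    # toy sizes, not the paper's
n_params = sum(len(W) * len(W[0]) + len(b) for W, b in theta)

low = render(theta, 8)      # 64 pixels
high = render(theta, 32)    # 1024 pixels, same parameters
print(n_params, len(low), len(high))
```

The same `theta` produces both renderings; only the list of query coordinates changes.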
Learning Distributions of Functions
1. Parameterizing a distribution over neural
networks with a hypernetwork (Ha et al., 2017)
Overall Scheme
“Sample the weights of a neural network”
to obtain a function.
: Learning a distribution over functions f_θ is equivalent to
learning a distribution over weights p(θ).
: Then p(θ), where θ = g_φ(z), is referred to as a neural
function distribution (NFD).
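The sampling scheme above can be sketched in a few lines: draw a latent z, map it to weights with a hypernetwork, and use those weights as a function. This is only a toy under stated assumptions: the hypernetwork here is a single random linear map, the latent is standard normal, and all sizes and names (`g_phi`, `f_theta`, `sample_function`) are illustrative rather than the paper's.

```python
import random

rng = random.Random(0)

IN_DIM, HIDDEN, OUT_DIM = 2, 4, 1      # toy function network f_theta
LATENT = 3                             # toy latent size
N_THETA = (IN_DIM * HIDDEN + HIDDEN) + (HIDDEN * OUT_DIM + OUT_DIM)

# Hypernetwork g_phi: here just one random linear map from z to theta.
phi = [[rng.gauss(0.0, 0.5) for _ in range(LATENT)] for _ in range(N_THETA)]

def g_phi(z):
    """Map a latent vector z to a flat weight vector theta."""
    return [sum(p * zi for p, zi in zip(row, z)) for row in phi]

def f_theta(theta, x):
    """One-hidden-layer ReLU MLP whose weights are read from flat theta."""
    k = 0
    pre = []
    for _ in range(HIDDEN):
        w = theta[k:k + IN_DIM]
        k += IN_DIM
        pre.append(sum(wi * xi for wi, xi in zip(w, x)))
    biases = theta[k:k + HIDDEN]
    k += HIDDEN
    hidden = [max(0.0, h + b) for h, b in zip(pre, biases)]
    out = []
    for _ in range(OUT_DIM):
        w = theta[k:k + HIDDEN]
        k += HIDDEN
        out.append(sum(wi * hi for wi, hi in zip(w, hidden)))
    out_bias = theta[k:k + OUT_DIM]
    return [o + b for o, b in zip(out, out_bias)]

def sample_function():
    """Sample z ~ N(0, I), set theta = g_phi(z), return the function f_theta."""
    z = [rng.gauss(0.0, 1.0) for _ in range(LATENT)]
    theta = g_phi(z)
    return lambda x: f_theta(theta, x)

f1, f2 = sample_function(), sample_function()
print(f1([0.5, 0.5]), f2([0.5, 0.5]))   # two independently sampled functions
```

Each call to `sample_function` returns a whole function, which can then be queried at any coordinate.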
However! How do we get access to the
ground truth functions to train the network?
“We do have access to input/output
pairs of these functions through the
coordinates and features, allowing us to
learn function distributions without
operating directly on the functions!”
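A minimal sketch of what "input/output pairs" means for an image: every pixel becomes a (coordinate, feature) pair, and training can draw on any subset of those pairs. The [0, 1] coordinate normalisation, the 4 x 4 toy image, and the helper name `image_to_point_set` are assumptions for this example.

```python
import random

def image_to_point_set(image):
    """Turn an H x W image (nested lists) into (coordinate, feature) pairs.

    Coordinates are normalised to [0, 1]; the normalisation is an assumption.
    """
    h, w = len(image), len(image[0])
    pairs = []
    for i in range(h):
        for j in range(w):
            coord = (i / (h - 1), j / (w - 1))
            pairs.append((coord, image[i][j]))
    return pairs

# Hypothetical 4 x 4 grayscale image.
image = [[(i + j) / 6.0 for j in range(4)] for i in range(4)]
pairs = image_to_point_set(image)

# Training can use a random subset of the pairs rather than the full grid.
rng = random.Random(0)
subset = rng.sample(pairs, 8)
print(len(pairs), subset[0])
```

Once the data is a set of pairs, nothing in the representation requires the coordinates to lie on a regular grid.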
2. Training this distribution with an adversarial
approach (Goodfellow et al., 2014).
γ is a kind of positional encoding (Fourier features).
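For readers unfamiliar with Fourier-feature positional encodings, here is a small sketch assuming the common random Fourier feature form, γ(x) = [cos(2πBx), sin(2πBx)] with a fixed random matrix B. The number of frequencies, the scale of B, and the helper name `make_fourier_encoder` are placeholders for illustration, not values from the slides.

```python
import math
import random

def make_fourier_encoder(in_dim, n_freqs, scale, seed=0):
    """Return gamma(x) = [cos(2*pi*Bx), sin(2*pi*Bx)] with a fixed random B."""
    rng = random.Random(seed)
    B = [[rng.gauss(0.0, scale) for _ in range(in_dim)] for _ in range(n_freqs)]

    def gamma(x):
        proj = [2.0 * math.pi * sum(b * xi for b, xi in zip(row, x))
                for row in B]
        return [math.cos(p) for p in proj] + [math.sin(p) for p in proj]

    return gamma

gamma = make_fourier_encoder(in_dim=2, n_freqs=4, scale=2.0)
features = gamma([0.25, 0.75])
print(len(features))   # 2 * n_freqs values fed to the MLP instead of raw (x, y)
```

The encoded vector, not the raw coordinate, is what the function network receives; a larger `scale` lets the network represent finer detail.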
Learning Distributions of Functions
Overall Scheme
NFD
Now we know how to design a network to learn continuous functions!
Learning Distributions of Functions
Overall Scheme
Discriminator
But, the data we consider may not necessarily lie on a grid…
… in which case it is not possible to use convolutional discriminators.
Our discriminator should be able to distinguish between
real and fake sets of coordinate and feature pairs.
Point Cloud Discriminator
Point Convolution
In contrast to regular convolutions,
where the convolution kernels are only
defined at certain grid locations, the
convolution filters in PointConv are
parameterized by an MLP mapping
coordinates to kernel values:
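A minimal sketch of the idea, under simplifying assumptions: the kernel is a function of the offset between two points, and the output at a point is the average of kernel value times feature over its neighbours. The fixed toy weights in `kernel_mlp`, the radius-based neighbourhood, and the plain averaging are choices made for this example; an actual PointConv layer learns the kernel MLP and may weight neighbours differently.

```python
import math

def kernel_mlp(offset):
    """Toy stand-in for the kernel MLP: maps a 2D offset to a kernel value.

    A real PointConv layer learns these weights; the fixed values here are
    placeholders chosen for the example.
    """
    w1 = [[1.0, -1.0], [0.5, 0.5]]
    w2 = [0.7, -0.3]
    hidden = [max(0.0, w[0] * offset[0] + w[1] * offset[1]) for w in w1]
    return w2[0] * hidden[0] + w2[1] * hidden[1] + 1.0

def point_conv(points, features, radius):
    """For each point, average kernel(offset) * feature over nearby points."""
    out = []
    for x in points:
        acc, count = 0.0, 0
        for y, f in zip(points, features):
            offset = (y[0] - x[0], y[1] - x[1])
            if math.hypot(*offset) <= radius:
                acc += kernel_mlp(offset) * f
                count += 1
        out.append(acc / count)   # count >= 1: each point is its own neighbour
    return out

# Irregular (off-grid) coordinates with one scalar feature each.
points = [(0.1, 0.2), (0.15, 0.22), (0.8, 0.9), (0.5, 0.5)]
features = [1.0, 2.0, 3.0, 4.0]
out = point_conv(points, features, radius=0.2)
print(out)
```

Because the kernel is evaluated at arbitrary offsets, the layer works on sets of coordinate and feature pairs that do not lie on a grid.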
Experiments
“For all datasets, we use an MLP with 3
hidden layers of size 128 … and an MLP
with 2 hidden layers of size 256 and 512”
“We performed all training on a single
2080Ti GPU with 11GB of RAM.”
“Remarkably, such a simple architecture
is sufficient for learning rich distributions
of images and 3D shapes.“
“Use the exact same model for both
images and 3D shapes except for the
input and output dimensions of the
function representation.”
Implementation Setups
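To get a feel for how small these networks are, the sketch below counts the parameters of an MLP with 3 hidden layers of size 128. It assumes this MLP is the function representation (the quote above elides which network is which) and that it takes raw coordinates as input; with a positional encoding the input dimension, and therefore the count, would be larger. The helper name `mlp_param_count` is made up for this example.

```python
def mlp_param_count(in_dim, hidden_sizes, out_dim):
    """Number of weights and biases in a fully connected MLP."""
    sizes = [in_dim] + list(hidden_sizes) + [out_dim]
    return sum(n_in * n_out + n_out
               for n_in, n_out in zip(sizes[:-1], sizes[1:]))

hidden = [128, 128, 128]          # "3 hidden layers of size 128"

# Assumed dimensions: raw (x, y) -> RGB for images, raw (x, y, z) -> occupancy
# for shapes. Only the input and output dimensions differ between the two.
image_params = mlp_param_count(2, hidden, 3)
shape_params = mlp_param_count(3, hidden, 1)
print(image_params, shape_params)   # 33795 33665
```

Under these assumptions, a network that generates the function's weights would need to output a vector of roughly this length.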
Results
2D Image generation
• Samples from our model trained on CelebAHQ.
• 64×64 (top) and 128×128 (bottom)
• Each image corresponds to a function which
was sampled from our model and then
evaluated on the grid.
• To produce this figure we sampled 5 batches
and chose the best batch by visual inspection.
Results
“To infinity and beyond!”
- Buzz Lightyear, Toy Story
Super-resolution
[Figure panels: NFD 64×64, NFD 256×256, Bicubic 256×256]
[Figure panels: NFD 28×28, NFD 256×256, Bicubic 256×256]
Results
3D shapes
Voxel grids from Choy et al. (2016) representing the chairs category from the ShapeNet (Chang et al.,
2015) dataset. The dataset contains 6778 chairs, each of dimension 32³. For each 3D model, we uniformly
subsample K = 4096 points among the 32³ = 32,768 points and use them for training.
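The subsampling step can be sketched as follows. The flat occupancy layout, the [0, 1] coordinate normalisation, the random stand-in shape, and the helper name `subsample_voxels` are assumptions for illustration; only the grid size and K come from the slide.

```python
import random

RES = 32          # voxel grid resolution per axis
K = 4096          # points kept per shape

def subsample_voxels(occupancy, k, rng):
    """Pick k voxel indices uniformly without replacement.

    `occupancy` is a flat list of RES**3 values (0 or 1); returns
    (coordinate, occupancy) pairs with coordinates normalised to [0, 1].
    """
    chosen = rng.sample(range(RES ** 3), k)
    pairs = []
    for idx in chosen:
        ix, rem = divmod(idx, RES * RES)
        iy, iz = divmod(rem, RES)
        coord = (ix / (RES - 1), iy / (RES - 1), iz / (RES - 1))
        pairs.append((coord, occupancy[idx]))
    return pairs

rng = random.Random(0)
# Hypothetical shape: a random occupancy grid standing in for a ShapeNet chair.
occupancy = [rng.randint(0, 1) for _ in range(RES ** 3)]
pairs = subsample_voxels(occupancy, K, rng)
print(RES ** 3, len(pairs))   # 32768 4096
```

Each shape is thus seen as 4096 of its 32,768 coordinate and occupancy pairs, one eighth of the grid.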
Conclusion
Summary of Contributions (I think)
• A step towards making implicit neural representation methods genuinely useful
for modeling datasets rather than individual data points.
• The first framework to model data of this complexity in an entirely continuous
fashion.
• The ability to be independent of resolution and to operate outside of a grid.
• A unique way of using point cloud discriminators.
Things to discuss…
• What kinds of follow-up studies could come out of this?
• Architectural developments (better quality)?
• Then how? Or what would be helpful?
• Other applications?
• Again, compute-driven AI vs. human-knowledge-based AI?
• Big model vs. inductive bias?
• Etc.?
Hypothalamus_structure_nuclei_ functions.pptx
klynct
 
Carboxylic-Acid-Derivatives.lecture.presentation
Carboxylic-Acid-Derivatives.lecture.presentationCarboxylic-Acid-Derivatives.lecture.presentation
Carboxylic-Acid-Derivatives.lecture.presentation
GLAEXISAJULGA
 
Batteries and fuel cells for btech first year
Batteries and fuel cells for btech first yearBatteries and fuel cells for btech first year
Batteries and fuel cells for btech first year
MithilPillai1
 
The Microbial World. Microbiology , Microbes, infections
The Microbial World. Microbiology , Microbes, infectionsThe Microbial World. Microbiology , Microbes, infections
The Microbial World. Microbiology , Microbes, infections
NABIHANAEEM2
 
ICAI OpenGov Lab: A Quick Introduction | AI for Open Government
ICAI OpenGov Lab: A Quick Introduction | AI for Open GovernmentICAI OpenGov Lab: A Quick Introduction | AI for Open Government
ICAI OpenGov Lab: A Quick Introduction | AI for Open Government
David Graus
 
A Massive Black Hole 0.8kpc from the Host Nucleus Revealed by the Offset Tida...
A Massive Black Hole 0.8kpc from the Host Nucleus Revealed by the Offset Tida...A Massive Black Hole 0.8kpc from the Host Nucleus Revealed by the Offset Tida...
A Massive Black Hole 0.8kpc from the Host Nucleus Revealed by the Offset Tida...
Sérgio Sacani
 
Freshwater Biome Types, Characteristics and Factors
Freshwater Biome Types, Characteristics and FactorsFreshwater Biome Types, Characteristics and Factors
Freshwater Biome Types, Characteristics and Factors
mytriplemonlineshop
 
SULPHONAMIDES AND SULFONES Medicinal Chemistry III.ppt
SULPHONAMIDES AND SULFONES Medicinal Chemistry III.pptSULPHONAMIDES AND SULFONES Medicinal Chemistry III.ppt
SULPHONAMIDES AND SULFONES Medicinal Chemistry III.ppt
HRUTUJA WAGH
 
Antimalarial drug Medicinal Chemistry III
Antimalarial drug Medicinal Chemistry IIIAntimalarial drug Medicinal Chemistry III
Antimalarial drug Medicinal Chemistry III
HRUTUJA WAGH
 
Anti fungal agents Medicinal Chemistry III
Anti fungal agents Medicinal Chemistry  IIIAnti fungal agents Medicinal Chemistry  III
Anti fungal agents Medicinal Chemistry III
HRUTUJA WAGH
 
Fatigue and its management in aviation medicine
Fatigue and its management in aviation medicineFatigue and its management in aviation medicine
Fatigue and its management in aviation medicine
ImranJewel2
 
Chemistry of Warfare (Chemical weapons in warfare: An in-depth analysis of cl...
Chemistry of Warfare (Chemical weapons in warfare: An in-depth analysis of cl...Chemistry of Warfare (Chemical weapons in warfare: An in-depth analysis of cl...
Chemistry of Warfare (Chemical weapons in warfare: An in-depth analysis of cl...
Professional Content Writing's
 
Top 10 Biotech Startups for Beginners.pptx
Top 10 Biotech Startups for Beginners.pptxTop 10 Biotech Startups for Beginners.pptx
Top 10 Biotech Startups for Beginners.pptx
alexbagheriam
 
Reticular formation_groups_organization_
Reticular formation_groups_organization_Reticular formation_groups_organization_
Reticular formation_groups_organization_
klynct
 
dsDNA-ASF, asfaviridae, virus in virology presentation
dsDNA-ASF, asfaviridae, virus in virology presentationdsDNA-ASF, asfaviridae, virus in virology presentation
dsDNA-ASF, asfaviridae, virus in virology presentation
JessaMaeDacayo
 
Seismic evidence of liquid water at the base of Mars' upper crust
Seismic evidence of liquid water at the base of Mars' upper crustSeismic evidence of liquid water at the base of Mars' upper crust
Seismic evidence of liquid water at the base of Mars' upper crust
Sérgio Sacani
 

[PR12] Generative Models as Distributions of Functions

  • 1. Generative Models as Distributions of Functions (understanding it together with PR12). Jaejun Yoo, (current) Postdoc @EPFL, (from July) Assistant Prof. @UNIST. PR-312, 11th April, 2021
  • 2. Today’s contents “For all datasets, we use an MLP with 3 hidden layers of size 128 … and an MLP with 2 hidden layers of size 256 and 512” “We performed all training on a single 2080Ti GPU with 11GB of RAM.”
  • 3. Motivation and Main Problem “Conventional signal representations are usually discrete.” However, Mother Nature is continuous! (well… up to the Planck constant…?) Examples: 2D images, audio, 3D shapes.
  • 4. Motivation and Main Problem Continuous representation? Why hard? Of course, these functions are usually not analytically tractable: it is impossible to "write down" the function that parameterizes a natural image as a mathematical formula.
  • 5. Motivation and Main Problem Why important? • independent of spatial resolution (infinite resolution) • Geometric transformation of images: zoom, rotation, super-resolution. • Derivatives are well-defined.
  • 6. Motivation and Main Problem Why important? • independent of spatial resolution (infinite resolution) • Geometric transformation of images: zoom, rotation, super-resolution. • Derivatives are well-defined.
  • 7. Motivation and Main Problem Why important? Panels: piecewise constant, bilinear, cubic spline.
  • 8. Motivation and Main Problem Why important? Panels: piecewise constant, bilinear, cubic spline.
  • 9. Motivation and Main Problem Why important?
  • 10. Motivation and Main Problem Spline: Variational properties Explicit derivative!
  • 11. Motivation and Main Problem Spline: Variational properties
  • 12. Motivation and Main Problem Continuous representation? Implicit Neural Representation! • DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation (Park et al. 2019) • Occupancy Networks: Learning 3D Reconstruction in Function Space (Mescheder et al. 2019) • IM-Net: Learning Implicit Fields for Generative Shape Modeling (Chen et al. 2018) • … NeRF (PR-302)… “Implicit Neural Representations approximate this function via a neural network!”
  • 13. Implicit Neural Representation - Remarkably, the representation f_θ is independent of the number of pixels. Unlike most image representations, f_θ therefore does not depend on the resolution of the image. - The core property of these representations is that they scale with signal complexity and not with signal resolution.
  • 14. Learning Distributions of Functions Overall Scheme 1. Parameterizing a distribution over neural networks with a hypernetwork (Ha et al., 2017). “Sample the weights of a neural network” to obtain a function. : Learning a distribution over functions f_θ is equivalent to learning a distribution over weights p(θ). : Then p(θ), where θ = g_φ(z), is referred to as a neural function distribution (NFD).
  • 15. Learning Distributions of Functions Overall Scheme 1. Parameterizing a distribution over neural networks with a hypernetwork (Ha et al., 2017). “Sample the weights of a neural network” to obtain a function. : Learning a distribution over functions f_θ is equivalent to learning a distribution over weights p(θ). : Then p(θ), where θ = g_φ(z), is referred to as a neural function distribution (NFD). However! How do we get access to the ground truth functions to train the network?
  • 16. Learning Distributions of Functions Overall Scheme 1. Parameterizing a distribution over neural networks with a hypernetwork (Ha et al., 2017). “We do have access to input/output pairs of these functions through the coordinates and features, allowing us to learn function distributions without operating directly on the functions!”
  • 17. Learning Distributions of Functions Overall Scheme 1. Parameterizing a distribution over neural networks with a hypernetwork (Ha et al., 2017). 2. Training this distribution with an adversarial approach (Goodfellow et al., 2014). “We do have access to input/output pairs of these functions through the coordinates and features, allowing us to learn function distributions without operating directly on the functions!”
  • 18. Learning Distributions of Functions Overall Scheme 1. Parameterizing a distribution over neural networks with a hypernetwork (Ha et al., 2017). 2. Training this distribution with an adversarial approach (Goodfellow et al., 2014). The coordinate input uses a kind of positional encoding (Fourier features). “We do have access to input/output pairs of these functions through the coordinates and features, allowing us to learn function distributions without operating directly on the functions!”
  • 19. Learning Distributions of Functions Overall Scheme (NFD): Now we know how to design a network to learn continuous functions!
  • 20. Learning Distributions of Functions Overall Scheme (Discriminator): But the data we consider may not necessarily lie on a grid…
  • 21. Learning Distributions of Functions Overall Scheme (Discriminator): … in which case it is not possible to use convolutional discriminators.
  • 22. Learning Distributions of Functions Overall Scheme (Discriminator): Our discriminator should be able to distinguish between real and fake sets of coordinate and feature pairs.
  • 23. Point Cloud Discriminator Point Convolution: In contrast to regular convolutions, where the convolution kernels are only defined at certain grid locations, the convolution filters in PointConv are parameterized by an MLP mapping coordinates to kernel values.
  • 24. Experiments Implementation Setups “For all datasets, we use an MLP with 3 hidden layers of size 128 … and an MLP with 2 hidden layers of size 256 and 512” “We performed all training on a single 2080Ti GPU with 11GB of RAM.” “Remarkably, such a simple architecture is sufficient for learning rich distributions of images and 3D shapes.” “Use the exact same model for both images and 3D shapes except for the input and output dimensions of the function representation.”
  • 25. Results 2D Image generation • Samples from our model trained on CelebAHQ. • 64×64 (top) and 128×128 (bottom) • Each image corresponds to a function which was sampled from our model and then evaluated on the grid. • To produce this figure we sampled 5 batches and chose the best batch by visual inspection.
  • 26. Results Super-resolution “To infinity and beyond!” - Buzz Lightyear, Toy Story. Panels: NFD 64×64, NFD 256×256, Bicubic 256×256; NFD 28×28, NFD 256×256, Bicubic 256×256.
  • 27. Results 3D shapes Voxel grids from Choy et al. (2016) representing the chairs category from the ShapeNet (Chang et al., 2015) dataset. The dataset contains 6778 chairs, each of dimension 32³. For each 3D model, uniformly subsample K = 4096 points among the 32³ = 32,768 points and use them for training.
  • 28. Conclusion Summary of Contributions (I think) • A step towards making implicit neural representation methods genuinely useful for modeling datasets rather than individual data points. • The first framework to model data of this complexity in an entirely continuous fashion. • The ability to be independent of resolution and to operate outside of a grid. • A unique way of using point cloud discriminators.
  • 29. Things to discuss… • What kinds of studies could follow from this? • Architectural developments (better quality)? • Then how? Or what would be helpful? • Other applications? • Again, compute-driven AI vs human-knowledge based? • Big model vs inductive bias? • Etc.?
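
Slides 13 to 18 describe sampling the weights of a coordinate network from a hypernetwork and then evaluating the resulting function wherever we like. The following is a minimal sketch of that idea, not the talk's or the paper's implementation. Every name and size here is hypothetical, the "hypernetwork" is an untrained random linear map, the latent is assumed to be standard normal, and the adversarial training step is left out entirely.

```python
import math
import random

# All sizes below are hypothetical and far smaller than anything in the talk.
LATENT_DIM = 4          # size of the latent code z
HIDDEN = 8              # hidden width of the function network f_theta
NUM_FREQS = 3           # Fourier frequencies per coordinate
IN_DIM = 4 * NUM_FREQS  # sin and cos for each of the two coordinates
OUT_DIM = 3             # RGB feature

# f_theta has one hidden layer; all weights and biases live in one flat list.
NUM_THETA = IN_DIM * HIDDEN + HIDDEN + HIDDEN * OUT_DIM + OUT_DIM


def fourier_features(x, y):
    """Encode a 2D coordinate with sines and cosines at several frequencies."""
    feats = []
    for k in range(NUM_FREQS):
        freq = (2 ** k) * math.pi
        for c in (x, y):
            feats.extend((math.sin(freq * c), math.cos(freq * c)))
    return feats


def make_hypernetwork(rng):
    """Random linear map standing in for a trained hypernetwork (assumption)."""
    return [[rng.gauss(0.0, 0.3) for _ in range(LATENT_DIM)]
            for _ in range(NUM_THETA)]


def sample_theta(hyper, rng):
    """Draw z from a standard normal and map it to a flat weight vector theta."""
    z = [rng.gauss(0.0, 1.0) for _ in range(LATENT_DIM)]
    return [sum(w * zi for w, zi in zip(row, z)) for row in hyper]


def f_theta(theta, x, y):
    """Evaluate the sampled function at one coordinate; returns an RGB triple."""
    feats = fourier_features(x, y)
    pos = 0
    hidden = []
    for _ in range(HIDDEN):
        row = theta[pos:pos + IN_DIM]
        pos += IN_DIM
        hidden.append(sum(w * f for w, f in zip(row, feats)))
    for j in range(HIDDEN):
        hidden[j] = max(0.0, hidden[j] + theta[pos + j])  # add bias, then ReLU
    pos += HIDDEN
    out = []
    for _ in range(OUT_DIM):
        row = theta[pos:pos + HIDDEN]
        pos += HIDDEN
        out.append(sum(w * h for w, h in zip(row, hidden)))
    # The last OUT_DIM entries are output biases; the sigmoid bounds the values.
    return [1.0 / (1.0 + math.exp(-(o + theta[pos + j])))
            for j, o in enumerate(out)]


def render(theta, resolution):
    """Evaluate f_theta on a resolution x resolution grid over [0, 1]^2."""
    coords = [i / (resolution - 1) for i in range(resolution)]
    return [[f_theta(theta, x, y) for x in coords] for y in coords]


rng = random.Random(0)
hyper = make_hypernetwork(rng)
theta = sample_theta(hyper, rng)  # one sample = one function = one "image"
low = render(theta, 3)            # the same function on a coarse grid
high = render(theta, 5)           # and on a finer grid, with no resampling
```

The two renders share one weight vector, so the grid only decides where the function is queried. This is the sense in which the representation does not depend on resolution.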
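
Slide 23 says that PointConv filters are parameterized by an MLP mapping coordinates to kernel values, which is what lets the discriminator work on points that are not on a grid. Below is a rough sketch of that single idea under simplifying assumptions: scalar features, a scalar kernel, a fixed-radius neighbourhood, and a plain average. The function names and parameter values are hypothetical, and the sketch leaves out whatever else a full PointConv layer does.

```python
import math


def kernel_mlp(offset, params):
    """Tiny MLP mapping a relative coordinate to a scalar kernel value."""
    w1, b1, w2, b2 = params
    hidden = [math.tanh(sum(w * o for w, o in zip(row, offset)) + b)
              for row, b in zip(w1, b1)]
    return sum(w * h for w, h in zip(w2, hidden)) + b2


def point_conv(points, features, centre, params, radius):
    """Average kernel(x_i - centre) * feature_i over points within the radius."""
    total, count = 0.0, 0
    for point, feature in zip(points, features):
        offset = [p - c for p, c in zip(point, centre)]
        if math.sqrt(sum(o * o for o in offset)) <= radius:
            total += kernel_mlp(offset, params) * feature
            count += 1
    return total / count if count else 0.0


# Hypothetical kernel parameters: 2D offsets, two hidden units.
params = ([[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0], [0.5, -0.5], 1.0)
points = [(0.0, 0.0), (0.1, 0.0), (1.0, 1.0)]  # arbitrary, off-grid locations
features = [1.0, 3.0, 100.0]
value = point_conv(points, features, (0.0, 0.0), params, radius=0.5)
```

Because the kernel is a function of the offset rather than a table indexed by grid position, the same code accepts any set of coordinate and feature pairs, in any order.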
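
Slide 27 states that each 3D model is a 32³ voxel grid from which K = 4096 points are uniformly subsampled for training. The sketch below shows one way such a subsampling step could look. The toy occupancy function, the normalisation of coordinates to [0, 1], and all function names are assumptions made for illustration, and no real ShapeNet data is loaded.

```python
import random

GRID = 32   # voxel grid side length, from the slide
K = 4096    # number of points kept per shape, from the slide


def toy_occupancy(i, j, k):
    """Hypothetical shape: a ball centred in the grid (1 inside, 0 outside)."""
    c = (GRID - 1) / 2
    return 1 if (i - c) ** 2 + (j - c) ** 2 + (k - c) ** 2 <= 10 ** 2 else 0


def subsample(occupancy, num_points, rng):
    """Pick voxels uniformly without replacement as (coordinate, feature) pairs."""
    chosen = rng.sample(range(GRID ** 3), num_points)
    pairs = []
    for flat in chosen:
        i, rest = divmod(flat, GRID * GRID)
        j, k = divmod(rest, GRID)
        coord = (i / (GRID - 1), j / (GRID - 1), k / (GRID - 1))
        pairs.append((coord, occupancy(i, j, k)))
    return pairs


pairs = subsample(toy_occupancy, K, random.Random(0))
```

Each training example is then a set of 4096 coordinate and occupancy pairs, one eighth of the 32,768 voxels in the grid.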