SlideShare a Scribd company logo
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Vertex Centric Asynchronous Belief Propagation
Algorithm for Large-Scale Graphs
Gabriel Gimenes
Hugo Gualdron
Jose F. Rodrigues-Jr
Instituto de Ciencias Matematicas e de Computacao
University of Sao Paulo - Sao Carlos
DamNet - 2016 ICDM Workshop, Barcelona, Spain
This work has finantial support from FAPESP 2014/25337-0
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Outline
1 Introduction
2 Belief Propagation Algorithm
3 Methodology and Experiments
4 Conclusions
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Outline
1 Introduction
2 Belief Propagation Algorithm
3 Methodology and Experiments
4 Conclusions
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Context
Ubiquitous data generation
Information availability: pros and cons
Web 2.0 – users are producing data and not only consuming
Relationships between elements
Facebook, Twitter, Amazon, GooglePlay, Email
Intuitive modelling: Graphs(Networks)
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Problem
Analyzing large-scale networks – efficient and powerful
Some graphs (e.g YahooWeb e Twitter) may not fit memory
Naive processing: prohibitive
Alternative: distributed processing
complexity, infrastructure, cost
How to process in a single computational node?
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Rationale
New approaches: Taking advatange of the multi-core
architecturess
Centralized → Decentralized
Vertex-centric processing techniques
Block-based processing
Asynchronous processing
Proposals: TurboGraph, GraphChi, X-Stream, MMap,
M-Flash, FlashGraph; Pregel, GraphLab, Giraph.
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Vertex-centric paradigm
Vertex-centric model
procedure Graph scan(Graph G)
for i = 1 to |V | do
sete ← set of edges adjacent to V [i]
V [i].value ← f (sete )
for each edge e in sete do
e.value ← g(V [i].value, e.value)
Outer loop
procedure Graph processing
while convergence criterion is not satisfied do
Graph scan(G)
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Outline
1 Introduction
2 Belief Propagation Algorithm
3 Methodology and Experiments
4 Conclusions
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Algorithm
Belief propagation - bayesian inference method
Estimating the marginal probability distribution for
non-annotated nodes
Message passing: information travels from annotated to
unannotated nodes
Guilty-by-association or ”birds of a feather flock together”
Heterophily vs Homophily
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Problem
Original algorithm proposed for trees - no loops
Loopy BP (Murphy et al.) generalized algorithm
Problems with convergence and performance
Early applications in stereo-imaging and facial reconstruction
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Evolution
Performance and scalability: distributed processing
Gonzalez et al. – distributed inefficiencies
Kang et al. – algorithm relevance for anti-malware and fraud
detection applications
Gatterbauer et al. – linear approximation, convergence
guarantees and better performance
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
BP vs LinBP
Belief Propagation
bs (i) = es (i)
u∈N(s)
mus (i)
mst (i) =
c−1
j=0
Hst (j, i)es (j)
u∈N(s)t
mus (j)
Linearized Belief Propagation
ˆbs (i) = ˆes (i) +
1
k
u∈N(s)
ˆmus (i)
ˆmst (i) = k
j
ˆHst (j, i)ˆbs (j) −
j
ˆHst (j, i) ˆmts (j)
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Outline
1 Introduction
2 Belief Propagation Algorithm
3 Methodology and Experiments
4 Conclusions
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Proposal and contributions
Algorithm: change of paradigm, asynchronous parallel
vertex-centric processing
Convergence: better convergence speed (number of iterations)
Scalability: commodity computer
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Our algorithm
VC-LinBP
1: procedure VC-LinBP(G(V , E), VExplicit, H, h, t)
2: set H = hH
3: set H2 = H2
4: repeat
5: for each vertex in V do
6: Update(vertex)
7: until t iterations or convergence achieved
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Our algorithm
Update
1: procedure Update(vertex)
2: Set degree = 0
3: for each class c in vertex do initializing vertex values for each class
4: vertex.value(c) = 0
5: for each incoming edge e to vertex do processing incoming messages
6: degree+ = e.weight2
7: for each each class cfrom do
8: for each each class cto do
9: vertex.value(cto) += e.weight * e.value(cfrom) * H(cfrom, cto)
10: if vertex is not explicit then echo cancellation of messages
11: for each each class cfrom do
12: for each each class cto do
13: vertex.value(cto)− = degree ∗ vertex.value(cfrom) ∗ H2(cfrom, cto)
14: else adding explicit value of the vertex
15: vertex.value(c)+ = VExplicit (vertex)(c)
16: for each outgoing edge e from vertex do sending messages to neighbors
17: for each each class c do
18: e.value(c) = vertex.value(c)
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Experiments
Efficiency and efficacy
i7 CPU 8 cores, 16GB RAM, 240GB SSD
Comparison with LinBP
2 versions: single e multi-threaded
Utilizing the GraphChi framework
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Datasets
Generated with the Kronecker product method – SNAP
4 different networks
Datasets
Graph # Nodes # Edges
1 59,049 1,048,576
2 177,147 4,194,304
3 531,441 16,777,216
4 1,594,323 67,108,864
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Experiments
Coupling Matrix
1 2 3
1 0.266667 -0.033333 0.366667
2 0.033333 -0.333333 0.366667
3 -0.233333 0.366667 -0.133333
3 classes, 5% randomly initialized (annotated)
Coupling matrix and initialization procedure based on LinBP’s
experimentation
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Experiments - Validation
Validation
Graph Top-beliefs’ Agreement (%)
1 100%
2 100%
3 99%
4 99%
Divergences are related to tiebreak scenarios
Efficacy – to be expected
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Experiments - Scalability
Runtime (sec)
Graph LinBP-SQL VC-LinBP-1thread VC-LinBP-8threads
1 39.04 0.31 0.23
2 179 1.27 0.75
3 826 5.90 3.15
4 5000 34.62 18.69
Fixed number of iterations (5 iterations)
Only runtime is considered – excluding pre-processing time
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Experiments
1
10
100
1000
1e+06 1e+07 1e+08
Number of Edges
Runtime(sec)
VC_LinBP_1thread
VC_LinBP_8threads
LinBP
(a) Scalability
0
2
4
6
8
1 2 3 4
Dataset
NumberofIterations
LinBP
VC_LinBP
(b) Convergence
Elidan et al. – asynchronous version is at worst the same as synchronous
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Outline
1 Introduction
2 Belief Propagation Algorithm
3 Methodology and Experiments
4 Conclusions
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Future Work
In-memory implementation – performance comparison
Experiments with bigger datasets
Detailed tiebreak scenarios
Real-world dataset experiments – DBLP, Malware detection,
Image segmentation
Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions
Thank you!
Questions?
Ad

More Related Content

Viewers also liked (20)

Fast Billion-scale Graph Computation Using a Bimodal Block Processing Model
Fast Billion-scale Graph Computation Using a Bimodal Block Processing ModelFast Billion-scale Graph Computation Using a Bimodal Block Processing Model
Fast Billion-scale Graph Computation Using a Bimodal Block Processing Model
Universidade de São Paulo
 
Frequency plot and relevance plot to enhance visual data exploration
Frequency plot and relevance plot to enhance visual data explorationFrequency plot and relevance plot to enhance visual data exploration
Frequency plot and relevance plot to enhance visual data exploration
Universidade de São Paulo
 
Apresentacao vldb
Apresentacao vldbApresentacao vldb
Apresentacao vldb
Universidade de São Paulo
 
Reviewing Data Visualization: an Analytical Taxonomical Study
Reviewing Data Visualization: an Analytical Taxonomical StudyReviewing Data Visualization: an Analytical Taxonomical Study
Reviewing Data Visualization: an Analytical Taxonomical Study
Universidade de São Paulo
 
An introduction to MongoDB
An introduction to MongoDBAn introduction to MongoDB
An introduction to MongoDB
Universidade de São Paulo
 
Visualization tree multiple linked analytical decisions
Visualization tree multiple linked analytical decisionsVisualization tree multiple linked analytical decisions
Visualization tree multiple linked analytical decisions
Universidade de São Paulo
 
SuperGraph visualization
SuperGraph visualizationSuperGraph visualization
SuperGraph visualization
Universidade de São Paulo
 
6 7-metodologia depesquisaemcienciadacomputacao-escritadeartigocientifico-plagio
6 7-metodologia depesquisaemcienciadacomputacao-escritadeartigocientifico-plagio6 7-metodologia depesquisaemcienciadacomputacao-escritadeartigocientifico-plagio
6 7-metodologia depesquisaemcienciadacomputacao-escritadeartigocientifico-plagio
Universidade de São Paulo
 
Unveiling smoke in social images with the SmokeBlock approach
Unveiling smoke in social images with the SmokeBlock approachUnveiling smoke in social images with the SmokeBlock approach
Unveiling smoke in social images with the SmokeBlock approach
Universidade de São Paulo
 
Effective and Unsupervised Fractal-based Feature Selection for Very Large Dat...
Effective and Unsupervised Fractal-based Feature Selection for Very Large Dat...Effective and Unsupervised Fractal-based Feature Selection for Very Large Dat...
Effective and Unsupervised Fractal-based Feature Selection for Very Large Dat...
Universidade de São Paulo
 
On the Support of a Similarity-Enabled Relational Database Management System ...
On the Support of a Similarity-Enabled Relational Database Management System ...On the Support of a Similarity-Enabled Relational Database Management System ...
On the Support of a Similarity-Enabled Relational Database Management System ...
Universidade de São Paulo
 
StructMatrix: large-scale visualization of graphs by means of structure detec...
StructMatrix: large-scale visualization of graphs by means of structure detec...StructMatrix: large-scale visualization of graphs by means of structure detec...
StructMatrix: large-scale visualization of graphs by means of structure detec...
Universidade de São Paulo
 
Supervised-Learning Link Recommendation in the DBLP co-authoring network
Supervised-Learning Link Recommendation in the DBLP co-authoring networkSupervised-Learning Link Recommendation in the DBLP co-authoring network
Supervised-Learning Link Recommendation in the DBLP co-authoring network
Universidade de São Paulo
 
Multimodal graph-based analysis over the DBLP repository: critical discoverie...
Multimodal graph-based analysis over the DBLP repository: critical discoverie...Multimodal graph-based analysis over the DBLP repository: critical discoverie...
Multimodal graph-based analysis over the DBLP repository: critical discoverie...
Universidade de São Paulo
 
Techniques for effective and efficient fire detection from social media images
Techniques for effective and efficient fire detection from social media imagesTechniques for effective and efficient fire detection from social media images
Techniques for effective and efficient fire detection from social media images
Universidade de São Paulo
 
Physics of Algorithms Talk
Physics of Algorithms TalkPhysics of Algorithms Talk
Physics of Algorithms Talk
jasonj383
 
C04922125
C04922125C04922125
C04922125
IOSR-JEN
 
Efficient Belief Propagation in Depth Finding
Efficient Belief Propagation in Depth FindingEfficient Belief Propagation in Depth Finding
Efficient Belief Propagation in Depth Finding
Samantha Luber
 
Fire Detection on Unconstrained Videos Using Color-Aware Spatial Modeling and...
Fire Detection on Unconstrained Videos Using Color-Aware Spatial Modeling and...Fire Detection on Unconstrained Videos Using Color-Aware Spatial Modeling and...
Fire Detection on Unconstrained Videos Using Color-Aware Spatial Modeling and...
Universidade de São Paulo
 
Graph-based Relational Data Visualization
Graph-based RelationalData VisualizationGraph-based RelationalData Visualization
Graph-based Relational Data Visualization
Universidade de São Paulo
 
Fast Billion-scale Graph Computation Using a Bimodal Block Processing Model
Fast Billion-scale Graph Computation Using a Bimodal Block Processing ModelFast Billion-scale Graph Computation Using a Bimodal Block Processing Model
Fast Billion-scale Graph Computation Using a Bimodal Block Processing Model
Universidade de São Paulo
 
Frequency plot and relevance plot to enhance visual data exploration
Frequency plot and relevance plot to enhance visual data explorationFrequency plot and relevance plot to enhance visual data exploration
Frequency plot and relevance plot to enhance visual data exploration
Universidade de São Paulo
 
Reviewing Data Visualization: an Analytical Taxonomical Study
Reviewing Data Visualization: an Analytical Taxonomical StudyReviewing Data Visualization: an Analytical Taxonomical Study
Reviewing Data Visualization: an Analytical Taxonomical Study
Universidade de São Paulo
 
Visualization tree multiple linked analytical decisions
Visualization tree multiple linked analytical decisionsVisualization tree multiple linked analytical decisions
Visualization tree multiple linked analytical decisions
Universidade de São Paulo
 
6 7-metodologia depesquisaemcienciadacomputacao-escritadeartigocientifico-plagio
6 7-metodologia depesquisaemcienciadacomputacao-escritadeartigocientifico-plagio6 7-metodologia depesquisaemcienciadacomputacao-escritadeartigocientifico-plagio
6 7-metodologia depesquisaemcienciadacomputacao-escritadeartigocientifico-plagio
Universidade de São Paulo
 
Unveiling smoke in social images with the SmokeBlock approach
Unveiling smoke in social images with the SmokeBlock approachUnveiling smoke in social images with the SmokeBlock approach
Unveiling smoke in social images with the SmokeBlock approach
Universidade de São Paulo
 
Effective and Unsupervised Fractal-based Feature Selection for Very Large Dat...
Effective and Unsupervised Fractal-based Feature Selection for Very Large Dat...Effective and Unsupervised Fractal-based Feature Selection for Very Large Dat...
Effective and Unsupervised Fractal-based Feature Selection for Very Large Dat...
Universidade de São Paulo
 
On the Support of a Similarity-Enabled Relational Database Management System ...
On the Support of a Similarity-Enabled Relational Database Management System ...On the Support of a Similarity-Enabled Relational Database Management System ...
On the Support of a Similarity-Enabled Relational Database Management System ...
Universidade de São Paulo
 
StructMatrix: large-scale visualization of graphs by means of structure detec...
StructMatrix: large-scale visualization of graphs by means of structure detec...StructMatrix: large-scale visualization of graphs by means of structure detec...
StructMatrix: large-scale visualization of graphs by means of structure detec...
Universidade de São Paulo
 
Supervised-Learning Link Recommendation in the DBLP co-authoring network
Supervised-Learning Link Recommendation in the DBLP co-authoring networkSupervised-Learning Link Recommendation in the DBLP co-authoring network
Supervised-Learning Link Recommendation in the DBLP co-authoring network
Universidade de São Paulo
 
Multimodal graph-based analysis over the DBLP repository: critical discoverie...
Multimodal graph-based analysis over the DBLP repository: critical discoverie...Multimodal graph-based analysis over the DBLP repository: critical discoverie...
Multimodal graph-based analysis over the DBLP repository: critical discoverie...
Universidade de São Paulo
 
Techniques for effective and efficient fire detection from social media images
Techniques for effective and efficient fire detection from social media imagesTechniques for effective and efficient fire detection from social media images
Techniques for effective and efficient fire detection from social media images
Universidade de São Paulo
 
Physics of Algorithms Talk
Physics of Algorithms TalkPhysics of Algorithms Talk
Physics of Algorithms Talk
jasonj383
 
Efficient Belief Propagation in Depth Finding
Efficient Belief Propagation in Depth FindingEfficient Belief Propagation in Depth Finding
Efficient Belief Propagation in Depth Finding
Samantha Luber
 
Fire Detection on Unconstrained Videos Using Color-Aware Spatial Modeling and...
Fire Detection on Unconstrained Videos Using Color-Aware Spatial Modeling and...Fire Detection on Unconstrained Videos Using Color-Aware Spatial Modeling and...
Fire Detection on Unconstrained Videos Using Color-Aware Spatial Modeling and...
Universidade de São Paulo
 

Similar to Vertex Centric Asynchronous Belief Propagation Algorithm for Large-Scale Graphs (20)

Saliency Based Hookworm and Infection Detection for Wireless Capsule Endoscop...
Saliency Based Hookworm and Infection Detection for Wireless Capsule Endoscop...Saliency Based Hookworm and Infection Detection for Wireless Capsule Endoscop...
Saliency Based Hookworm and Infection Detection for Wireless Capsule Endoscop...
IRJET Journal
 
Tutorial on Deep Generative Models
 Tutorial on Deep Generative Models Tutorial on Deep Generative Models
Tutorial on Deep Generative Models
MLReview
 
New Search Strategies for the Petri Net CEGAR Approach
New Search Strategies for the Petri Net CEGAR ApproachNew Search Strategies for the Petri Net CEGAR Approach
New Search Strategies for the Petri Net CEGAR Approach
Akos Hajdu
 
NEURAL Network Design Training
NEURAL Network Design  TrainingNEURAL Network Design  Training
NEURAL Network Design Training
ESCOM
 
Presentation Slides - Genetic algorithm based key generation for fully homomo...
Presentation Slides - Genetic algorithm based key generation for fully homomo...Presentation Slides - Genetic algorithm based key generation for fully homomo...
Presentation Slides - Genetic algorithm based key generation for fully homomo...
MajedahAlkharji
 
Sound Empirical Evidence in Software Testing
Sound Empirical Evidence in Software TestingSound Empirical Evidence in Software Testing
Sound Empirical Evidence in Software Testing
Jaguaraci Silva
 
Noise-robust classification with hypergraph neural network
Noise-robust classification with hypergraph neural networkNoise-robust classification with hypergraph neural network
Noise-robust classification with hypergraph neural network
nooriasukmaningtyas
 
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...
IAEME Publication
 
Comparison between the genetic algorithms optimization and particle swarm opt...
Comparison between the genetic algorithms optimization and particle swarm opt...Comparison between the genetic algorithms optimization and particle swarm opt...
Comparison between the genetic algorithms optimization and particle swarm opt...
IAEME Publication
 
Eswc2009
Eswc2009Eswc2009
Eswc2009
fanizzi
 
DeepDRImageGuidedDiabeticRetinopathyDetectionUsingAttentionBasedDeepLearningS...
DeepDRImageGuidedDiabeticRetinopathyDetectionUsingAttentionBasedDeepLearningS...DeepDRImageGuidedDiabeticRetinopathyDetectionUsingAttentionBasedDeepLearningS...
DeepDRImageGuidedDiabeticRetinopathyDetectionUsingAttentionBasedDeepLearningS...
RamithaDevi
 
Empirical project powerpoint
Empirical project powerpointEmpirical project powerpoint
Empirical project powerpoint
Joe Krall
 
Concept Drift for obtaining Accurate Insight on Process Execution
Concept Drift for obtaining Accurate Insight on Process ExecutionConcept Drift for obtaining Accurate Insight on Process Execution
Concept Drift for obtaining Accurate Insight on Process Execution
iosrjce
 
I017366469
I017366469I017366469
I017366469
IOSR Journals
 
Flavours of Physics Challenge: Transfer Learning approach
Flavours of Physics Challenge: Transfer Learning approachFlavours of Physics Challenge: Transfer Learning approach
Flavours of Physics Challenge: Transfer Learning approach
Alexander Rakhlin
 
November, 2006 CCKM'06 1
November, 2006 CCKM'06 1 November, 2006 CCKM'06 1
November, 2006 CCKM'06 1
butest
 
Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)
Dmitry Grapov
 
Learning Sparse Networks using Targeted Dropout
Learning Sparse Networks using Targeted DropoutLearning Sparse Networks using Targeted Dropout
Learning Sparse Networks using Targeted Dropout
Seunghyun Hwang
 
(eBook PDF) Compressed Sensing in Radar Signal Processing
(eBook PDF) Compressed Sensing in Radar Signal Processing(eBook PDF) Compressed Sensing in Radar Signal Processing
(eBook PDF) Compressed Sensing in Radar Signal Processing
makispouchzh
 
sensors-20-04577-v3akslññidnlasjjc,,jas.pdf
sensors-20-04577-v3akslññidnlasjjc,,jas.pdfsensors-20-04577-v3akslññidnlasjjc,,jas.pdf
sensors-20-04577-v3akslññidnlasjjc,,jas.pdf
SergioDasilva48
 
Saliency Based Hookworm and Infection Detection for Wireless Capsule Endoscop...
Saliency Based Hookworm and Infection Detection for Wireless Capsule Endoscop...Saliency Based Hookworm and Infection Detection for Wireless Capsule Endoscop...
Saliency Based Hookworm and Infection Detection for Wireless Capsule Endoscop...
IRJET Journal
 
Tutorial on Deep Generative Models
 Tutorial on Deep Generative Models Tutorial on Deep Generative Models
Tutorial on Deep Generative Models
MLReview
 
New Search Strategies for the Petri Net CEGAR Approach
New Search Strategies for the Petri Net CEGAR ApproachNew Search Strategies for the Petri Net CEGAR Approach
New Search Strategies for the Petri Net CEGAR Approach
Akos Hajdu
 
NEURAL Network Design Training
NEURAL Network Design  TrainingNEURAL Network Design  Training
NEURAL Network Design Training
ESCOM
 
Presentation Slides - Genetic algorithm based key generation for fully homomo...
Presentation Slides - Genetic algorithm based key generation for fully homomo...Presentation Slides - Genetic algorithm based key generation for fully homomo...
Presentation Slides - Genetic algorithm based key generation for fully homomo...
MajedahAlkharji
 
Sound Empirical Evidence in Software Testing
Sound Empirical Evidence in Software TestingSound Empirical Evidence in Software Testing
Sound Empirical Evidence in Software Testing
Jaguaraci Silva
 
Noise-robust classification with hypergraph neural network
Noise-robust classification with hypergraph neural networkNoise-robust classification with hypergraph neural network
Noise-robust classification with hypergraph neural network
nooriasukmaningtyas
 
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...
COMPARISON BETWEEN THE GENETIC ALGORITHMS OPTIMIZATION AND PARTICLE SWARM OPT...
IAEME Publication
 
Comparison between the genetic algorithms optimization and particle swarm opt...
Comparison between the genetic algorithms optimization and particle swarm opt...Comparison between the genetic algorithms optimization and particle swarm opt...
Comparison between the genetic algorithms optimization and particle swarm opt...
IAEME Publication
 
Eswc2009
Eswc2009Eswc2009
Eswc2009
fanizzi
 
DeepDRImageGuidedDiabeticRetinopathyDetectionUsingAttentionBasedDeepLearningS...
DeepDRImageGuidedDiabeticRetinopathyDetectionUsingAttentionBasedDeepLearningS...DeepDRImageGuidedDiabeticRetinopathyDetectionUsingAttentionBasedDeepLearningS...
DeepDRImageGuidedDiabeticRetinopathyDetectionUsingAttentionBasedDeepLearningS...
RamithaDevi
 
Empirical project powerpoint
Empirical project powerpointEmpirical project powerpoint
Empirical project powerpoint
Joe Krall
 
Concept Drift for obtaining Accurate Insight on Process Execution
Concept Drift for obtaining Accurate Insight on Process ExecutionConcept Drift for obtaining Accurate Insight on Process Execution
Concept Drift for obtaining Accurate Insight on Process Execution
iosrjce
 
Flavours of Physics Challenge: Transfer Learning approach
Flavours of Physics Challenge: Transfer Learning approachFlavours of Physics Challenge: Transfer Learning approach
Flavours of Physics Challenge: Transfer Learning approach
Alexander Rakhlin
 
November, 2006 CCKM'06 1
November, 2006 CCKM'06 1 November, 2006 CCKM'06 1
November, 2006 CCKM'06 1
butest
 
Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)Metabolomic Data Analysis Workshop and Tutorials (2014)
Metabolomic Data Analysis Workshop and Tutorials (2014)
Dmitry Grapov
 
Learning Sparse Networks using Targeted Dropout
Learning Sparse Networks using Targeted DropoutLearning Sparse Networks using Targeted Dropout
Learning Sparse Networks using Targeted Dropout
Seunghyun Hwang
 
(eBook PDF) Compressed Sensing in Radar Signal Processing
(eBook PDF) Compressed Sensing in Radar Signal Processing(eBook PDF) Compressed Sensing in Radar Signal Processing
(eBook PDF) Compressed Sensing in Radar Signal Processing
makispouchzh
 
sensors-20-04577-v3akslññidnlasjjc,,jas.pdf
sensors-20-04577-v3akslññidnlasjjc,,jas.pdfsensors-20-04577-v3akslññidnlasjjc,,jas.pdf
sensors-20-04577-v3akslññidnlasjjc,,jas.pdf
SergioDasilva48
 
Ad

More from Universidade de São Paulo (13)

A gentle introduction to Deep Learning
A gentle introduction to Deep LearningA gentle introduction to Deep Learning
A gentle introduction to Deep Learning
Universidade de São Paulo
 
Computação: carreira e mercado de trabalho
Computação: carreira e mercado de trabalhoComputação: carreira e mercado de trabalho
Computação: carreira e mercado de trabalho
Universidade de São Paulo
 
Introdução às ferramentas de Business Intelligence do ecossistema Hadoop
Introdução às ferramentas de Business Intelligence do ecossistema HadoopIntrodução às ferramentas de Business Intelligence do ecossistema Hadoop
Introdução às ferramentas de Business Intelligence do ecossistema Hadoop
Universidade de São Paulo
 
Complexidade de Algoritmos, Notação assintótica, Algoritmos polinomiais e in...
Complexidade de Algoritmos, Notação assintótica, Algoritmos polinomiais e in...Complexidade de Algoritmos, Notação assintótica, Algoritmos polinomiais e in...
Complexidade de Algoritmos, Notação assintótica, Algoritmos polinomiais e in...
Universidade de São Paulo
 
Dawarehouse e OLAP
Dawarehouse e OLAPDawarehouse e OLAP
Dawarehouse e OLAP
Universidade de São Paulo
 
Metric s plat - a platform for quick development testing and visualization of...
Metric s plat - a platform for quick development testing and visualization of...Metric s plat - a platform for quick development testing and visualization of...
Metric s plat - a platform for quick development testing and visualization of...
Universidade de São Paulo
 
Hierarchical visual filtering pragmatic and epistemic actions for database vi...
Hierarchical visual filtering pragmatic and epistemic actions for database vi...Hierarchical visual filtering pragmatic and epistemic actions for database vi...
Hierarchical visual filtering pragmatic and epistemic actions for database vi...
Universidade de São Paulo
 
Java generics-basics
Java generics-basicsJava generics-basics
Java generics-basics
Universidade de São Paulo
 
Java collections-basic
Java collections-basicJava collections-basic
Java collections-basic
Universidade de São Paulo
 
Java network-sockets-etc
Java network-sockets-etcJava network-sockets-etc
Java network-sockets-etc
Universidade de São Paulo
 
Java streams
Java streamsJava streams
Java streams
Universidade de São Paulo
 
Infovis tutorial
Infovis tutorialInfovis tutorial
Infovis tutorial
Universidade de São Paulo
 
Java platform
Java platformJava platform
Java platform
Universidade de São Paulo
 
Ad

Recently uploaded (20)

Z14_IBM__APL_by_Christian_Demmer_IBM.pdf
Z14_IBM__APL_by_Christian_Demmer_IBM.pdfZ14_IBM__APL_by_Christian_Demmer_IBM.pdf
Z14_IBM__APL_by_Christian_Demmer_IBM.pdf
Fariborz Seyedloo
 
Transforming health care with ai powered
Transforming health care with ai poweredTransforming health care with ai powered
Transforming health care with ai powered
gowthamarvj
 
Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...
Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...
Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...
Jayantilal Bhanushali
 
indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...
indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...
indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...
disnakertransjabarda
 
Process Mining Machine Recoveries to Reduce Downtime
Process Mining Machine Recoveries to Reduce DowntimeProcess Mining Machine Recoveries to Reduce Downtime
Process Mining Machine Recoveries to Reduce Downtime
Process mining Evangelist
 
problem solving.presentation slideshow bsc nursing
problem solving.presentation slideshow bsc nursingproblem solving.presentation slideshow bsc nursing
problem solving.presentation slideshow bsc nursing
vishnudathas123
 
Oral Malodor.pptx jsjshdhushehsidjjeiejdhfj
Oral Malodor.pptx jsjshdhushehsidjjeiejdhfjOral Malodor.pptx jsjshdhushehsidjjeiejdhfj
Oral Malodor.pptx jsjshdhushehsidjjeiejdhfj
maitripatel5301
 
Ann Naser Nabil- Data Scientist Portfolio.pdf
Ann Naser Nabil- Data Scientist Portfolio.pdfAnn Naser Nabil- Data Scientist Portfolio.pdf
Ann Naser Nabil- Data Scientist Portfolio.pdf
আন্ নাসের নাবিল
 
TOAE201-Slides-Chapter 4. Sample theoretical basis (1).pdf
TOAE201-Slides-Chapter 4. Sample theoretical basis (1).pdfTOAE201-Slides-Chapter 4. Sample theoretical basis (1).pdf
TOAE201-Slides-Chapter 4. Sample theoretical basis (1).pdf
NhiV747372
 
RAG Chatbot using AWS Bedrock and Streamlit Framework
RAG Chatbot using AWS Bedrock and Streamlit FrameworkRAG Chatbot using AWS Bedrock and Streamlit Framework
RAG Chatbot using AWS Bedrock and Streamlit Framework
apanneer
 
Automation Platforms and Process Mining - success story
Automation Platforms and Process Mining - success storyAutomation Platforms and Process Mining - success story
Automation Platforms and Process Mining - success story
Process mining Evangelist
 
AWS RDS Presentation to make concepts easy.pptx
AWS RDS Presentation to make concepts easy.pptxAWS RDS Presentation to make concepts easy.pptx
AWS RDS Presentation to make concepts easy.pptx
bharatkumarbhojwani
 
national income & related aggregates (1)(1).pptx
national income & related aggregates (1)(1).pptxnational income & related aggregates (1)(1).pptx
national income & related aggregates (1)(1).pptx
j2492618
 
AWS-Certified-ML-Engineer-Associate-Slides.pdf
AWS-Certified-ML-Engineer-Associate-Slides.pdfAWS-Certified-ML-Engineer-Associate-Slides.pdf
AWS-Certified-ML-Engineer-Associate-Slides.pdf
philsparkshome
 
Dr. Robert Krug - Expert In Artificial Intelligence
Dr. Robert Krug - Expert In Artificial IntelligenceDr. Robert Krug - Expert In Artificial Intelligence
Dr. Robert Krug - Expert In Artificial Intelligence
Dr. Robert Krug
 
Automated Melanoma Detection via Image Processing.pptx
Automated Melanoma Detection via Image Processing.pptxAutomated Melanoma Detection via Image Processing.pptx
Automated Melanoma Detection via Image Processing.pptx
handrymaharjan23
 
Mining a Global Trade Process with Data Science - Microsoft
Mining a Global Trade Process with Data Science - MicrosoftMining a Global Trade Process with Data Science - Microsoft
Mining a Global Trade Process with Data Science - Microsoft
Process mining Evangelist
 
2024 Digital Equity Accelerator Report.pdf
2024 Digital Equity Accelerator Report.pdf2024 Digital Equity Accelerator Report.pdf
2024 Digital Equity Accelerator Report.pdf
dominikamizerska1
 
Analysis of Billboards hot 100 toop five hit makers on the chart.docx
Analysis of Billboards hot 100 toop five hit makers on the chart.docxAnalysis of Billboards hot 100 toop five hit makers on the chart.docx
Analysis of Billboards hot 100 toop five hit makers on the chart.docx
hershtara1
 
Multi-tenant Data Pipeline Orchestration
Multi-tenant Data Pipeline OrchestrationMulti-tenant Data Pipeline Orchestration
Multi-tenant Data Pipeline Orchestration
Romi Kuntsman
 
Z14_IBM__APL_by_Christian_Demmer_IBM.pdf
Z14_IBM__APL_by_Christian_Demmer_IBM.pdfZ14_IBM__APL_by_Christian_Demmer_IBM.pdf
Z14_IBM__APL_by_Christian_Demmer_IBM.pdf
Fariborz Seyedloo
 
Transforming health care with ai powered
Transforming health care with ai poweredTransforming health care with ai powered
Transforming health care with ai powered
gowthamarvj
 
Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...
Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...
Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...
Jayantilal Bhanushali
 
indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...
indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...
indonesia-gen-z-report-2024 Gen Z (born between 1997 and 2012) is currently t...
disnakertransjabarda
 
Process Mining Machine Recoveries to Reduce Downtime
Process Mining Machine Recoveries to Reduce DowntimeProcess Mining Machine Recoveries to Reduce Downtime
Process Mining Machine Recoveries to Reduce Downtime
Process mining Evangelist
 
problem solving.presentation slideshow bsc nursing
problem solving.presentation slideshow bsc nursingproblem solving.presentation slideshow bsc nursing
problem solving.presentation slideshow bsc nursing
vishnudathas123
 
Oral Malodor.pptx jsjshdhushehsidjjeiejdhfj
Oral Malodor.pptx jsjshdhushehsidjjeiejdhfjOral Malodor.pptx jsjshdhushehsidjjeiejdhfj
Oral Malodor.pptx jsjshdhushehsidjjeiejdhfj
maitripatel5301
 
TOAE201-Slides-Chapter 4. Sample theoretical basis (1).pdf
TOAE201-Slides-Chapter 4. Sample theoretical basis (1).pdfTOAE201-Slides-Chapter 4. Sample theoretical basis (1).pdf
TOAE201-Slides-Chapter 4. Sample theoretical basis (1).pdf
NhiV747372
 
RAG Chatbot using AWS Bedrock and Streamlit Framework
RAG Chatbot using AWS Bedrock and Streamlit FrameworkRAG Chatbot using AWS Bedrock and Streamlit Framework
RAG Chatbot using AWS Bedrock and Streamlit Framework
apanneer
 
Automation Platforms and Process Mining - success story
Automation Platforms and Process Mining - success storyAutomation Platforms and Process Mining - success story
Automation Platforms and Process Mining - success story
Process mining Evangelist
 
AWS RDS Presentation to make concepts easy.pptx
AWS RDS Presentation to make concepts easy.pptxAWS RDS Presentation to make concepts easy.pptx
AWS RDS Presentation to make concepts easy.pptx
bharatkumarbhojwani
 
national income & related aggregates (1)(1).pptx
national income & related aggregates (1)(1).pptxnational income & related aggregates (1)(1).pptx
national income & related aggregates (1)(1).pptx
j2492618
 
AWS-Certified-ML-Engineer-Associate-Slides.pdf
AWS-Certified-ML-Engineer-Associate-Slides.pdfAWS-Certified-ML-Engineer-Associate-Slides.pdf
AWS-Certified-ML-Engineer-Associate-Slides.pdf
philsparkshome
 
Dr. Robert Krug - Expert In Artificial Intelligence
Dr. Robert Krug - Expert In Artificial IntelligenceDr. Robert Krug - Expert In Artificial Intelligence
Dr. Robert Krug - Expert In Artificial Intelligence
Dr. Robert Krug
 
Automated Melanoma Detection via Image Processing.pptx
Automated Melanoma Detection via Image Processing.pptxAutomated Melanoma Detection via Image Processing.pptx
Automated Melanoma Detection via Image Processing.pptx
handrymaharjan23
 
Mining a Global Trade Process with Data Science - Microsoft
Mining a Global Trade Process with Data Science - MicrosoftMining a Global Trade Process with Data Science - Microsoft
Mining a Global Trade Process with Data Science - Microsoft
Process mining Evangelist
 
2024 Digital Equity Accelerator Report.pdf
2024 Digital Equity Accelerator Report.pdf2024 Digital Equity Accelerator Report.pdf
2024 Digital Equity Accelerator Report.pdf
dominikamizerska1
 
Analysis of Billboards hot 100 toop five hit makers on the chart.docx
Analysis of Billboards hot 100 toop five hit makers on the chart.docxAnalysis of Billboards hot 100 toop five hit makers on the chart.docx
Analysis of Billboards hot 100 toop five hit makers on the chart.docx
hershtara1
 
Multi-tenant Data Pipeline Orchestration
Multi-tenant Data Pipeline OrchestrationMulti-tenant Data Pipeline Orchestration
Multi-tenant Data Pipeline Orchestration
Romi Kuntsman
 

Vertex Centric Asynchronous Belief Propagation Algorithm for Large-Scale Graphs

  • 1. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Vertex Centric Asynchronous Belief Propagation Algorithm for Large-Scale Graphs Gabriel Gimenes Hugo Gualdron Jose F. Rodrigues-Jr Instituto de Ciencias Matematicas e de Computacao University of Sao Paulo - Sao Carlos DamNet - 2016 ICDM Workshop, Barcelona, Spain This work has finantial support from FAPESP 2014/25337-0
  • 2. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Outline 1 Introduction 2 Belief Propagation Algorithm 3 Methodology and Experiments 4 Conclusions
  • 3. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Outline 1 Introduction 2 Belief Propagation Algorithm 3 Methodology and Experiments 4 Conclusions
  • 4. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Context Ubiquitous data generation Information availability: pros and cons Web 2.0 – users are producing data and not only consuming Relationships between elements Facebook, Twitter, Amazon, GooglePlay, Email Intuitive modelling: Graphs(Networks)
  • 5. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Problem Analyzing large-scale networks – efficient and powerful Some graphs (e.g YahooWeb e Twitter) may not fit memory Naive processing: prohibitive Alternative: distributed processing complexity, infrastructure, cost How to process in a single computational node?
  • 6. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Rationale New approaches: Taking advatange of the multi-core architecturess Centralized → Decentralized Vertex-centric processing techniques Block-based processing Asynchronous processing Proposals: TurboGraph, GraphChi, X-Stream, MMap, M-Flash, FlashGraph; Pregel, GraphLab, Giraph.
  • 7. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Vertex-centric paradigm Vertex-centric model procedure Graph scan(Graph G) for i = 1 to |V | do sete ← set of edges adjacent to V [i] V [i].value ← f (sete ) for each edge e in sete do e.value ← g(V [i].value, e.value) Outer loop procedure Graph processing while convergence criterion is not satisfied do Graph scan(G)
  • 8. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Outline 1 Introduction 2 Belief Propagation Algorithm 3 Methodology and Experiments 4 Conclusions
  • 9. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Algorithm Belief propagation - bayesian inference method Estimating the marginal probability distribution for non-annotated nodes Message passing: information travels from annotated to unannotated nodes Guilty-by-association or ”birds of a feather flock together” Heterophily vs Homophily
  • 10. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Problem Original algorithm proposed for trees - no loops Loopy BP (Murphy et al.) generalized algorithm Problems with convergence and performance Early applications in stereo-imaging and facial reconstruction
  • 11. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Evolution Performance and scalability: distributed processing Gonzalez et al. – distributed inefficiencies Kang et al. – algorithm relevance for anti-malware and fraud detection applications Gatterbauer et al. – linear approximation, convergence guarantees and better performance
  • 12. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions BP vs LinBP Belief Propagation bs (i) = es (i) u∈N(s) mus (i) mst (i) = c−1 j=0 Hst (j, i)es (j) u∈N(s)t mus (j) Linearized Belief Propagation ˆbs (i) = ˆes (i) + 1 k u∈N(s) ˆmus (i) ˆmst (i) = k j ˆHst (j, i)ˆbs (j) − j ˆHst (j, i) ˆmts (j)
  • 13. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Outline 1 Introduction 2 Belief Propagation Algorithm 3 Methodology and Experiments 4 Conclusions
  • 14. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Proposal and contributions Algorithm: change of paradigm, asynchronous parallel vertex-centric processing Convergence: better convergence speed (number of iterations) Scalability: commodity computer
  • 15. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Our algorithm VC-LinBP 1: procedure VC-LinBP(G(V , E), VExplicit, H, h, t) 2: set H = hH 3: set H2 = H2 4: repeat 5: for each vertex in V do 6: Update(vertex) 7: until t iterations or convergence achieved
  • 16. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Our algorithm Update 1: procedure Update(vertex) 2: Set degree = 0 3: for each class c in vertex do initializing vertex values for each class 4: vertex.value(c) = 0 5: for each incoming edge e to vertex do processing incoming messages 6: degree+ = e.weight2 7: for each each class cfrom do 8: for each each class cto do 9: vertex.value(cto) += e.weight * e.value(cfrom) * H(cfrom, cto) 10: if vertex is not explicit then echo cancellation of messages 11: for each each class cfrom do 12: for each each class cto do 13: vertex.value(cto)− = degree ∗ vertex.value(cfrom) ∗ H2(cfrom, cto) 14: else adding explicit value of the vertex 15: vertex.value(c)+ = VExplicit (vertex)(c) 16: for each outgoing edge e from vertex do sending messages to neighbors 17: for each each class c do 18: e.value(c) = vertex.value(c)
  • 17. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Experiments Efficiency and efficacy i7 CPU 8 cores, 16GB RAM, 240GB SSD Comparison with LinBP 2 versions: single e multi-threaded Utilizing the GraphChi framework
  • 18. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Datasets Generated with the Kronecker product method – SNAP 4 different networks Datasets Graph # Nodes # Edges 1 59,049 1,048,576 2 177,147 4,194,304 3 531,441 16,777,216 4 1,594,323 67,108,864
  • 19. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Experiments Coupling Matrix 1 2 3 1 0.266667 -0.033333 0.366667 2 0.033333 -0.333333 0.366667 3 -0.233333 0.366667 -0.133333 3 classes, 5% randomly initialized (annotated) Coupling matrix and initialization procedure based on LinBP’s experimentation
  • 20. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Experiments - Validation Validation Graph Top-beliefs’ Agreement (%) 1 100% 2 100% 3 99% 4 99% Divergences are related to tiebreak scenarios Efficacy – to be expected
  • 21. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Experiments - Scalability Runtime (sec) Graph LinBP-SQL VC-LinBP-1thread VC-LinBP-8threads 1 39.04 0.31 0.23 2 179 1.27 0.75 3 826 5.90 3.15 4 5000 34.62 18.69 Fixed number of iterations (5 iterations) Only runtime is considered – excluding pre-processing time
  • 22. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Experiments 1 10 100 1000 1e+06 1e+07 1e+08 Number of Edges Runtime(sec) VC_LinBP_1thread VC_LinBP_8threads LinBP (a) Scalability 0 2 4 6 8 1 2 3 4 Dataset NumberofIterations LinBP VC_LinBP (b) Convergence Elidan et al. – asynchronous version is at worst the same as synchronous
  • 23. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Outline 1 Introduction 2 Belief Propagation Algorithm 3 Methodology and Experiments 4 Conclusions
  • 24. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Future Work In-memory implementation – performance comparison Experiments with bigger datasets Detailed tiebreak scenarios Real-world dataset experiments – DBLP, Malware detection, Image segmentation
  • 25. Introduction Belief Propagation Algorithm Methodology and Experiments Conclusions Thank you! Questions?
  翻译: