Scalable Parallel Computing on Clouds

Thilina Gunarathne (tgunarat@indiana.edu)
Advisor: Prof. Geoffrey Fox (gcf@indiana.edu)
Committee: Prof. Judy Qiu, Prof. Beth Plale, Prof. David Leake

Clouds for scientific computations
• No upfront cost
• Zero maintenance
• Horizontal scalability

Compute, storage and other services

Loose service guarantees

Not trivial to utilize effectively 

Scalable Parallel Computing on Clouds
• Programming Models
• Scalability
• Performance
• Fault Tolerance
• Monitoring

Pleasingly Parallel Frameworks
Classic Cloud Frameworks
Cap3 Sequence Assembly
[Figure: parallel efficiency (50% to 100%) versus number of files for DryadLINQ, Hadoop, EC2 and Azure]
[Figure: per-core, per-file time in seconds (0 to 150) versus number of files (512 to 4096) for DryadLINQ, Hadoop, EC2 and Azure]

MapReduce
• Programming Model
• Fault Tolerance
• Moving Computation to Data
• Scalable

Ideal for data intensive pleasingly parallel applications

MRRoles4Azure

Azure Cloud Services
• Highly-available and scalable
• Utilize eventually-consistent, high-latency cloud services effectively
• Minimal maintenance and management overhead
Decentralized
• Avoids Single Point of Failure
• Global queue based dynamic scheduling
• Dynamically scale up/down

MapReduce
• First pure MapReduce for Azure
• Typical MapReduce fault tolerance

MRRoles4Azure

Azure Queues for scheduling, Tables to store meta-data and monitoring data, Blobs for
input/output/intermediate data storage.
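A minimal sketch of how a single global queue can give both dynamic scheduling and task-level fault tolerance. It assumes the queue hides a message for a visibility timeout once a worker reads it and only removes the message on an explicit delete. `VisibilityQueue` and `run_workers` are hypothetical in-memory stand-ins written for this illustration; they are not the Azure SDK and not the MRRoles4Azure code.

```python
import itertools


class VisibilityQueue:
    """Hypothetical in-memory stand-in for a cloud queue with visibility timeouts."""

    def __init__(self, timeout):
        self.timeout = timeout
        self._messages = {}            # message id -> [task, time it becomes visible]
        self._ids = itertools.count()

    def put(self, task):
        self._messages[next(self._ids)] = [task, 0.0]

    def get(self, now):
        """Return (id, task) for the first visible message and hide it, else None."""
        for msg_id, entry in self._messages.items():
            if entry[1] <= now:
                entry[1] = now + self.timeout
                return msg_id, entry[0]
        return None

    def delete(self, msg_id):
        """Called by a worker only after its task finished successfully."""
        self._messages.pop(msg_id, None)

    def __len__(self):
        return len(self._messages)


def run_workers(queue, worker_names, crash_on):
    """Simulate workers polling one global queue. A worker 'crashes' on the first
    attempt at any task in crash_on, so that message is never deleted and
    reappears once its visibility timeout expires."""
    completed = {}
    crashed = set()
    clock = 0.0
    while len(queue) > 0:
        for name in worker_names:
            item = queue.get(clock)
            if item is None:
                continue
            msg_id, task = item
            if task in crash_on and task not in crashed:
                crashed.add(task)      # simulated failure: no delete
                continue
            completed[task] = name
            queue.delete(msg_id)
        clock += 1.0
    return completed


queue = VisibilityQueue(timeout=3.0)
for task in ["map-0", "map-1", "map-2", "map-3"]:
    queue.put(task)
result = run_workers(queue, ["worker-a", "worker-b"], crash_on={"map-1"})
print(sorted(result))
```

No central master tracks the workers here: the failed task is simply picked up again by whichever worker polls the queue after the timeout.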

SWG Sequence Alignment

• Performance comparable to Hadoop, EMR
• Costs less than EMR

Smith-Waterman-GOTOH to calculate all-pairs dissimilarity
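A minimal sketch of the all-pairs computation, assuming the standard Gotoh recurrences for local alignment with affine gaps. The scoring parameters and the normalization used to turn a score into a dissimilarity are illustrative assumptions; the slides do not state which values or distance measure were used. Sequences are assumed to be non-empty.

```python
def swg_score(a, b, match=2, mismatch=-1, gap_open=3, gap_extend=1):
    """Best local alignment score with affine gaps (Gotoh's recurrences).
    A gap of length k costs gap_open + (k - 1) * gap_extend."""
    neg_inf = float("-inf")
    cols = len(b) + 1
    h_prev = [0] * cols          # H values of the previous row
    f_prev = [neg_inf] * cols    # F: alignments ending with a gap in b
    best = 0
    for i in range(1, len(a) + 1):
        h_cur = [0] * cols
        f_cur = [neg_inf] * cols
        e = neg_inf              # E: alignments ending with a gap in a
        for j in range(1, cols):
            e = max(e - gap_extend, h_cur[j - 1] - gap_open)
            f_cur[j] = max(f_prev[j] - gap_extend, h_prev[j] - gap_open)
            diag = h_prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            h_cur[j] = max(0, diag, e, f_cur[j])
            best = max(best, h_cur[j])
        h_prev, f_prev = h_cur, f_cur
    return best


def all_pairs_dissimilarity(seqs):
    """Symmetric matrix; only the upper triangle is computed.
    Assumed measure: 1 - score(a, b) / min(score(a, a), score(b, b))."""
    n = len(seqs)
    self_scores = [swg_score(s, s) for s in seqs]
    d = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            score = swg_score(seqs[i], seqs[j])
            d[i][j] = d[j][i] = 1.0 - score / min(self_scores[i], self_scores[j])
    return d


matrix = all_pairs_dissimilarity(["ACGT", "ACGT", "TTTT"])
print(matrix)
```

Because every pair is independent, the upper triangle can be split into blocks and each block handed to a separate map task, which is what makes the problem a good fit for MapReduce.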

Data Intensive Iterative Applications
Compute → Communication → Reduce/barrier → New Iteration
• Smaller loop-variant data: broadcast
• Larger loop-invariant data
• Growing class of applications
– Clustering, data mining, machine learning & dimension
reduction applications
– Driven by data deluge & emerging computation fields
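A minimal sketch of the iterative pattern, using one-dimensional K-means as a stand-in clustering application. The data partitions play the role of the larger loop-invariant data and stay in place across iterations; the centroids are the smaller loop-variant data handed to every map task in each iteration. All function names are hypothetical and the whole loop runs in one process purely for illustration.

```python
def assign_map(partition, centroids):
    """Map: for one data partition, emit per-centroid (sum, count) partials."""
    partials = {}
    for x in partition:
        nearest = min(range(len(centroids)), key=lambda c: abs(x - centroids[c]))
        total, count = partials.get(nearest, (0.0, 0))
        partials[nearest] = (total + x, count + 1)
    return partials


def combine_reduce(all_partials, centroids):
    """Reduce: combine the partials of every map task into new centroids."""
    sums = [0.0] * len(centroids)
    counts = [0] * len(centroids)
    for partials in all_partials:
        for c, (total, count) in partials.items():
            sums[c] += total
            counts[c] += count
    # An empty cluster keeps its previous centroid.
    return [sums[c] / counts[c] if counts[c] else centroids[c]
            for c in range(len(centroids))]


def iterate(partitions, centroids, max_iterations=100, tolerance=1e-9):
    """Run compute, reduce/barrier and broadcast until the centroids stop moving."""
    for iteration in range(1, max_iterations + 1):
        partials = [assign_map(p, centroids) for p in partitions]   # compute
        updated = combine_reduce(partials, centroids)               # reduce / barrier
        shift = max(abs(u - c) for u, c in zip(updated, centroids))
        centroids = updated                                         # broadcast
        if shift <= tolerance:
            break
    return centroids, iteration


partitions = [[1.0, 2.0, 9.0], [3.0, 10.0, 11.0]]   # loop-invariant data
final_centroids, iterations = iterate(partitions, [0.0, 5.0])
print(final_centroids, iterations)
```

Only the two centroids move between iterations in this example, while the partitions never change, which is why caching the invariant data pays off.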

Iterative MapReduce for Azure Cloud
• Extensions to support broadcast data
• Merge step
• Hybrid intermediate data transfer
• In-Memory/Disk caching of static data
http://salsahpc.indiana.edu/twister4azure

Hybrid Task Scheduling
• First iteration through queues
• New iteration in Job Bulletin Board
• Data in cache + task meta data history
• Left over tasks
• Cache aware hybrid scheduling
• Decentralized
• Fault Tolerant
• Multiple MapReduce applications within an iteration
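A minimal sketch of the cache-aware idea: tasks whose input is already cached on a worker are taken by that worker, and whatever is left over goes through a shared queue. `schedule_iteration` and its parameters are hypothetical, and the logic is written as a single function for brevity; in a decentralized design each worker would make these decisions itself after reading the new iteration from the bulletin board. The sketch assumes total worker capacity covers all tasks.

```python
from collections import deque


def schedule_iteration(tasks, worker_caches, capacity):
    """Assign the tasks of one iteration.

    tasks: list of (task_id, data_id) announced for the new iteration.
    worker_caches: dict of worker name -> set of data_ids held in its cache.
    capacity: maximum number of tasks each worker takes in this round.
    Returns (assignments, leftovers_taken).
    """
    assignments = {name: [] for name in worker_caches}
    leftover_queue = deque()

    # Step 1: a task goes to a worker that already caches its data, if one has room.
    for task_id, data_id in tasks:
        owner = next((name for name, cache in worker_caches.items()
                      if data_id in cache and len(assignments[name]) < capacity),
                     None)
        if owner is None:
            leftover_queue.append((task_id, data_id))
        else:
            assignments[owner].append(task_id)

    # Step 2: left-over tasks are drained from the shared queue by workers with
    # spare capacity; the worker fetches the data and keeps it in its cache.
    leftovers_taken = {}
    for name in worker_caches:
        while leftover_queue and len(assignments[name]) < capacity:
            task_id, data_id = leftover_queue.popleft()
            assignments[name].append(task_id)
            worker_caches[name].add(data_id)
            leftovers_taken[task_id] = name
    return assignments, leftovers_taken


tasks = [("t0", "d0"), ("t1", "d1"), ("t2", "d2"), ("t3", "d3")]
caches = {"worker-a": set(), "worker-b": set()}

# First iteration: nothing is cached, so every task goes through the queue.
first, first_leftovers = schedule_iteration(tasks, caches, capacity=2)
# Second iteration: every task finds its data in a cache.
second, second_leftovers = schedule_iteration(tasks, caches, capacity=2)
print(first, second, second_leftovers)
```

In the second call no task touches the queue, which mirrors the first bullet above: only the first iteration is scheduled entirely through queues.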

[Figures: Task Execution Time Histogram; Number of Executing Map Task Histogram; Strong Scaling with 128M Data Points; Weak Scaling]
• First iteration performs the initial data fetch
• Overhead between iterations
• Scales better than Hadoop on bare metal

Applications
• Bioinformatics pipeline
Gene Sequences → Pairwise Alignment & Distance Calculation → Distance Matrix
Distance Matrix → Clustering → Cluster Indices
Distance Matrix → Multi-Dimensional Scaling → Coordinates
Cluster Indices + Coordinates → Visualization → 3D Plot

http://salsahpc.indiana.edu/

Multi-Dimensional-Scaling
• Many iterations
• Memory & Data intensive
• 3 Map Reduce jobs per iteration
• Xk = invV * B(X(k-1)) * X(k-1)
• 2 matrix vector multiplications termed BC and X

BC: Calculate BX: Map → Reduce → Merge
X: Calculate invV (BX): Map → Reduce → Merge
Calculate Stress: Map → Reduce → Merge
New Iteration
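A minimal sequential sketch of one MDS iteration in the SMACOF style, assuming unit weights so that multiplying by invV reduces to dividing by the number of points n. The slides do not state the weighting, so this simplification is an assumption, and the function names are hypothetical. The three functions loosely correspond to the three jobs per iteration: computing B(X) times X, applying invV, and computing the stress.

```python
import math


def distances(x):
    """Euclidean distance matrix of the current coordinates x."""
    return [[math.dist(p, q) for q in x] for p in x]


def stress(delta, x):
    """Raw stress: squared error between target and embedded distances."""
    d = distances(x)
    n = len(x)
    return sum((delta[i][j] - d[i][j]) ** 2
               for i in range(n) for j in range(i + 1, n))


def b_matrix(delta, x):
    """B(X): off-diagonal entries are -delta_ij / d_ij(X) (0 where d_ij is 0);
    each diagonal entry makes its row sum to zero."""
    d = distances(x)
    n = len(x)
    b = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j and d[i][j] > 0.0:
                b[i][j] = -delta[i][j] / d[i][j]
        b[i][i] = -sum(b[i])       # diagonal is still 0.0 when the sum is taken
    return b


def smacof_step(delta, x):
    """One iteration X_k = (1/n) * B(X_{k-1}) * X_{k-1} (unit weights assumed)."""
    n, dim = len(x), len(x[0])
    b = b_matrix(delta, x)
    return [[sum(b[i][k] * x[k][c] for k in range(n)) / n for c in range(dim)]
            for i in range(n)]


# Target distances of a 3-4-5 right triangle, embedded in two dimensions.
delta = [[0.0, 3.0, 4.0], [3.0, 0.0, 5.0], [4.0, 5.0, 0.0]]
x = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
history = [stress(delta, x)]
for _ in range(20):
    x = smacof_step(delta, x)
    history.append(stress(delta, x))
print(history[0], history[-1])
```

In a distributed setting each row block of B(X) can be produced by a separate map task, since row i only needs the target distances of point i and the broadcast coordinates.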

Performance adjusted for sequential performance difference
[Figures: Data Size Scaling; Weak Scaling; Azure Instance Type Study; Number of Executing Map Task Histogram]
• First iteration performs the initial data fetch

BLAST Sequence Search

Scales better than Hadoop & EC2-Classic Cloud

Current Research
• Collective communication primitives
• Exploring additional data communication and
broadcasting mechanisms
– Fault tolerance
• Twister4Cloud
– Twister4Azure architecture implementations
for other cloud infrastructures

Contributions
• Twister4Azure
– Decentralized iterative MapReduce architecture for clouds
– More natural Iterative programming model extensions to
MapReduce model
– Leveraging eventually consistent cloud services for large scale
coordinated computations
• Performance comparison of applications in Clouds, VM
environments and in bare metal
• Exploration of the effect of data inhomogeneity for scientific
MapReduce run times
• Implementation of data mining and scientific applications for Azure
cloud as well as using Hadoop/DryadLINQ
• GPU OpenCL implementation of iterative data analysis algorithms

Acknowledgements
• My PhD advisory committee
• Present and past members of SALSA group –
Indiana University
• National Institutes of Health grant 5 RC2
HG005806-02.
• FutureGrid
• Microsoft Research
• Amazon AWS

Selected Publications
1. Gunarathne, T., Wu, T.-L., Choi, J. Y., Bae, S.-H. and Qiu, J. Cloud computing paradigms for pleasingly parallel
biomedical applications. Concurrency and Computation: Practice and Experience. doi: 10.1002/cpe.1780
2. Ekanayake, J., Gunarathne, T. and Qiu, J. Cloud Technologies for Bioinformatics Applications. IEEE Transactions on
Parallel and Distributed Systems, vol. 22, no. 6, pp. 998-1011, June 2011. doi: 10.1109/TPDS.2010.178
3. Thilina Gunarathne, Bingjing Zhang, Tak-Lon Wu and Judy Qiu. Portable Parallel Programming on Cloud and HPC:
Scientific Applications of Twister4Azure. In Proceedings of the fourth IEEE/ACM International Conference on
Utility and Cloud Computing (UCC 2011), Melbourne, Australia. 2011. To appear.
4. Gunarathne, T., J. Qiu, and G. Fox, Iterative MapReduce for Azure Cloud, Cloud Computing and Its
Applications, Argonne National Laboratory, Argonne, IL, 04/12-13/2011.
5. Gunarathne, T.; Tak-Lon Wu; Qiu, J.; Fox, G.; MapReduce in the Clouds for Science, Cloud Computing Technology
and Science (CloudCom), 2010 IEEE Second International Conference on , vol., no., pp.565-572, Nov. 30 2010-
Dec. 3 2010. doi: 10.1109/CloudCom.2010.107
6. Thilina Gunarathne, Bimalee Salpitikorala, and Arun Chauhan. Optimizing OpenCL Kernels for Iterative
Statistical Algorithms on GPUs. In Proceedings of the Second International Workshop on GPUs and Scientific
Applications (GPUScA), Galveston Island, TX. 2011.
7. Gunarathne, T., C. Herath, E. Chinthaka, and S. Marru, Experience with Adapting a WS-BPEL Runtime for
eScience Workflows. The International Conference for High Performance Computing, Networking, Storage and
Analysis (SC'09), Portland, OR, ACM Press, pp. 7, 11/20/2009
8. Judy Qiu, Jaliya Ekanayake, Thilina Gunarathne, Jong Youl Choi, Seung-Hee Bae, Yang Ruan, Saliya
Ekanayake, Stephen Wu, Scott Beason, Geoffrey Fox, Mina Rho, Haixu Tang. Data Intensive Computing for
Bioinformatics, Data Intensive Distributed Computing, Tevfik Kosar, Editor. 2011, IGI Publishers.

Questions?

Thank You!
http://salsahpc.indiana.edu/twister4azure
http://www.cs.indiana.edu/~tgunarat/
