SlideShare a Scribd company logo
SRE
Bruno Connelly
Facilitating DevOps Execution in an
All Digital Delivery Environment
Kurt Andersen
Kurt Andersen
Sr. Staff
Site Reliability
Joined LinkedIn in January 2013
Background in managed services
and anti-abuse security
Introduction
Currently senior technical lead for Product-SRE
(all member & customer facing services)
OUR VISION
Create economic opportunity for every
member of the global workforce
ECONOMIC GRAPH
MEMBERS COMPANIES
JOBSSKILLS
SCHOOLS KNOWLEDGE
Growing Global Network
546M+ 100K
Members Articles published weekly
40%
yr/yr increase in engaged
feed sessions weekly
2+ 50%
New sign-ups per second
Active members use
Linkedin Messaging
weekly
100M+
Monthly Unique Visitors
Global Distribution
Numbers Behind the Scenes
380K 1.2M
Graph QPS
1.5K
Graph EdgesEdge QPS
75B
Services in production
10T
Kafka Messages
consumed/day
700TB 4.5T
Kafka Messages
published/day
20PB
Data storage Espresso Data
Engineering Scale
4K+ 90M
Engineers Lines of Code
600+
Feature Release/Day
25 mins 200+
Avg Build/Test/Publish Code Release/Day
15K
Commits/Day
Vision to Values
VISION
MISSION
VALUE PROPOSITION
TARGET AUDIENCES
STRATEGY
PRIORITIES
OBJECTIVES
CULTURE Transformation - Integrity - Collaboration - Humor - Results
VALUES
Members First - Relationships Matter - Be open, honest, and constructive -
Demand excellence - Take Intelligent Risks - Act like an Owner
Values
• Members First
• Relationships Matter
• Be Open, Honest, and Constructive
• Demand Excellence
• Take Intelligent Risks
• Act Like an Owner
Kevin Scott’s Hierarchy of Engineering Needs
FoundationSite Reliability
Engineering
Site Up & Secure
Technology at scale
Development at scale
Solid APIs and
building blocks
Efficient
Magic
Looking back 2010...
Member Growth
500,000,000
450,000,000
400,000,000
350,000,000
300,000,000
250,000,000
200,000,000
150,000,000
100,000,000
50,000,000
0
2003 2004 2005 20072006 2008 2009 2010 2011 2012 2013 2014 2015 2016
7 Years of Tech Debt
2017
We were here
Inversion
LinkedIn Operations
● Classical, stratified model: Systems,
Networks, Applications, DBA
● Heavy-weight processes driven by tickets
and heroes
● Culture of not trusting developers in any
deployed environments
● Huge wall and growing frustration between
Dev and Ops teams (and in ops itself)
● 7 engineers in total made up NOC, SRE,
Release Operations: “Site Operations”
● On-call was horrible
2010
Is the Site Up?
● Peak traffic periods Mon-Wed ~ 6-10am
● Regular capacity related outages Mon-
Wed ~ 6-10am
● Zero tolerance for failure in the
application stack
● Near zero instrumentation
● Bi-weekly downtime maintenances
2010
Let’s make a few changes
change software development model
active/active serving model
cheaper datacenters
remove monolithic databases
graceful degradation
remove hardware load balancers
more data centers
move to service oriented architecture24/7 deployments
dev driven deployments
replace java serialized objects over RPC with REST APIs
modernize our application stack
move faster
self service everything
code contributions to the main application stack
3x3 deployments
auto escalation
auto remediation
automated datacenter buildout
Development Practices 2010
33
Poor testing practices
Unmaintained, brittle
tests
1
Merge hell
Branch and isolate
engineers from each
other
2 Poor understanding
of change impact
Monolithic codebase
Unspecified dependencies
EFFECTS
Development Practices 2010
Best Case: Two weeks lag from
commit to production
deployment of a feature
Production Deployment:
Heroic efforts, released only
part of planned changes
Speed, Safety and Stability
Development Code Release Feature Release
1 2 3
 Developer Satisfaction & Happiness 
Development Practices 2018
33
Automatic detection and
rollback to reduce
MTTR
Basic code coverage
1 Stable shared code
base
Trunk based
development
2 Versioned
dependencies
Modular logical code
components
Development Practices 2018
Rapid, incremental, small
changes to production
throughout the day
Automated tooling gives
Go/No-Go signal at each
stage
15K+
Successful commits/day Build Test Jobs/Day
35K 28
Mins for Code Review
Core SRE Principles
Site Up Empower Developer
Ownership
Operations is an
Engineering Problem
1 2 3
Everyone should be able to deploy code
[safely]
Self-service Deployments
Promote to a single production data center
“Canary” to a single production instance
EKG: automated metrics-based validation
Ramp features slowly to the member base
Promote to remaining production data centers
1
2
3
4
5
15K+
Successful commits/day
Code promotions/day
200+
600+
Feature ramps/day
Create a culture of operational metrics
“What gets measured gets fixed”
REST API
Self-service Instrumentation and Monitoring
java
applications
non-java
applications
metrics
collectors
alerting visualization
metrics api
IRIS
23K
Graph dashboards
10M
Metrics ingested/sec
340K
Alerts processed/min
600M+
Total metrics
We don’t want a traditional NOC
[permanently]
Correlation
Engine
Self-service Remediation and Escalation
15K
Remediation Plans
Escalation Plans
9K
17K
Executions/day
Alerts Salt
Deployment
Metrics
Notify
(IRIS, JIRA, etc..)
FeedbackNurse
SFSan Francisco
SNVSunnyvale BLR Bangalore
NYC New York City
SRE
SRE Globally Today
400+ SREs across four global
offices
Composed of Software, Database, Security, and Infrastructure
Engineering generalists that make LinkedIn work
Embedded SRE Engagement Model
Partner with application
development teams leveraging
metrics, SLOs, and KPIs
Involved from software
inception to decommission
Participate in sprints, attend
regular staff meetings and sit
with the development teams
Contribute to code base: bug
fixes, instrumentation, logging,
improve efficiency, resilience
and scaling
Participate in on-call rotation for
critical issues along with
development team
Define production-readiness
and overall operability
requirements
Engineering Culture
Act like an
owner
Build
Leverage
Reduce
MTTR
Automate
Everything
Measure
Everything
Protect
Member
Data
Change Velocity vs. Member Impact
Velocity Major Medium
Change Velocity vs. Member Impact
Velocity Major Medium
Faster
Safer
Take-Home
Points
• Values Drive Culture & Practice
• Organize to Support Intent
• Measure Everything
Questions?
Facilitating DevOps Execution in an All Digital Environment
Ad

More Related Content

What's hot (19)

Spool manager pipe weld shop
Spool manager pipe weld shop Spool manager pipe weld shop
Spool manager pipe weld shop
Yakup Bozkurt
 
Modernizing Your Aging Architecture: What Enterprise Architects Need To Know ...
Modernizing Your Aging Architecture: What Enterprise Architects Need To Know ...Modernizing Your Aging Architecture: What Enterprise Architects Need To Know ...
Modernizing Your Aging Architecture: What Enterprise Architects Need To Know ...
Legacy Typesafe (now Lightbend)
 
SoCal DevOps Meetup 1/26/2017 - Habitat by Chef
SoCal DevOps Meetup 1/26/2017 - Habitat by ChefSoCal DevOps Meetup 1/26/2017 - Habitat by Chef
SoCal DevOps Meetup 1/26/2017 - Habitat by Chef
Trevor Hess
 
DevOps Powered by Splunk
DevOps Powered by SplunkDevOps Powered by Splunk
DevOps Powered by Splunk
Splunk
 
DevOps Powered by Splunk Hands-On
DevOps Powered by Splunk Hands-OnDevOps Powered by Splunk Hands-On
DevOps Powered by Splunk Hands-On
Splunk
 
Getting Started with IT Service Intelligence
Getting Started with IT Service IntelligenceGetting Started with IT Service Intelligence
Getting Started with IT Service Intelligence
Splunk
 
Rez gateway - RezOS - innovate the future
Rez gateway - RezOS -   innovate the futureRez gateway - RezOS -   innovate the future
Rez gateway - RezOS - innovate the future
indikaMaligaspe
 
Executing Deployment & Release Strategies
Executing Deployment & Release StrategiesExecuting Deployment & Release Strategies
Executing Deployment & Release Strategies
OpenSense Labs
 
An agile journey - Scania Connected Services at Meetup Go Agile - Stockholm (...
An agile journey - Scania Connected Services at Meetup Go Agile - Stockholm (...An agile journey - Scania Connected Services at Meetup Go Agile - Stockholm (...
An agile journey - Scania Connected Services at Meetup Go Agile - Stockholm (...
Anders Lundsgård
 
Scania's DevSecOps approach - Gamifying Security - auto:CODE
Scania's DevSecOps approach - Gamifying Security - auto:CODEScania's DevSecOps approach - Gamifying Security - auto:CODE
Scania's DevSecOps approach - Gamifying Security - auto:CODE
Anders Lundsgård
 
The Cloud Journey in an Enterprise - IDC Multicloud - Stockholm November 20, ...
The Cloud Journey in an Enterprise - IDC Multicloud - Stockholm November 20, ...The Cloud Journey in an Enterprise - IDC Multicloud - Stockholm November 20, ...
The Cloud Journey in an Enterprise - IDC Multicloud - Stockholm November 20, ...
Anders Lundsgård
 
The Cloud journey in an Enterprise - Delivery of Things World - Berlin April ...
The Cloud journey in an Enterprise - Delivery of Things World - Berlin April ...The Cloud journey in an Enterprise - Delivery of Things World - Berlin April ...
The Cloud journey in an Enterprise - Delivery of Things World - Berlin April ...
Anders Lundsgård
 
Innovation in Action - #MFSummit2017
Innovation in Action - #MFSummit2017Innovation in Action - #MFSummit2017
Innovation in Action - #MFSummit2017
Micro Focus
 
Costruire Applicazioni Cloud-Native con Spring Boot (Pivotal Cloud-Native Wor...
Costruire Applicazioni Cloud-Native con Spring Boot (Pivotal Cloud-Native Wor...Costruire Applicazioni Cloud-Native con Spring Boot (Pivotal Cloud-Native Wor...
Costruire Applicazioni Cloud-Native con Spring Boot (Pivotal Cloud-Native Wor...
VMware Tanzu
 
The DevOps journey in an Enterprise - CoDe-Conf. Stockholm September 14, 2017
The DevOps journey in an Enterprise - CoDe-Conf. Stockholm September 14, 2017The DevOps journey in an Enterprise - CoDe-Conf. Stockholm September 14, 2017
The DevOps journey in an Enterprise - CoDe-Conf. Stockholm September 14, 2017
Anders Lundsgård
 
Pushing the DevOps envelope into the network with microservices
Pushing the DevOps envelope into the network with microservicesPushing the DevOps envelope into the network with microservices
Pushing the DevOps envelope into the network with microservices
Lori MacVittie
 
RightScale Webinar: Continuous Integration and Delivery in the Cloud - How Ri...
RightScale Webinar: Continuous Integration and Delivery in the Cloud - How Ri...RightScale Webinar: Continuous Integration and Delivery in the Cloud - How Ri...
RightScale Webinar: Continuous Integration and Delivery in the Cloud - How Ri...
RightScale
 
0 to hero with Azure DevOps
0 to hero with Azure DevOps0 to hero with Azure DevOps
0 to hero with Azure DevOps
Christos Matskas
 
Cloud @ Scania - Södertälje Science Week
Cloud @ Scania - Södertälje Science WeekCloud @ Scania - Södertälje Science Week
Cloud @ Scania - Södertälje Science Week
Anders Lundsgård
 
Spool manager pipe weld shop
Spool manager pipe weld shop Spool manager pipe weld shop
Spool manager pipe weld shop
Yakup Bozkurt
 
Modernizing Your Aging Architecture: What Enterprise Architects Need To Know ...
Modernizing Your Aging Architecture: What Enterprise Architects Need To Know ...Modernizing Your Aging Architecture: What Enterprise Architects Need To Know ...
Modernizing Your Aging Architecture: What Enterprise Architects Need To Know ...
Legacy Typesafe (now Lightbend)
 
SoCal DevOps Meetup 1/26/2017 - Habitat by Chef
SoCal DevOps Meetup 1/26/2017 - Habitat by ChefSoCal DevOps Meetup 1/26/2017 - Habitat by Chef
SoCal DevOps Meetup 1/26/2017 - Habitat by Chef
Trevor Hess
 
DevOps Powered by Splunk
DevOps Powered by SplunkDevOps Powered by Splunk
DevOps Powered by Splunk
Splunk
 
DevOps Powered by Splunk Hands-On
DevOps Powered by Splunk Hands-OnDevOps Powered by Splunk Hands-On
DevOps Powered by Splunk Hands-On
Splunk
 
Getting Started with IT Service Intelligence
Getting Started with IT Service IntelligenceGetting Started with IT Service Intelligence
Getting Started with IT Service Intelligence
Splunk
 
Rez gateway - RezOS - innovate the future
Rez gateway - RezOS -   innovate the futureRez gateway - RezOS -   innovate the future
Rez gateway - RezOS - innovate the future
indikaMaligaspe
 
Executing Deployment & Release Strategies
Executing Deployment & Release StrategiesExecuting Deployment & Release Strategies
Executing Deployment & Release Strategies
OpenSense Labs
 
An agile journey - Scania Connected Services at Meetup Go Agile - Stockholm (...
An agile journey - Scania Connected Services at Meetup Go Agile - Stockholm (...An agile journey - Scania Connected Services at Meetup Go Agile - Stockholm (...
An agile journey - Scania Connected Services at Meetup Go Agile - Stockholm (...
Anders Lundsgård
 
Scania's DevSecOps approach - Gamifying Security - auto:CODE
Scania's DevSecOps approach - Gamifying Security - auto:CODEScania's DevSecOps approach - Gamifying Security - auto:CODE
Scania's DevSecOps approach - Gamifying Security - auto:CODE
Anders Lundsgård
 
The Cloud Journey in an Enterprise - IDC Multicloud - Stockholm November 20, ...
The Cloud Journey in an Enterprise - IDC Multicloud - Stockholm November 20, ...The Cloud Journey in an Enterprise - IDC Multicloud - Stockholm November 20, ...
The Cloud Journey in an Enterprise - IDC Multicloud - Stockholm November 20, ...
Anders Lundsgård
 
The Cloud journey in an Enterprise - Delivery of Things World - Berlin April ...
The Cloud journey in an Enterprise - Delivery of Things World - Berlin April ...The Cloud journey in an Enterprise - Delivery of Things World - Berlin April ...
The Cloud journey in an Enterprise - Delivery of Things World - Berlin April ...
Anders Lundsgård
 
Innovation in Action - #MFSummit2017
Innovation in Action - #MFSummit2017Innovation in Action - #MFSummit2017
Innovation in Action - #MFSummit2017
Micro Focus
 
Costruire Applicazioni Cloud-Native con Spring Boot (Pivotal Cloud-Native Wor...
Costruire Applicazioni Cloud-Native con Spring Boot (Pivotal Cloud-Native Wor...Costruire Applicazioni Cloud-Native con Spring Boot (Pivotal Cloud-Native Wor...
Costruire Applicazioni Cloud-Native con Spring Boot (Pivotal Cloud-Native Wor...
VMware Tanzu
 
The DevOps journey in an Enterprise - CoDe-Conf. Stockholm September 14, 2017
The DevOps journey in an Enterprise - CoDe-Conf. Stockholm September 14, 2017The DevOps journey in an Enterprise - CoDe-Conf. Stockholm September 14, 2017
The DevOps journey in an Enterprise - CoDe-Conf. Stockholm September 14, 2017
Anders Lundsgård
 
Pushing the DevOps envelope into the network with microservices
Pushing the DevOps envelope into the network with microservicesPushing the DevOps envelope into the network with microservices
Pushing the DevOps envelope into the network with microservices
Lori MacVittie
 
RightScale Webinar: Continuous Integration and Delivery in the Cloud - How Ri...
RightScale Webinar: Continuous Integration and Delivery in the Cloud - How Ri...RightScale Webinar: Continuous Integration and Delivery in the Cloud - How Ri...
RightScale Webinar: Continuous Integration and Delivery in the Cloud - How Ri...
RightScale
 
0 to hero with Azure DevOps
0 to hero with Azure DevOps0 to hero with Azure DevOps
0 to hero with Azure DevOps
Christos Matskas
 
Cloud @ Scania - Södertälje Science Week
Cloud @ Scania - Södertälje Science WeekCloud @ Scania - Södertälje Science Week
Cloud @ Scania - Södertälje Science Week
Anders Lundsgård
 

Similar to Facilitating DevOps Execution in an All Digital Environment (20)

Performance Metrics Driven CI/CD - Introduction to Continuous Innovation and ...
Performance Metrics Driven CI/CD - Introduction to Continuous Innovation and ...Performance Metrics Driven CI/CD - Introduction to Continuous Innovation and ...
Performance Metrics Driven CI/CD - Introduction to Continuous Innovation and ...
Mike Villiger
 
Preparing for Neo - Singapore OutSystems User Group October 2022 Meetup
Preparing for Neo - Singapore OutSystems User Group October 2022 MeetupPreparing for Neo - Singapore OutSystems User Group October 2022 Meetup
Preparing for Neo - Singapore OutSystems User Group October 2022 Meetup
YashrajNayak4
 
Tech foundations-slides
Tech foundations-slidesTech foundations-slides
Tech foundations-slides
tranquynh93
 
How to Migrate Applications Off a Mainframe
How to Migrate Applications Off a MainframeHow to Migrate Applications Off a Mainframe
How to Migrate Applications Off a Mainframe
VMware Tanzu
 
Keys to continuous testing for faster delivery euro star webinar
Keys to continuous testing for faster delivery euro star webinar Keys to continuous testing for faster delivery euro star webinar
Keys to continuous testing for faster delivery euro star webinar
TEST Huddle
 
Accelerating Digital Transformation: It's About Digital Enablement
Accelerating Digital Transformation:  It's About Digital EnablementAccelerating Digital Transformation:  It's About Digital Enablement
Accelerating Digital Transformation: It's About Digital Enablement
Joshua Gossett
 
Platform Requirements for CI/CD Success—and the Enterprises Leading the Way
Platform Requirements for CI/CD Success—and the Enterprises Leading the WayPlatform Requirements for CI/CD Success—and the Enterprises Leading the Way
Platform Requirements for CI/CD Success—and the Enterprises Leading the Way
VMware Tanzu
 
DevOps and Microservice
DevOps and MicroserviceDevOps and Microservice
DevOps and Microservice
Inho Kang
 
Cooper, Mark Resume.Final.8.2.16
Cooper, Mark Resume.Final.8.2.16Cooper, Mark Resume.Final.8.2.16
Cooper, Mark Resume.Final.8.2.16
Mark Cooper
 
Morphis Technologies Overview
Morphis Technologies OverviewMorphis Technologies Overview
Morphis Technologies Overview
jrhartley62
 
Connecting ALM Tools for a DevOps World with RLIA-TE
Connecting ALM Tools for a DevOps World with RLIA-TEConnecting ALM Tools for a DevOps World with RLIA-TE
Connecting ALM Tools for a DevOps World with RLIA-TE
Tasktop
 
Innovate Better Through Machine data Analytics
Innovate Better Through Machine data AnalyticsInnovate Better Through Machine data Analytics
Innovate Better Through Machine data Analytics
Hal Rottenberg
 
SplunkLive! London 2016 Splunk for Devops
SplunkLive! London 2016 Splunk for DevopsSplunkLive! London 2016 Splunk for Devops
SplunkLive! London 2016 Splunk for Devops
Splunk
 
Modernizing Applications with Microservices and DC/OS (Lightbend/Mesosphere c...
Modernizing Applications with Microservices and DC/OS (Lightbend/Mesosphere c...Modernizing Applications with Microservices and DC/OS (Lightbend/Mesosphere c...
Modernizing Applications with Microservices and DC/OS (Lightbend/Mesosphere c...
Lightbend
 
DEVNET-1125 Partner Case Study - “Project Hybrid Engineer”
DEVNET-1125	Partner Case Study - “Project Hybrid Engineer”DEVNET-1125	Partner Case Study - “Project Hybrid Engineer”
DEVNET-1125 Partner Case Study - “Project Hybrid Engineer”
Cisco DevNet
 
What is DevOps?
What is DevOps?What is DevOps?
What is DevOps?
Mesut Güneş
 
The Magic Of Application Lifecycle Management In Vs Public
The Magic Of Application Lifecycle Management In Vs PublicThe Magic Of Application Lifecycle Management In Vs Public
The Magic Of Application Lifecycle Management In Vs Public
David Solivan
 
DevOps on Oracle Cloud
DevOps on Oracle CloudDevOps on Oracle Cloud
DevOps on Oracle Cloud
Mee Nam Lee
 
A Software Factory Integrating Rational & WebSphere Tools
A Software Factory Integrating Rational & WebSphere ToolsA Software Factory Integrating Rational & WebSphere Tools
A Software Factory Integrating Rational & WebSphere Tools
ghodgkinson
 
Informatica to ODI Migration – What, Why and How | Informatica to Oracle Dat...
Informatica to ODI Migration – What, Why and How |  Informatica to Oracle Dat...Informatica to ODI Migration – What, Why and How |  Informatica to Oracle Dat...
Informatica to ODI Migration – What, Why and How | Informatica to Oracle Dat...
Jade Global
 
Performance Metrics Driven CI/CD - Introduction to Continuous Innovation and ...
Performance Metrics Driven CI/CD - Introduction to Continuous Innovation and ...Performance Metrics Driven CI/CD - Introduction to Continuous Innovation and ...
Performance Metrics Driven CI/CD - Introduction to Continuous Innovation and ...
Mike Villiger
 
Preparing for Neo - Singapore OutSystems User Group October 2022 Meetup
Preparing for Neo - Singapore OutSystems User Group October 2022 MeetupPreparing for Neo - Singapore OutSystems User Group October 2022 Meetup
Preparing for Neo - Singapore OutSystems User Group October 2022 Meetup
YashrajNayak4
 
Tech foundations-slides
Tech foundations-slidesTech foundations-slides
Tech foundations-slides
tranquynh93
 
How to Migrate Applications Off a Mainframe
How to Migrate Applications Off a MainframeHow to Migrate Applications Off a Mainframe
How to Migrate Applications Off a Mainframe
VMware Tanzu
 
Keys to continuous testing for faster delivery euro star webinar
Keys to continuous testing for faster delivery euro star webinar Keys to continuous testing for faster delivery euro star webinar
Keys to continuous testing for faster delivery euro star webinar
TEST Huddle
 
Accelerating Digital Transformation: It's About Digital Enablement
Accelerating Digital Transformation:  It's About Digital EnablementAccelerating Digital Transformation:  It's About Digital Enablement
Accelerating Digital Transformation: It's About Digital Enablement
Joshua Gossett
 
Platform Requirements for CI/CD Success—and the Enterprises Leading the Way
Platform Requirements for CI/CD Success—and the Enterprises Leading the WayPlatform Requirements for CI/CD Success—and the Enterprises Leading the Way
Platform Requirements for CI/CD Success—and the Enterprises Leading the Way
VMware Tanzu
 
DevOps and Microservice
DevOps and MicroserviceDevOps and Microservice
DevOps and Microservice
Inho Kang
 
Cooper, Mark Resume.Final.8.2.16
Cooper, Mark Resume.Final.8.2.16Cooper, Mark Resume.Final.8.2.16
Cooper, Mark Resume.Final.8.2.16
Mark Cooper
 
Morphis Technologies Overview
Morphis Technologies OverviewMorphis Technologies Overview
Morphis Technologies Overview
jrhartley62
 
Connecting ALM Tools for a DevOps World with RLIA-TE
Connecting ALM Tools for a DevOps World with RLIA-TEConnecting ALM Tools for a DevOps World with RLIA-TE
Connecting ALM Tools for a DevOps World with RLIA-TE
Tasktop
 
Innovate Better Through Machine data Analytics
Innovate Better Through Machine data AnalyticsInnovate Better Through Machine data Analytics
Innovate Better Through Machine data Analytics
Hal Rottenberg
 
SplunkLive! London 2016 Splunk for Devops
SplunkLive! London 2016 Splunk for DevopsSplunkLive! London 2016 Splunk for Devops
SplunkLive! London 2016 Splunk for Devops
Splunk
 
Modernizing Applications with Microservices and DC/OS (Lightbend/Mesosphere c...
Modernizing Applications with Microservices and DC/OS (Lightbend/Mesosphere c...Modernizing Applications with Microservices and DC/OS (Lightbend/Mesosphere c...
Modernizing Applications with Microservices and DC/OS (Lightbend/Mesosphere c...
Lightbend
 
DEVNET-1125 Partner Case Study - “Project Hybrid Engineer”
DEVNET-1125	Partner Case Study - “Project Hybrid Engineer”DEVNET-1125	Partner Case Study - “Project Hybrid Engineer”
DEVNET-1125 Partner Case Study - “Project Hybrid Engineer”
Cisco DevNet
 
The Magic Of Application Lifecycle Management In Vs Public
The Magic Of Application Lifecycle Management In Vs PublicThe Magic Of Application Lifecycle Management In Vs Public
The Magic Of Application Lifecycle Management In Vs Public
David Solivan
 
DevOps on Oracle Cloud
DevOps on Oracle CloudDevOps on Oracle Cloud
DevOps on Oracle Cloud
Mee Nam Lee
 
A Software Factory Integrating Rational & WebSphere Tools
A Software Factory Integrating Rational & WebSphere ToolsA Software Factory Integrating Rational & WebSphere Tools
A Software Factory Integrating Rational & WebSphere Tools
ghodgkinson
 
Informatica to ODI Migration – What, Why and How | Informatica to Oracle Dat...
Informatica to ODI Migration – What, Why and How |  Informatica to Oracle Dat...Informatica to ODI Migration – What, Why and How |  Informatica to Oracle Dat...
Informatica to ODI Migration – What, Why and How | Informatica to Oracle Dat...
Jade Global
 
Ad

More from Kurt Andersen (10)

Collective Mindfulness for Better Decision Making
Collective Mindfulness for Better Decision MakingCollective Mindfulness for Better Decision Making
Collective Mindfulness for Better Decision Making
Kurt Andersen
 
How bad is your toil? Measuring the Human Impact of Process
How bad is your toil? Measuring the Human Impact of ProcessHow bad is your toil? Measuring the Human Impact of Process
How bad is your toil? Measuring the Human Impact of Process
Kurt Andersen
 
Assessing stages of practice
Assessing stages of practiceAssessing stages of practice
Assessing stages of practice
Kurt Andersen
 
The NeverEnding Story: Site Reliability
The NeverEnding Story: Site ReliabilityThe NeverEnding Story: Site Reliability
The NeverEnding Story: Site Reliability
Kurt Andersen
 
Lessons from Iraq - Building & Running SRE Teams
Lessons from Iraq - Building & Running SRE TeamsLessons from Iraq - Building & Running SRE Teams
Lessons from Iraq - Building & Running SRE Teams
Kurt Andersen
 
What You Need to Know About Email Authentication
What You Need to Know About Email AuthenticationWhat You Need to Know About Email Authentication
What You Need to Know About Email Authentication
Kurt Andersen
 
Weeping Angels of Site Reliability
Weeping Angels of Site ReliabilityWeeping Angels of Site Reliability
Weeping Angels of Site Reliability
Kurt Andersen
 
Join us at #SREcon15
Join us at #SREcon15Join us at #SREcon15
Join us at #SREcon15
Kurt Andersen
 
Fighting Email Abuse with DMARC
Fighting Email Abuse with DMARCFighting Email Abuse with DMARC
Fighting Email Abuse with DMARC
Kurt Andersen
 
Operational Costs of Technical Debt
Operational Costs of Technical DebtOperational Costs of Technical Debt
Operational Costs of Technical Debt
Kurt Andersen
 
Collective Mindfulness for Better Decision Making
Collective Mindfulness for Better Decision MakingCollective Mindfulness for Better Decision Making
Collective Mindfulness for Better Decision Making
Kurt Andersen
 
How bad is your toil? Measuring the Human Impact of Process
How bad is your toil? Measuring the Human Impact of ProcessHow bad is your toil? Measuring the Human Impact of Process
How bad is your toil? Measuring the Human Impact of Process
Kurt Andersen
 
Assessing stages of practice
Assessing stages of practiceAssessing stages of practice
Assessing stages of practice
Kurt Andersen
 
The NeverEnding Story: Site Reliability
The NeverEnding Story: Site ReliabilityThe NeverEnding Story: Site Reliability
The NeverEnding Story: Site Reliability
Kurt Andersen
 
Lessons from Iraq - Building & Running SRE Teams
Lessons from Iraq - Building & Running SRE TeamsLessons from Iraq - Building & Running SRE Teams
Lessons from Iraq - Building & Running SRE Teams
Kurt Andersen
 
What You Need to Know About Email Authentication
What You Need to Know About Email AuthenticationWhat You Need to Know About Email Authentication
What You Need to Know About Email Authentication
Kurt Andersen
 
Weeping Angels of Site Reliability
Weeping Angels of Site ReliabilityWeeping Angels of Site Reliability
Weeping Angels of Site Reliability
Kurt Andersen
 
Join us at #SREcon15
Join us at #SREcon15Join us at #SREcon15
Join us at #SREcon15
Kurt Andersen
 
Fighting Email Abuse with DMARC
Fighting Email Abuse with DMARCFighting Email Abuse with DMARC
Fighting Email Abuse with DMARC
Kurt Andersen
 
Operational Costs of Technical Debt
Operational Costs of Technical DebtOperational Costs of Technical Debt
Operational Costs of Technical Debt
Kurt Andersen
 
Ad

Recently uploaded (15)

Cloud-to-cloud Migration presentation.pptx
Cloud-to-cloud Migration presentation.pptxCloud-to-cloud Migration presentation.pptx
Cloud-to-cloud Migration presentation.pptx
marketing140789
 
AG-FIRMA Ai Agent for Agriculture | RAG ..
AG-FIRMA Ai Agent for Agriculture  | RAG ..AG-FIRMA Ai Agent for Agriculture  | RAG ..
AG-FIRMA Ai Agent for Agriculture | RAG ..
Anass Nabil
 
introduction to html and cssIntroHTML.ppt
introduction to html and cssIntroHTML.pptintroduction to html and cssIntroHTML.ppt
introduction to html and cssIntroHTML.ppt
SherifElGohary7
 
DEF CON 25 - Whitney-Merrill-and-Terrell-McSweeny-Tick-Tick-Boom-Tech-and-the...
DEF CON 25 - Whitney-Merrill-and-Terrell-McSweeny-Tick-Tick-Boom-Tech-and-the...DEF CON 25 - Whitney-Merrill-and-Terrell-McSweeny-Tick-Tick-Boom-Tech-and-the...
DEF CON 25 - Whitney-Merrill-and-Terrell-McSweeny-Tick-Tick-Boom-Tech-and-the...
werhkr1
 
学生卡英国RCA毕业证皇家艺术学院电子毕业证学历证书
学生卡英国RCA毕业证皇家艺术学院电子毕业证学历证书学生卡英国RCA毕业证皇家艺术学院电子毕业证学历证书
学生卡英国RCA毕业证皇家艺术学院电子毕业证学历证书
Taqyea
 
GiacomoVacca - WebRTC - troubleshooting media negotiation.pdf
GiacomoVacca - WebRTC - troubleshooting media negotiation.pdfGiacomoVacca - WebRTC - troubleshooting media negotiation.pdf
GiacomoVacca - WebRTC - troubleshooting media negotiation.pdf
Giacomo Vacca
 
Paper: World Game (s) Great Redesign.pdf
Paper: World Game (s) Great Redesign.pdfPaper: World Game (s) Great Redesign.pdf
Paper: World Game (s) Great Redesign.pdf
Steven McGee
 
The Hidden Risks of Hiring Hackers to Change Grades: An Awareness Guide
The Hidden Risks of Hiring Hackers to Change Grades: An Awareness GuideThe Hidden Risks of Hiring Hackers to Change Grades: An Awareness Guide
The Hidden Risks of Hiring Hackers to Change Grades: An Awareness Guide
russellpeter1995
 
美国文凭明尼苏达大学莫里斯分校毕业证范本UMM学位证书
美国文凭明尼苏达大学莫里斯分校毕业证范本UMM学位证书美国文凭明尼苏达大学莫里斯分校毕业证范本UMM学位证书
美国文凭明尼苏达大学莫里斯分校毕业证范本UMM学位证书
Taqyea
 
How to Install & Activate ListGrabber - eGrabber
How to Install & Activate ListGrabber - eGrabberHow to Install & Activate ListGrabber - eGrabber
How to Install & Activate ListGrabber - eGrabber
eGrabber
 
Presentation Mehdi Monitorama 2022 Cancer and Monitoring
Presentation Mehdi Monitorama 2022 Cancer and MonitoringPresentation Mehdi Monitorama 2022 Cancer and Monitoring
Presentation Mehdi Monitorama 2022 Cancer and Monitoring
mdaoudi
 
IoT PPT introduction to internet of things
IoT PPT introduction to internet of thingsIoT PPT introduction to internet of things
IoT PPT introduction to internet of things
VaishnaviPatil3995
 
Breaking Down the Latest Spectrum Internet Plans.pdf
Breaking Down the Latest Spectrum Internet Plans.pdfBreaking Down the Latest Spectrum Internet Plans.pdf
Breaking Down the Latest Spectrum Internet Plans.pdf
Internet Bundle Now
 
CompTIA-Security-Study-Guide-with-over-500-Practice-Test-Questions-Exam-SY0-7...
CompTIA-Security-Study-Guide-with-over-500-Practice-Test-Questions-Exam-SY0-7...CompTIA-Security-Study-Guide-with-over-500-Practice-Test-Questions-Exam-SY0-7...
CompTIA-Security-Study-Guide-with-over-500-Practice-Test-Questions-Exam-SY0-7...
emestica1
 
ProjectArtificial Intelligence Good or Evil.pptx
ProjectArtificial Intelligence Good or Evil.pptxProjectArtificial Intelligence Good or Evil.pptx
ProjectArtificial Intelligence Good or Evil.pptx
OlenaKotovska
 
Cloud-to-cloud Migration presentation.pptx
Cloud-to-cloud Migration presentation.pptxCloud-to-cloud Migration presentation.pptx
Cloud-to-cloud Migration presentation.pptx
marketing140789
 
AG-FIRMA Ai Agent for Agriculture | RAG ..
AG-FIRMA Ai Agent for Agriculture  | RAG ..AG-FIRMA Ai Agent for Agriculture  | RAG ..
AG-FIRMA Ai Agent for Agriculture | RAG ..
Anass Nabil
 
introduction to html and cssIntroHTML.ppt
introduction to html and cssIntroHTML.pptintroduction to html and cssIntroHTML.ppt
introduction to html and cssIntroHTML.ppt
SherifElGohary7
 
DEF CON 25 - Whitney-Merrill-and-Terrell-McSweeny-Tick-Tick-Boom-Tech-and-the...
DEF CON 25 - Whitney-Merrill-and-Terrell-McSweeny-Tick-Tick-Boom-Tech-and-the...DEF CON 25 - Whitney-Merrill-and-Terrell-McSweeny-Tick-Tick-Boom-Tech-and-the...
DEF CON 25 - Whitney-Merrill-and-Terrell-McSweeny-Tick-Tick-Boom-Tech-and-the...
werhkr1
 
学生卡英国RCA毕业证皇家艺术学院电子毕业证学历证书
学生卡英国RCA毕业证皇家艺术学院电子毕业证学历证书学生卡英国RCA毕业证皇家艺术学院电子毕业证学历证书
学生卡英国RCA毕业证皇家艺术学院电子毕业证学历证书
Taqyea
 
GiacomoVacca - WebRTC - troubleshooting media negotiation.pdf
GiacomoVacca - WebRTC - troubleshooting media negotiation.pdfGiacomoVacca - WebRTC - troubleshooting media negotiation.pdf
GiacomoVacca - WebRTC - troubleshooting media negotiation.pdf
Giacomo Vacca
 
Paper: World Game (s) Great Redesign.pdf
Paper: World Game (s) Great Redesign.pdfPaper: World Game (s) Great Redesign.pdf
Paper: World Game (s) Great Redesign.pdf
Steven McGee
 
The Hidden Risks of Hiring Hackers to Change Grades: An Awareness Guide
The Hidden Risks of Hiring Hackers to Change Grades: An Awareness GuideThe Hidden Risks of Hiring Hackers to Change Grades: An Awareness Guide
The Hidden Risks of Hiring Hackers to Change Grades: An Awareness Guide
russellpeter1995
 
美国文凭明尼苏达大学莫里斯分校毕业证范本UMM学位证书
美国文凭明尼苏达大学莫里斯分校毕业证范本UMM学位证书美国文凭明尼苏达大学莫里斯分校毕业证范本UMM学位证书
美国文凭明尼苏达大学莫里斯分校毕业证范本UMM学位证书
Taqyea
 
How to Install & Activate ListGrabber - eGrabber
How to Install & Activate ListGrabber - eGrabberHow to Install & Activate ListGrabber - eGrabber
How to Install & Activate ListGrabber - eGrabber
eGrabber
 
Presentation Mehdi Monitorama 2022 Cancer and Monitoring
Presentation Mehdi Monitorama 2022 Cancer and MonitoringPresentation Mehdi Monitorama 2022 Cancer and Monitoring
Presentation Mehdi Monitorama 2022 Cancer and Monitoring
mdaoudi
 
IoT PPT introduction to internet of things
IoT PPT introduction to internet of thingsIoT PPT introduction to internet of things
IoT PPT introduction to internet of things
VaishnaviPatil3995
 
Breaking Down the Latest Spectrum Internet Plans.pdf
Breaking Down the Latest Spectrum Internet Plans.pdfBreaking Down the Latest Spectrum Internet Plans.pdf
Breaking Down the Latest Spectrum Internet Plans.pdf
Internet Bundle Now
 
CompTIA-Security-Study-Guide-with-over-500-Practice-Test-Questions-Exam-SY0-7...
CompTIA-Security-Study-Guide-with-over-500-Practice-Test-Questions-Exam-SY0-7...CompTIA-Security-Study-Guide-with-over-500-Practice-Test-Questions-Exam-SY0-7...
CompTIA-Security-Study-Guide-with-over-500-Practice-Test-Questions-Exam-SY0-7...
emestica1
 
ProjectArtificial Intelligence Good or Evil.pptx
ProjectArtificial Intelligence Good or Evil.pptxProjectArtificial Intelligence Good or Evil.pptx
ProjectArtificial Intelligence Good or Evil.pptx
OlenaKotovska
 

Facilitating DevOps Execution in an All Digital Environment

  • 1. SRE Bruno Connelly Facilitating DevOps Execution in an All Digital Delivery Environment Kurt Andersen
  • 2. Kurt Andersen Sr. Staff Site Reliability Joined LinkedIn in January 2013 Background in managed services and anti-abuse security Introduction Currently senior technical lead for Product-SRE (all member & customer facing services)
  • 3. OUR VISION Create economic opportunity for every member of the global workforce
  • 5. Growing Global Network 546M+ 100K Members Articles published weekly 40% yr/yr increase in engaged feed sessions weekly 2+ 50% New sign-ups per second Active members use Linkedin Messaging weekly 100M+ Monthly Unique Visitors
  • 7. Numbers Behind the Scenes 380K 1.2M Graph QPS 1.5K Graph EdgesEdge QPS 75B Services in production 10T Kafka Messages consumed/day 700TB 4.5T Kafka Messages published/day 20PB Data storage Espresso Data
  • 8. Engineering Scale 4K+ 90M Engineers Lines of Code 600+ Feature Release/Day 25 mins 200+ Avg Build/Test/Publish Code Release/Day 15K Commits/Day
  • 9. Vision to Values VISION MISSION VALUE PROPOSITION TARGET AUDIENCES STRATEGY PRIORITIES OBJECTIVES CULTURE Transformation - Integrity - Collaboration - Humor - Results VALUES Members First - Relationships Matter - Be open, honest, and constructive - Demand excellence - Take Intelligent Risks - Act like an Owner
  • 10. Values • Members First • Relationships Matter • Be Open, Honest, and Constructive • Demand Excellence • Take Intelligent Risks • Act Like an Owner
  • 11. Kevin Scott’s Hierarchy of Engineering Needs FoundationSite Reliability Engineering Site Up & Secure Technology at scale Development at scale Solid APIs and building blocks Efficient Magic
  • 13. Member Growth 500,000,000 450,000,000 400,000,000 350,000,000 300,000,000 250,000,000 200,000,000 150,000,000 100,000,000 50,000,000 0 2003 2004 2005 20072006 2008 2009 2010 2011 2012 2013 2014 2015 2016 7 Years of Tech Debt 2017 We were here Inversion
  • 14. LinkedIn Operations ● Classical, stratified model: Systems, Networks, Applications, DBA ● Heavy-weight processes driven by tickets and heroes ● Culture of not trusting developers in any deployed environments ● Huge wall and growing frustration between Dev and Ops teams (and in ops itself) ● 7 engineers in total made up NOC, SRE, Release Operations: “Site Operations” ● On-call was horrible 2010
  • 15. Is the Site Up? ● Peak traffic periods Mon-Wed ~ 6-10am ● Regular capacity related outages Mon- Wed ~ 6-10am ● Zero tolerance for failure in the application stack ● Near zero instrumentation ● Bi-weekly downtime maintenances 2010
  • 16. Let’s make a few changes change software development model active/active serving model cheaper datacenters remove monolithic databases graceful degradation remove hardware load balancers more data centers move to service oriented architecture24/7 deployments dev driven deployments replace java serialized objects over RPC with REST APIs modernize our application stack move faster self service everything code contributions to the main application stack 3x3 deployments auto escalation auto remediation automated datacenter buildout
  • 17. Development Practices 2010 33 Poor testing practices Unmaintained, brittle tests 1 Merge hell Branch and isolate engineers from each other 2 Poor understanding of change impact Monolithic codebase Unspecified dependencies
  • 18. EFFECTS Development Practices 2010 Best Case: Two weeks lag from commit to production deployment of a feature Production Deployment: Heroic efforts, released only part of planned changes
  • 19. Speed, Safety and Stability Development Code Release Feature Release 1 2 3  Developer Satisfaction & Happiness 
  • 20. Development Practices 2018 33 Automatic detection and rollback to reduce MTTR Basic code coverage 1 Stable shared code base Trunk based development 2 Versioned dependencies Modular logical code components
  • 21. Development Practices 2018 Rapid, incremental, small changes to production throughout the day Automated tooling gives Go/No-Go signal at each stage 15K+ Successful commits/day Build Test Jobs/Day 35K 28 Mins for Code Review
  • 22. Core SRE Principles Site Up Empower Developer Ownership Operations is an Engineering Problem 1 2 3
  • 23. Everyone should be able to deploy code [safely]
  • 24. Self-service Deployments Promote to a single production data center “Canary” to a single production instance EKG: automated metrics-based validation Ramp features slowly to the member base Promote to remaining production data centers 1 2 3 4 5 15K+ Successful commits/day Code promotions/day 200+ 600+ Feature ramps/day
  • 25. Create a culture of operational metrics “What gets measured gets fixed”
  • 26. REST API Self-service Instrumentation and Monitoring java applications non-java applications metrics collectors alerting visualization metrics api IRIS 23K Graph dashboards 10M Metrics ingested/sec 340K Alerts processed/min 600M+ Total metrics
  • 27. We don’t want a traditional NOC [permanently]
  • 28. Correlation Engine Self-service Remediation and Escalation 15K Remediation Plans Escalation Plans 9K 17K Executions/day Alerts Salt Deployment Metrics Notify (IRIS, JIRA, etc..) FeedbackNurse
  • 29. SFSan Francisco SNVSunnyvale BLR Bangalore NYC New York City SRE SRE Globally Today 400+ SREs across four global offices Composed of Software, Database, Security, and Infrastructure Engineering generalists that make LinkedIn work
  • 30. Embedded SRE Engagement Model Partner with application development teams leveraging metrics, SLOs, and KPIs Involved from software inception to decommission Participate in sprints, attend regular staff meetings and sit with the development teams Contribute to code base: bug fixes, instrumentation, logging, improve efficiency, resilience and scaling Participate in on-call rotation for critical issues along with development team Define production-readiness and overall operability requirements
  • 31. Engineering Culture Act like an owner Build Leverage Reduce MTTR Automate Everything Measure Everything Protect Member Data
  • 32. Change Velocity vs. Member Impact Velocity Major Medium
  • 33. Change Velocity vs. Member Impact Velocity Major Medium Faster Safer
  • 34. Take-Home Points • Values Drive Culture & Practice • Organize to Support Intent • Measure Everything
  翻译: