SlideShare a Scribd company logo
Data Engineer CareerTalk
WeCloudData
@WeCloudData @WeCloudData tordatascience
weclouddata
WeCloudData tordatascience
2013 2014 2015 20172016 2018 2019
Introduction
Edwin Guo
Introduction
Agenda
Data Engineer
What is Data Engineer?
Data engineers are mainly tasked with transforming data into a format that can be
easily analyzed. They do this by developing, maintaining, and testing infrastructures
for data generation. Data engineers work closely with data scientists and are largely in
charge of architecting solutions for data scientists that enable them to do their jobs.
In addition, data engineers possess a plethora of technical skills and the ability to
approach problems in a creative manner.
Data Engineer
What is Data Engineer?
Data Engineer
What is Data Engineer?
Data Engineer
What does Data Engineer do?
Data Engineer
What does Data Engineer do?
Batch mode
Twitter API
Kinesis
Redshift
Data Engineer
What does Data Engineer do?
Streaming Mode
Data Engineer
What is Data Engineer’s
responsibility?
Data Engineer
What is Data Engineer’s required Skills?
Data Engineer
What is Data Engineer’s required Skills?
Data Engineer
What is Data Engineer’s required Skills?
Data Science
DW vs DL vs DM
Banking Telecom Consulting Startups
Data Engineer
Hiring Companies
Data Engineer
Required skills
Requirements:
! Bachelor's degree in Computer Science/Engineering or equivalent experience. Master’s degree
preferred. 
! Experience with large-scale distributed systems , Microservice and service-oriented
architectures.
! Extensive experience with Amazon AWS, and other cloud offerings.
! Strong development skills in Scala, Java, Python and/or C++.
! Experience with caching technologies using Redis, Memcached.
! Knowledge of various databases / database technologies - Oracle, Postgres, Cassandra
(NoSQL).
! Exposure to implementing real-time streaming data pipelines on large volumes of data using
Kafka, Spark. 
! Experience with Data Processing (ETL, Data Warehousing, etc.) 
! Big Data technologies and languages (Pig, Hive, Spark, Hadoop). 
! Familiarity with version control software, such as Git.
! Highly proficient in Object Oriented Design and Development. 
! Experience in Automation and Load Testing Frameworks. Build, test, and maintain optimal data
pipeline architecture
! Assemble large, complex data sets to meet both functional and non-functional requirement
! Build the infrastructure necessary for optimal extraction, transformation, and loading of
data. Identify, design, implement, and enhance internal processes
Unleash your data potential!
Senior Position
$120K - $140K
Expert
$150K+
Entry Level
$70K - $80K
Experienced
$90K - $110K
WeCloudData offers data Engineer accelerator program. We
specialize in teaching the newest open source tools and
techniques such as Hadoop, Spark, Python, Machine
Learning, Deep Learning, and Cloud.
Introduction
TYPE OF DATA JOB SEEKERS
传感器器
数据
机器器学
习
⼈人⼯工智
能
机器器⼈人
⾏行行动
触发器器
Ad

More Related Content

What's hot (20)

Big Data Adavnced Analytics on Microsoft Azure
Big Data Adavnced Analytics on Microsoft AzureBig Data Adavnced Analytics on Microsoft Azure
Big Data Adavnced Analytics on Microsoft Azure
Mark Tabladillo
 
Building Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics PrimerBuilding Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics Primer
Databricks
 
Global AI Bootcamp Madrid - Azure Databricks
Global AI Bootcamp Madrid - Azure DatabricksGlobal AI Bootcamp Madrid - Azure Databricks
Global AI Bootcamp Madrid - Azure Databricks
Alberto Diaz Martin
 
201905 Azure Databricks for Machine Learning
201905 Azure Databricks for Machine Learning201905 Azure Databricks for Machine Learning
201905 Azure Databricks for Machine Learning
Mark Tabladillo
 
Databricks Overview for MLOps
Databricks Overview for MLOpsDatabricks Overview for MLOps
Databricks Overview for MLOps
Databricks
 
Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist
SoftServe
 
Data engineering
Data engineeringData engineering
Data engineering
Parimala Killada
 
Building a MLOps Platform Around MLflow to Enable Model Productionalization i...
Building a MLOps Platform Around MLflow to Enable Model Productionalization i...Building a MLOps Platform Around MLflow to Enable Model Productionalization i...
Building a MLOps Platform Around MLflow to Enable Model Productionalization i...
Databricks
 
Data Science and Enterprise Engineering with Michael Finger and Chris Robison
Data Science and Enterprise Engineering with Michael Finger and Chris RobisonData Science and Enterprise Engineering with Michael Finger and Chris Robison
Data Science and Enterprise Engineering with Michael Finger and Chris Robison
Databricks
 
DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptx
Databricks
 
Intro to Delta Lake
Intro to Delta LakeIntro to Delta Lake
Intro to Delta Lake
Databricks
 
Building data "Py-pelines"
Building data "Py-pelines"Building data "Py-pelines"
Building data "Py-pelines"
Rob Winters
 
Ai & Data Analytics 2018 - Azure Databricks for data scientist
Ai & Data Analytics 2018 - Azure Databricks for data scientistAi & Data Analytics 2018 - Azure Databricks for data scientist
Ai & Data Analytics 2018 - Azure Databricks for data scientist
Alberto Diaz Martin
 
Data Engineering for Data Scientists
Data Engineering for Data Scientists Data Engineering for Data Scientists
Data Engineering for Data Scientists
jlacefie
 
MLCommons: Better ML for Everyone
MLCommons: Better ML for EveryoneMLCommons: Better ML for Everyone
MLCommons: Better ML for Everyone
Databricks
 
Deep Learning for Large-Scale Online Fraud Detection—Fighting Fraudsters Amon...
Deep Learning for Large-Scale Online Fraud Detection—Fighting Fraudsters Amon...Deep Learning for Large-Scale Online Fraud Detection—Fighting Fraudsters Amon...
Deep Learning for Large-Scale Online Fraud Detection—Fighting Fraudsters Amon...
Databricks
 
Data Science Career Insights by WeCloudData
Data Science Career Insights by WeCloudDataData Science Career Insights by WeCloudData
Data Science Career Insights by WeCloudData
WeCloudData
 
Spark as a Service with Azure Databricks
Spark as a Service with Azure DatabricksSpark as a Service with Azure Databricks
Spark as a Service with Azure Databricks
Lace Lofranco
 
Summary introduction to data engineering
Summary introduction to data engineeringSummary introduction to data engineering
Summary introduction to data engineering
Novita Sari
 
Implementing and running a secure datalake from the trenches
Implementing and running a secure datalake from the trenches Implementing and running a secure datalake from the trenches
Implementing and running a secure datalake from the trenches
DataWorks Summit
 
Big Data Adavnced Analytics on Microsoft Azure
Big Data Adavnced Analytics on Microsoft AzureBig Data Adavnced Analytics on Microsoft Azure
Big Data Adavnced Analytics on Microsoft Azure
Mark Tabladillo
 
Building Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics PrimerBuilding Lakehouses on Delta Lake with SQL Analytics Primer
Building Lakehouses on Delta Lake with SQL Analytics Primer
Databricks
 
Global AI Bootcamp Madrid - Azure Databricks
Global AI Bootcamp Madrid - Azure DatabricksGlobal AI Bootcamp Madrid - Azure Databricks
Global AI Bootcamp Madrid - Azure Databricks
Alberto Diaz Martin
 
201905 Azure Databricks for Machine Learning
201905 Azure Databricks for Machine Learning201905 Azure Databricks for Machine Learning
201905 Azure Databricks for Machine Learning
Mark Tabladillo
 
Databricks Overview for MLOps
Databricks Overview for MLOpsDatabricks Overview for MLOps
Databricks Overview for MLOps
Databricks
 
Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist
SoftServe
 
Building a MLOps Platform Around MLflow to Enable Model Productionalization i...
Building a MLOps Platform Around MLflow to Enable Model Productionalization i...Building a MLOps Platform Around MLflow to Enable Model Productionalization i...
Building a MLOps Platform Around MLflow to Enable Model Productionalization i...
Databricks
 
Data Science and Enterprise Engineering with Michael Finger and Chris Robison
Data Science and Enterprise Engineering with Michael Finger and Chris RobisonData Science and Enterprise Engineering with Michael Finger and Chris Robison
Data Science and Enterprise Engineering with Michael Finger and Chris Robison
Databricks
 
DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptx
Databricks
 
Intro to Delta Lake
Intro to Delta LakeIntro to Delta Lake
Intro to Delta Lake
Databricks
 
Building data "Py-pelines"
Building data "Py-pelines"Building data "Py-pelines"
Building data "Py-pelines"
Rob Winters
 
Ai & Data Analytics 2018 - Azure Databricks for data scientist
Ai & Data Analytics 2018 - Azure Databricks for data scientistAi & Data Analytics 2018 - Azure Databricks for data scientist
Ai & Data Analytics 2018 - Azure Databricks for data scientist
Alberto Diaz Martin
 
Data Engineering for Data Scientists
Data Engineering for Data Scientists Data Engineering for Data Scientists
Data Engineering for Data Scientists
jlacefie
 
MLCommons: Better ML for Everyone
MLCommons: Better ML for EveryoneMLCommons: Better ML for Everyone
MLCommons: Better ML for Everyone
Databricks
 
Deep Learning for Large-Scale Online Fraud Detection—Fighting Fraudsters Amon...
Deep Learning for Large-Scale Online Fraud Detection—Fighting Fraudsters Amon...Deep Learning for Large-Scale Online Fraud Detection—Fighting Fraudsters Amon...
Deep Learning for Large-Scale Online Fraud Detection—Fighting Fraudsters Amon...
Databricks
 
Data Science Career Insights by WeCloudData
Data Science Career Insights by WeCloudDataData Science Career Insights by WeCloudData
Data Science Career Insights by WeCloudData
WeCloudData
 
Spark as a Service with Azure Databricks
Spark as a Service with Azure DatabricksSpark as a Service with Azure Databricks
Spark as a Service with Azure Databricks
Lace Lofranco
 
Summary introduction to data engineering
Summary introduction to data engineeringSummary introduction to data engineering
Summary introduction to data engineering
Novita Sari
 
Implementing and running a secure datalake from the trenches
Implementing and running a secure datalake from the trenches Implementing and running a secure datalake from the trenches
Implementing and running a secure datalake from the trenches
DataWorks Summit
 

Similar to Data Engineer Intro - WeCloudData (20)

2010/08 - Database Architechs - Presentation
2010/08 - Database Architechs - Presentation2010/08 - Database Architechs - Presentation
2010/08 - Database Architechs - Presentation
Database Architechs
 
2010/10 - Database Architechs presentation
2010/10 - Database Architechs presentation2010/10 - Database Architechs presentation
2010/10 - Database Architechs presentation
Database Architechs
 
Enabling Data centric Teams
Enabling Data centric TeamsEnabling Data centric Teams
Enabling Data centric Teams
Data Con LA
 
Zahid Resume.pdf
Zahid Resume.pdfZahid Resume.pdf
Zahid Resume.pdf
MDZAHEDULISLAM14
 
Full Stack | Android | Web | PHP Developement | SEO | Digital Marketing Cour...
Full Stack | Android | Web | PHP Developement  | SEO | Digital Marketing Cour...Full Stack | Android | Web | PHP Developement  | SEO | Digital Marketing Cour...
Full Stack | Android | Web | PHP Developement | SEO | Digital Marketing Cour...
DSS Academy
 
Sriramjasti
SriramjastiSriramjasti
Sriramjasti
Sriram Jasti
 
The Basics of Data Engineering with IABAC
The Basics of Data Engineering with IABACThe Basics of Data Engineering with IABAC
The Basics of Data Engineering with IABAC
IABAC
 
2010/10 - Database Architechs Consulting Services Summary
2010/10 - Database Architechs Consulting Services Summary2010/10 - Database Architechs Consulting Services Summary
2010/10 - Database Architechs Consulting Services Summary
Database Architechs
 
🔥 Top 5 Skills For Data Engineer In 2023 | Data Engineer Skills Required For ...
🔥 Top 5 Skills For Data Engineer In 2023 | Data Engineer Skills Required For ...🔥 Top 5 Skills For Data Engineer In 2023 | Data Engineer Skills Required For ...
🔥 Top 5 Skills For Data Engineer In 2023 | Data Engineer Skills Required For ...
Simplilearn
 
NAVEED KAMAL SHAH
NAVEED KAMAL SHAHNAVEED KAMAL SHAH
NAVEED KAMAL SHAH
NAVEED SHAH (MCPS,MCSA, Microsoft Azure Certified)
 
Mani_Sagar_ETL
Mani_Sagar_ETLMani_Sagar_ETL
Mani_Sagar_ETL
Mani Sagar
 
Chandan's_Resume
Chandan's_ResumeChandan's_Resume
Chandan's_Resume
Chandan Das
 
How to Build a Data Engineering Career | IABAC
How to Build a Data Engineering Career | IABACHow to Build a Data Engineering Career | IABAC
How to Build a Data Engineering Career | IABAC
vamshit5
 
omar_alhussein_final_cv
omar_alhussein_final_cvomar_alhussein_final_cv
omar_alhussein_final_cv
Omar AlHussein
 
Rajeev kumar apache_spark & scala developer
Rajeev kumar apache_spark & scala developerRajeev kumar apache_spark & scala developer
Rajeev kumar apache_spark & scala developer
Rajeev Kumar
 
Imran_SAP_BI_BW_BODS_RESUME
Imran_SAP_BI_BW_BODS_RESUMEImran_SAP_BI_BW_BODS_RESUME
Imran_SAP_BI_BW_BODS_RESUME
IMRAN KHAN
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
Eric Kavanagh
 
2010/10 - Database Architechs - Data Services Summary
2010/10 - Database Architechs - Data Services Summary2010/10 - Database Architechs - Data Services Summary
2010/10 - Database Architechs - Data Services Summary
Database Architechs
 
Best Software Coaching Institute in Hyderabad
Best Software Coaching Institute in HyderabadBest Software Coaching Institute in Hyderabad
Best Software Coaching Institute in Hyderabad
seodigimarket4712
 
Senthilkumar_SQL_New
Senthilkumar_SQL_NewSenthilkumar_SQL_New
Senthilkumar_SQL_New
Senthilkumar Sivasankaran
 
2010/08 - Database Architechs - Presentation
2010/08 - Database Architechs - Presentation2010/08 - Database Architechs - Presentation
2010/08 - Database Architechs - Presentation
Database Architechs
 
2010/10 - Database Architechs presentation
2010/10 - Database Architechs presentation2010/10 - Database Architechs presentation
2010/10 - Database Architechs presentation
Database Architechs
 
Enabling Data centric Teams
Enabling Data centric TeamsEnabling Data centric Teams
Enabling Data centric Teams
Data Con LA
 
Full Stack | Android | Web | PHP Developement | SEO | Digital Marketing Cour...
Full Stack | Android | Web | PHP Developement  | SEO | Digital Marketing Cour...Full Stack | Android | Web | PHP Developement  | SEO | Digital Marketing Cour...
Full Stack | Android | Web | PHP Developement | SEO | Digital Marketing Cour...
DSS Academy
 
The Basics of Data Engineering with IABAC
The Basics of Data Engineering with IABACThe Basics of Data Engineering with IABAC
The Basics of Data Engineering with IABAC
IABAC
 
2010/10 - Database Architechs Consulting Services Summary
2010/10 - Database Architechs Consulting Services Summary2010/10 - Database Architechs Consulting Services Summary
2010/10 - Database Architechs Consulting Services Summary
Database Architechs
 
🔥 Top 5 Skills For Data Engineer In 2023 | Data Engineer Skills Required For ...
🔥 Top 5 Skills For Data Engineer In 2023 | Data Engineer Skills Required For ...🔥 Top 5 Skills For Data Engineer In 2023 | Data Engineer Skills Required For ...
🔥 Top 5 Skills For Data Engineer In 2023 | Data Engineer Skills Required For ...
Simplilearn
 
Mani_Sagar_ETL
Mani_Sagar_ETLMani_Sagar_ETL
Mani_Sagar_ETL
Mani Sagar
 
Chandan's_Resume
Chandan's_ResumeChandan's_Resume
Chandan's_Resume
Chandan Das
 
How to Build a Data Engineering Career | IABAC
How to Build a Data Engineering Career | IABACHow to Build a Data Engineering Career | IABAC
How to Build a Data Engineering Career | IABAC
vamshit5
 
omar_alhussein_final_cv
omar_alhussein_final_cvomar_alhussein_final_cv
omar_alhussein_final_cv
Omar AlHussein
 
Rajeev kumar apache_spark & scala developer
Rajeev kumar apache_spark & scala developerRajeev kumar apache_spark & scala developer
Rajeev kumar apache_spark & scala developer
Rajeev Kumar
 
Imran_SAP_BI_BW_BODS_RESUME
Imran_SAP_BI_BW_BODS_RESUMEImran_SAP_BI_BW_BODS_RESUME
Imran_SAP_BI_BW_BODS_RESUME
IMRAN KHAN
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
Eric Kavanagh
 
2010/10 - Database Architechs - Data Services Summary
2010/10 - Database Architechs - Data Services Summary2010/10 - Database Architechs - Data Services Summary
2010/10 - Database Architechs - Data Services Summary
Database Architechs
 
Best Software Coaching Institute in Hyderabad
Best Software Coaching Institute in HyderabadBest Software Coaching Institute in Hyderabad
Best Software Coaching Institute in Hyderabad
seodigimarket4712
 
Ad

More from WeCloudData (10)

Data Engineering Course Syllabus - WeCloudData
Data Engineering Course Syllabus - WeCloudDataData Engineering Course Syllabus - WeCloudData
Data Engineering Course Syllabus - WeCloudData
WeCloudData
 
Machine learning in Healthcare - WeCloudData
Machine learning in Healthcare - WeCloudDataMachine learning in Healthcare - WeCloudData
Machine learning in Healthcare - WeCloudData
WeCloudData
 
Deep Learning Introduction - WeCloudData
Deep Learning Introduction - WeCloudDataDeep Learning Introduction - WeCloudData
Deep Learning Introduction - WeCloudData
WeCloudData
 
Introduction to Machine Learning - WeCloudData
Introduction to Machine Learning - WeCloudDataIntroduction to Machine Learning - WeCloudData
Introduction to Machine Learning - WeCloudData
WeCloudData
 
Introduction to Python by WeCloudData
Introduction to Python by WeCloudDataIntroduction to Python by WeCloudData
Introduction to Python by WeCloudData
WeCloudData
 
Web scraping project aritza-compressed
Web scraping project   aritza-compressedWeb scraping project   aritza-compressed
Web scraping project aritza-compressed
WeCloudData
 
Applied Machine Learning Course - Jodie Zhu (WeCloudData)
Applied Machine Learning Course - Jodie Zhu (WeCloudData)Applied Machine Learning Course - Jodie Zhu (WeCloudData)
Applied Machine Learning Course - Jodie Zhu (WeCloudData)
WeCloudData
 
Introduction to Machine Learning - WeCloudData
Introduction to Machine Learning - WeCloudDataIntroduction to Machine Learning - WeCloudData
Introduction to Machine Learning - WeCloudData
WeCloudData
 
WeCloudData Toronto Open311 Workshop - Matthew Reyes
WeCloudData Toronto Open311 Workshop - Matthew ReyesWeCloudData Toronto Open311 Workshop - Matthew Reyes
WeCloudData Toronto Open311 Workshop - Matthew Reyes
WeCloudData
 
Tordatasci meetup-precima-retail-analytics-201901
Tordatasci meetup-precima-retail-analytics-201901Tordatasci meetup-precima-retail-analytics-201901
Tordatasci meetup-precima-retail-analytics-201901
WeCloudData
 
Data Engineering Course Syllabus - WeCloudData
Data Engineering Course Syllabus - WeCloudDataData Engineering Course Syllabus - WeCloudData
Data Engineering Course Syllabus - WeCloudData
WeCloudData
 
Machine learning in Healthcare - WeCloudData
Machine learning in Healthcare - WeCloudDataMachine learning in Healthcare - WeCloudData
Machine learning in Healthcare - WeCloudData
WeCloudData
 
Deep Learning Introduction - WeCloudData
Deep Learning Introduction - WeCloudDataDeep Learning Introduction - WeCloudData
Deep Learning Introduction - WeCloudData
WeCloudData
 
Introduction to Machine Learning - WeCloudData
Introduction to Machine Learning - WeCloudDataIntroduction to Machine Learning - WeCloudData
Introduction to Machine Learning - WeCloudData
WeCloudData
 
Introduction to Python by WeCloudData
Introduction to Python by WeCloudDataIntroduction to Python by WeCloudData
Introduction to Python by WeCloudData
WeCloudData
 
Web scraping project aritza-compressed
Web scraping project   aritza-compressedWeb scraping project   aritza-compressed
Web scraping project aritza-compressed
WeCloudData
 
Applied Machine Learning Course - Jodie Zhu (WeCloudData)
Applied Machine Learning Course - Jodie Zhu (WeCloudData)Applied Machine Learning Course - Jodie Zhu (WeCloudData)
Applied Machine Learning Course - Jodie Zhu (WeCloudData)
WeCloudData
 
Introduction to Machine Learning - WeCloudData
Introduction to Machine Learning - WeCloudDataIntroduction to Machine Learning - WeCloudData
Introduction to Machine Learning - WeCloudData
WeCloudData
 
WeCloudData Toronto Open311 Workshop - Matthew Reyes
WeCloudData Toronto Open311 Workshop - Matthew ReyesWeCloudData Toronto Open311 Workshop - Matthew Reyes
WeCloudData Toronto Open311 Workshop - Matthew Reyes
WeCloudData
 
Tordatasci meetup-precima-retail-analytics-201901
Tordatasci meetup-precima-retail-analytics-201901Tordatasci meetup-precima-retail-analytics-201901
Tordatasci meetup-precima-retail-analytics-201901
WeCloudData
 
Ad

Recently uploaded (20)

Get Started with FukreyGame Today!......
Get Started with FukreyGame Today!......Get Started with FukreyGame Today!......
Get Started with FukreyGame Today!......
liononline785
 
lecture_13 tree in mmmmmmmm mmmmmfftro.pptx
lecture_13 tree in mmmmmmmm     mmmmmfftro.pptxlecture_13 tree in mmmmmmmm     mmmmmfftro.pptx
lecture_13 tree in mmmmmmmm mmmmmfftro.pptx
sarajafffri058
 
Snowflake training | Snowflake online course
Snowflake training | Snowflake online courseSnowflake training | Snowflake online course
Snowflake training | Snowflake online course
Accentfuture
 
The-Future-is-Now-Information-Technology-Trends.pptx.pdf
The-Future-is-Now-Information-Technology-Trends.pptx.pdfThe-Future-is-Now-Information-Technology-Trends.pptx.pdf
The-Future-is-Now-Information-Technology-Trends.pptx.pdf
winnt04
 
Dr. Robert Krug - Expert In Artificial Intelligence
Dr. Robert Krug - Expert In Artificial IntelligenceDr. Robert Krug - Expert In Artificial Intelligence
Dr. Robert Krug - Expert In Artificial Intelligence
Dr. Robert Krug
 
national income & related aggregates (1)(1).pptx
national income & related aggregates (1)(1).pptxnational income & related aggregates (1)(1).pptx
national income & related aggregates (1)(1).pptx
j2492618
 
DATA ANALYST and Techniques in Kochi Explore cutting-edge analytical skills ...
DATA ANALYST  and Techniques in Kochi Explore cutting-edge analytical skills ...DATA ANALYST  and Techniques in Kochi Explore cutting-edge analytical skills ...
DATA ANALYST and Techniques in Kochi Explore cutting-edge analytical skills ...
aacj102006
 
CS-404 COA COURSE FILE JAN JUN 2025.docx
CS-404 COA COURSE FILE JAN JUN 2025.docxCS-404 COA COURSE FILE JAN JUN 2025.docx
CS-404 COA COURSE FILE JAN JUN 2025.docx
nidarizvitit
 
Carbon Nanomaterials Market Size, Trends and Outlook 2024-2030
Carbon Nanomaterials Market Size, Trends and Outlook 2024-2030Carbon Nanomaterials Market Size, Trends and Outlook 2024-2030
Carbon Nanomaterials Market Size, Trends and Outlook 2024-2030
Industry Experts
 
Storage Devices and the Mechanism of Data Storage in Audio and Visual Form
Storage Devices and the Mechanism of Data Storage in Audio and Visual FormStorage Devices and the Mechanism of Data Storage in Audio and Visual Form
Storage Devices and the Mechanism of Data Storage in Audio and Visual Form
Professional Content Writing's
 
report (maam dona subject).pptxhsgwiswhs
report (maam dona subject).pptxhsgwiswhsreport (maam dona subject).pptxhsgwiswhs
report (maam dona subject).pptxhsgwiswhs
AngelPinedaTaguinod
 
Mixed Methods Research.pptx education 201
Mixed Methods Research.pptx education 201Mixed Methods Research.pptx education 201
Mixed Methods Research.pptx education 201
GraceSolaa1
 
英国学位证(利物浦约翰摩尔斯大学本科毕业证)LJMU文凭证书办理
英国学位证(利物浦约翰摩尔斯大学本科毕业证)LJMU文凭证书办理英国学位证(利物浦约翰摩尔斯大学本科毕业证)LJMU文凭证书办理
英国学位证(利物浦约翰摩尔斯大学本科毕业证)LJMU文凭证书办理
Taqyea
 
How to make impact with process mining? - PGGM
How to make impact with process mining? - PGGMHow to make impact with process mining? - PGGM
How to make impact with process mining? - PGGM
Process mining Evangelist
 
Feature Engineering for Electronic Health Record Systems
Feature Engineering for Electronic Health Record SystemsFeature Engineering for Electronic Health Record Systems
Feature Engineering for Electronic Health Record Systems
Process mining Evangelist
 
Digital Disruption Use Case_Music Industry_for students.pdf
Digital Disruption Use Case_Music Industry_for students.pdfDigital Disruption Use Case_Music Industry_for students.pdf
Digital Disruption Use Case_Music Industry_for students.pdf
ProsenjitMitra9
 
Dynamics 365 Business Rules Dynamics Dynamics
Dynamics 365 Business Rules Dynamics DynamicsDynamics 365 Business Rules Dynamics Dynamics
Dynamics 365 Business Rules Dynamics Dynamics
heyoubro69
 
Urban models for professional practice 03
Urban models for professional practice 03Urban models for professional practice 03
Urban models for professional practice 03
DanisseLoiDapdap
 
Concrete_Presenbmlkvvbvvvfvbbbfcfftation.pptx
Concrete_Presenbmlkvvbvvvfvbbbfcfftation.pptxConcrete_Presenbmlkvvbvvvfvbbbfcfftation.pptx
Concrete_Presenbmlkvvbvvvfvbbbfcfftation.pptx
ssuserd1f4a3
 
Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...
Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...
Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...
Jayantilal Bhanushali
 
Get Started with FukreyGame Today!......
Get Started with FukreyGame Today!......Get Started with FukreyGame Today!......
Get Started with FukreyGame Today!......
liononline785
 
lecture_13 tree in mmmmmmmm mmmmmfftro.pptx
lecture_13 tree in mmmmmmmm     mmmmmfftro.pptxlecture_13 tree in mmmmmmmm     mmmmmfftro.pptx
lecture_13 tree in mmmmmmmm mmmmmfftro.pptx
sarajafffri058
 
Snowflake training | Snowflake online course
Snowflake training | Snowflake online courseSnowflake training | Snowflake online course
Snowflake training | Snowflake online course
Accentfuture
 
The-Future-is-Now-Information-Technology-Trends.pptx.pdf
The-Future-is-Now-Information-Technology-Trends.pptx.pdfThe-Future-is-Now-Information-Technology-Trends.pptx.pdf
The-Future-is-Now-Information-Technology-Trends.pptx.pdf
winnt04
 
Dr. Robert Krug - Expert In Artificial Intelligence
Dr. Robert Krug - Expert In Artificial IntelligenceDr. Robert Krug - Expert In Artificial Intelligence
Dr. Robert Krug - Expert In Artificial Intelligence
Dr. Robert Krug
 
national income & related aggregates (1)(1).pptx
national income & related aggregates (1)(1).pptxnational income & related aggregates (1)(1).pptx
national income & related aggregates (1)(1).pptx
j2492618
 
DATA ANALYST and Techniques in Kochi Explore cutting-edge analytical skills ...
DATA ANALYST  and Techniques in Kochi Explore cutting-edge analytical skills ...DATA ANALYST  and Techniques in Kochi Explore cutting-edge analytical skills ...
DATA ANALYST and Techniques in Kochi Explore cutting-edge analytical skills ...
aacj102006
 
CS-404 COA COURSE FILE JAN JUN 2025.docx
CS-404 COA COURSE FILE JAN JUN 2025.docxCS-404 COA COURSE FILE JAN JUN 2025.docx
CS-404 COA COURSE FILE JAN JUN 2025.docx
nidarizvitit
 
Carbon Nanomaterials Market Size, Trends and Outlook 2024-2030
Carbon Nanomaterials Market Size, Trends and Outlook 2024-2030Carbon Nanomaterials Market Size, Trends and Outlook 2024-2030
Carbon Nanomaterials Market Size, Trends and Outlook 2024-2030
Industry Experts
 
Storage Devices and the Mechanism of Data Storage in Audio and Visual Form
Storage Devices and the Mechanism of Data Storage in Audio and Visual FormStorage Devices and the Mechanism of Data Storage in Audio and Visual Form
Storage Devices and the Mechanism of Data Storage in Audio and Visual Form
Professional Content Writing's
 
report (maam dona subject).pptxhsgwiswhs
report (maam dona subject).pptxhsgwiswhsreport (maam dona subject).pptxhsgwiswhs
report (maam dona subject).pptxhsgwiswhs
AngelPinedaTaguinod
 
Mixed Methods Research.pptx education 201
Mixed Methods Research.pptx education 201Mixed Methods Research.pptx education 201
Mixed Methods Research.pptx education 201
GraceSolaa1
 
英国学位证(利物浦约翰摩尔斯大学本科毕业证)LJMU文凭证书办理
英国学位证(利物浦约翰摩尔斯大学本科毕业证)LJMU文凭证书办理英国学位证(利物浦约翰摩尔斯大学本科毕业证)LJMU文凭证书办理
英国学位证(利物浦约翰摩尔斯大学本科毕业证)LJMU文凭证书办理
Taqyea
 
How to make impact with process mining? - PGGM
How to make impact with process mining? - PGGMHow to make impact with process mining? - PGGM
How to make impact with process mining? - PGGM
Process mining Evangelist
 
Feature Engineering for Electronic Health Record Systems
Feature Engineering for Electronic Health Record SystemsFeature Engineering for Electronic Health Record Systems
Feature Engineering for Electronic Health Record Systems
Process mining Evangelist
 
Digital Disruption Use Case_Music Industry_for students.pdf
Digital Disruption Use Case_Music Industry_for students.pdfDigital Disruption Use Case_Music Industry_for students.pdf
Digital Disruption Use Case_Music Industry_for students.pdf
ProsenjitMitra9
 
Dynamics 365 Business Rules Dynamics Dynamics
Dynamics 365 Business Rules Dynamics DynamicsDynamics 365 Business Rules Dynamics Dynamics
Dynamics 365 Business Rules Dynamics Dynamics
heyoubro69
 
Urban models for professional practice 03
Urban models for professional practice 03Urban models for professional practice 03
Urban models for professional practice 03
DanisseLoiDapdap
 
Concrete_Presenbmlkvvbvvvfvbbbfcfftation.pptx
Concrete_Presenbmlkvvbvvvfvbbbfcfftation.pptxConcrete_Presenbmlkvvbvvvfvbbbfcfftation.pptx
Concrete_Presenbmlkvvbvvvfvbbbfcfftation.pptx
ssuserd1f4a3
 
Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...
Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...
Day 1 MS Excel Basics #.pptxDay 1 MS Excel Basics #.pptxDay 1 MS Excel Basics...
Jayantilal Bhanushali
 

Data Engineer Intro - WeCloudData

  • 1. Data Engineer CareerTalk WeCloudData @WeCloudData @WeCloudData tordatascience weclouddata WeCloudData tordatascience
  • 2. 2013 2014 2015 20172016 2018 2019 Introduction Edwin Guo
  • 4. Data Engineer What is Data Engineer? Data engineers are mainly tasked with transforming data into a format that can be easily analyzed. They do this by developing, maintaining, and testing infrastructures for data generation. Data engineers work closely with data scientists and are largely in charge of architecting solutions for data scientists that enable them to do their jobs. In addition, data engineers possess a plethora of technical skills and the ability to approach problems in a creative manner.
  • 5. Data Engineer What is Data Engineer?
  • 6. Data Engineer What is Data Engineer?
  • 7. Data Engineer What does Data Engineer do?
  • 8. Data Engineer What does Data Engineer do? Batch mode
  • 9. Twitter API Kinesis Redshift Data Engineer What does Data Engineer do? Streaming Mode
  • 10. Data Engineer What is Data Engineer’s responsibility?
  • 11. Data Engineer What is Data Engineer’s required Skills?
  • 12. Data Engineer What is Data Engineer’s required Skills?
  • 13. Data Engineer What is Data Engineer’s required Skills?
  • 14. Data Science DW vs DL vs DM
  • 15. Banking Telecom Consulting Startups Data Engineer Hiring Companies
  • 16. Data Engineer Required skills Requirements: ! Bachelor's degree in Computer Science/Engineering or equivalent experience. Master’s degree preferred.  ! Experience with large-scale distributed systems , Microservice and service-oriented architectures. ! Extensive experience with Amazon AWS, and other cloud offerings. ! Strong development skills in Scala, Java, Python and/or C++. ! Experience with caching technologies using Redis, Memcached. ! Knowledge of various databases / database technologies - Oracle, Postgres, Cassandra (NoSQL). ! Exposure to implementing real-time streaming data pipelines on large volumes of data using Kafka, Spark.  ! Experience with Data Processing (ETL, Data Warehousing, etc.)  ! Big Data technologies and languages (Pig, Hive, Spark, Hadoop).  ! Familiarity with version control software, such as Git. ! Highly proficient in Object Oriented Design and Development.  ! Experience in Automation and Load Testing Frameworks. Build, test, and maintain optimal data pipeline architecture ! Assemble large, complex data sets to meet both functional and non-functional requirement ! Build the infrastructure necessary for optimal extraction, transformation, and loading of data. Identify, design, implement, and enhance internal processes
  • 17. Unleash your data potential! Senior Position $120K - $140K Expert $150K+ Entry Level $70K - $80K Experienced $90K - $110K WeCloudData offers data Engineer accelerator program. We specialize in teaching the newest open source tools and techniques such as Hadoop, Spark, Python, Machine Learning, Deep Learning, and Cloud. Introduction
  • 18. TYPE OF DATA JOB SEEKERS 传感器器 数据 机器器学 习 ⼈人⼯工智 能 机器器⼈人 ⾏行行动 触发器器
  翻译: