fbpx
 

Data Engineering

Designed for

Session Duration

3 hrs

Price

Available on request

Duration

3 months

Session Duration

3 hrs

Price

TBA

Duration

3 months

‘Data Engineering’ is a customized but focused course to help students and developers achieve their potential for Data Engineering. Data engineers build systems that collect, manage, and convert raw data into usable information for data scientists and business analysts to interpret. While working with Python, you’ll also grow your skill in other languages, such as Shell and SQL, create data engineering pipelines, automate common file system tasks, and build a high-performance database.

Who should attend:

  • Students from CS, IT or Software background are welcome.
  • Anyone interested in this cutting-edge science..

Objectives:

  • Build skills in programming
  • Deploying application on multiple clouds
  • Data Acquisition from various platforms

Learning Outcome:

Upon successful completion of this program, participants will be able to:

Knowledge

  • Practical understanding of the Data Engineering Ecosystem and Lifecycle.
  • Apply critical concepts of data acquisition techniques.
  • Understand different data storage tools and options e.g files, SQL and NoSQL.
  • learn the scale of applications in terms of deployment.
  • Guidance and counseling from professionals in managing a career in this field.

Skills

  • Python programming basics including data structures, logic, working with files, invoking APIs, using libraries such as Pandas and Numpy, doing ETL.
  • SQL query language, SELECT, INSERT, UPDATE, DELETE statements, database functions, stored procs, working with multiple tables, JOINs, & transactions.
  • Schedule, automate, and monitor data pipelines using Apache Airflow. Run data quality checks, track data lineage, and work with data pipelines in production.
  • Orchestrate Data with Airflow.

Course Name

Class Timings

Course Location

Course Level

Entry Requirements

Accreditation/Awarding
Body

Data Engineering

As Communicated 

Online/Blended

Learner / Professional / Expert Level

There are no formal requirements for this certification.

CET and ICE

Course Name

Data engineering

Class Timings

As Communicated 

Course Location

Online/Blended

Course Level

Learner/ Professional/ Expert Level

Accreditation/Awarding Body

CET and ICE

Entry Requirements

There are no formal requirements for this certification.

Outlines

The list below provides a overview of the topics covered in this certification.

  • Python Basics
  • Python Data Structures
  • Working with Data in Python
  • Basics of Python + Code Best Practices using Python
  • Git (basic implementation & advance concepts)
  • Fetching & Posting data using apis (postman/python)
  • Hands-on: Python Request Library
  • Authentication for APIs
  • Oauth 2.0 flow
  • Hands-on: Fetching data from Youtube Data API
  • Hands-on: Google Sheets API
  • Any public API listed here
  • Why scrape the web?
  • How Does Web Scraping Work?
  • Techniques to scrape data from web
  • Hands-on: Web Scraping
  • Custom web Scraper (Bs4, requests)
  • Introduction to Scrapy
  • Shell Scripting
  • Linux & Command Line Basics
  • Linux shell: Setup
  • What is the shell?
  • Navigating, Creation and Migration commands
  • Differentiate and Pipes
  • Job Scheduling
    • Crontab in Linux
    • Scheduling Cron jobs: Linux
    • Scheduling Tasks with At Utility: Windows
  • Relational Database Concepts
  • Using Relational Databases
  • MySQL and PostgreSQL
  • Introduction to Cloud databases
  • Cloud Storage types and uses cases
  • Getting data from cloud database
  • Getting Started with SQL
  • Introduction to Relational Databases and Tables
  • Intermediate SQL
  • Accessing Databases using Python
  • Introduction to data pipelines
  • ELT Basics
  • Comparing ETL & ELT
  • Data Extraction Techniques
  • Introduction to Data Transformation Techniques
  • ETL using Shell Scripting
  • Batch Versus Streaming Data Pipeline Use Cases
  • Use Case: Building an Advanced ETL pipeline
  • Apache Airflow Overview
  • Advantage of Using Data Pipelines as DAGs
  • Apache Airflow UI
  • Build DAG Using Airflow
  • Airflow Monitoring and Logging
  • Hands-on: Writing DAG
  • Distributed Event Streaming Platform Components
  • Apache Kafka Overview
  • Building Event Streaming Pipelines using Kafka
  • Kafka Streaming Process
  • Data Warehouses, Data Marts, and Data Lakes
  • Designing, Modeling and Implementing Data Warehouses
  • Data Warehouse Analytics
  • Introducing NoSQL
  • Introducing MongoDB – An Open-Source NoSQL Database
  • Introducing Apache Cassandra – An Open-Source NoSQL Database
  • Hands-on: Working with NoSQL Databases
  • Introduction to Hadoop and Spark
  • introduction to MapRedude
  • Spark for Data Engineering
  • SparkML

Exams & Certification

One hour ‘closed book’ with 40 multiple choice questions
Pass mark is 65% (26/40)

Tutors

All leading professionals and academics from across the globe 

Yasir Qayam

Project Management Consultant at Yas Engineering Solutions
B.E Mechanical, Queen Mary University

Certificate-Icon.png

Certificate

Upon completion of the Ethical Hacking & Penetration Testing course, you will also receive the certificate awarded by ICE

All certificate images are for illustrative purposes only and may be subject to change at the discretion of ICE.

Certificate

Upon completion of the Ethical Hacking & Penetration Testing course, you will also receive the certificate awarded by ICE

All certificate images are for illustrative purposes only and may be subject to change at the discretion of ICE.

More Questions?