“Learn key Data Engineering Skills such as SQL, Python and PySpark with tons of Hands-on tasks and exercises using labs.”
截至目前,超过 40151+ 人们已经注册了这门课程,而且已经结束了 1297+ 评论.
课程内容
“Introduction about the course Getting Started with ITVersity Labs for Data Engineering Essentials on Udemy Setup Environment to learn Python, SQL, Hadoop, Spark using Docker on Windows 11 Setup Environment to learn Python, SQL, Hadoop, Spark using Docker on Windows 10 Setup Environment to learn Python, SQL, Hadoop and Spark using Docker on Mac Setting up Environment to learn Python, SQL as well as Spark using AWS Cloud9 Networking Concepts for Beginners – ip addresses and port numbers Database Essentials – Getting Started Database Essentials – Database Operations Database Essentials – Writing Basic SQL Queries Database Essentials – Creating Tables and Indexes Database Essentials – Partitioning Tables and Indexes Database Essentials – Predefined Functions Database Essentials – Writing Advanced SQL Queries Programming Essentials using Python – Perform Database Operations Programming Essentials using Python – Getting Started with Python Programming Essentials using Python – Basic Programming Constructs Programming Essentials using Python – Predefined Functions Programming Essentials using Python – User Defined Functions Programming Essentials using Python – Overview of Collections – list and set Programming Essentials using Python – Overview of Collections – dict and tuple Programming Essentials using Python – Manipulating Collections using loops Programming Essentials using Python – Development of Map Reduce APIs Programming Essentials using Python – Understanding Map Reduce Libraries Programming Essentials using Python – Basics of File IO using Python Programming Essentials using Python – Delimited Files and Collections Programming Essentials using Python – Overview of Pandas Libraries Programming Essentials using Python – Database Programming – CRUD Operations Programming Essentials using Python – Database Programming – Batch Operations Programming Essentials using Python – Processing JSON Data Programming Essentials using Python – Processing REST Payloads Understanding Python Virtual Environments Overview of Pycharm for Python Application Development Data Copier – Getting Started Data Copier – Reading Data using Pandas Data Copier – Database Programming using Pandas Data Copier – Loading Data from files to tables Data Copier – Modularizing the application Data Copier – Dockerizing the application Data Copier – Using custom Docker Image Data Copier – Deploy and Validate Application on Remote Server Validate ITVersity Hadoop and Spark Cluster (for ITVersity lab customers) Setup Single Node Hadoop and Spark Cluster or Lab using Docker Introduction to Hadoop eco system – Overview of HDFS Data Engineering using Spark SQL – Getting Started Data Engineering using Spark SQL – Basic Transformations Data Engineering using Spark SQL – Managing Tables – Basic DDL and DML Data Engineering using Spark SQL – Managing Tables – DML and Partitioning Data Engineering using Spark SQL – Overview of Spark SQL Functions Data Engineering using Spark SQL – Windowing Functions Apache Spark using Python – Data Processing Overview Apache Spark using Python – Processing Column Data Apache Spark using Python – Basic Transformations Apache Spark using Python – Joining Data Sets Apache Spark using Python – Spark Metastore Getting Started with Semi Structured Data using Spark Process Semi Structured Data using Spark Data Frame APIs Apache Spark – Development Life Cycle using Python Spark Application Execution Life Cycle and Spark UI Setup SSH Proxy to access Spark Application logs Deployment Modes of Spark Applications”
“Hands on Data Interaction using – ETL, Web Scraping ,Big Data,SQL,Power BI”
截至目前,超过 24966+ 人们已经注册了这门课程,而且已经结束了 267+ 评论.
课程内容
“ETL (Extract, Transform ,Load) environment setup Implementing ETL Process with SSIS Data Interaction with SQL (Transact-SQL) Web Scraping Installing Required Software for Web Scraping Web Scraping with Python and Beautiful Soup Web Scraping with Python and Scrapy Introduction to Big Data Data Interaction with Power BI Connecting to Web Data with Power BI Connecting and transforming database data with Power BI Data Modelling with Power BI”
“Build Data Engineering Pipelines using AWS Data Analytics Services such as Glue, EMR, Athena, Kinesis, Lambda, etc”
截至目前,超过 7995+ 人们已经注册了这门课程,而且已经结束了 661+ 评论.
课程内容
“Introduction to the course Setup Local Development Environment for AWS on Windows 10 or Windows 11 Setup Local Development Environment for AWS on Mac Setup Environment for Practice using Cloud9 AWS Getting Started with s3, IAM and CLI Storage -Deep Dive into AWS Simple Storage Service aka s3 AWS Security using IAM – Managing AWS Users, Roles and Policies using AWS IAM Infrastructure – Getting Started with AWS Elastic Cloud Compute aka EC2 Infrastructure – AWS EC2 Advanced Data Ingestion using Lambda Functions Overview of Glue Components Setup Spark History Server for Glue Jobs Deep Dive into Glue Catalog Exploring Glue Job APIs Glue Job Bookmarks Getting Started with AWS EMR Development Lifecycle for Pyspark Deploying Spark Applications using AWS EMR Streaming Pipeline using Kinesis Consuming Data from s3 using boto3 Populating GitHub Data to Dynamodb Overview of Amazon Athena Amazon Athena using AWS CLI Amazon Athena using Python boto3 Getting Started with Amazon Redshift Copy Data from s3 into Redshift Tables Develop Applications using Redshift Cluster Redshift Tables with Distkeys and Sortkeys Redshift Federated Queries and Spectrum”
“Build Data Engineering Pipelines using Databricks core features such as Spark, Delta Lake, cloudFiles, etc.”
截至目前,超过 6452+ 人们已经注册了这门课程,而且已经结束了 446+ 评论.
课程内容
Introduction to Data Engineering using Databricks Getting Started with Databricks on Azure Azure Essentials for Databricks – Azure CLI Mount ADLS on to Azure Databricks to access files from Azure Blob Storage Getting Started with Databricks on AWS AWS Essentials for Databricks – Setup Local Development Environment on Windows AWS Essentials for Databricks – Setup Local Development Environment on Mac AWS Essentials for Databricks – Overview of AWS Storage Solutions AWS Essentials for Databricks – Overview of AWS s3 and IAM Roles for Databricks AWS Essentials for Databricks – Integrating AWS s3 and Glue Catalog Setup Local Development Environment for Databricks Using Databricks CLI Spark Application Development Life Cycle Databricks Jobs and Clusters Deploy and Run Spark Applications on Databricks Deploy Spark Jobs using Notebooks Deep Dive into Delta Lake using Spark Data Frames on Databricks Deep Dive into Delta Lake using Spark SQL on Databricks Accessing Databricks Cluster Terminal via Web as well as SSH Installing Softwares on Databricks Clusters using init scripts Quick Recap of Spark Structured Streaming Incremental Loads using Spark Structured Streaming on Databricks Incremental Loads using autoLoader Cloud Files on Databricks Overview of Databricks SQL Clusters
“End to end batch processing,data orchestration and real time streaming analytics on GCP”
截至目前,超过 3527+ 人们已经注册了这门课程,而且已经结束了 406+ 评论.
课程内容
“Introduction and Overview Batch Processing and ETL using BigQuery,Spark and Airflow / Google composer Batch Data ingestion using Apache Sqoop and Apache Airflow / Google Composer Kafka Crash Course Real-Time Streaming and Analytics using Spark Structured Streaming with Kafka Real-Time Streaming with streaming files as source of data with IOT sensor data Update – BigQuery / CLoudSql – Federated Queries”
A comprehensive Data Engineering course on building streaming pipelines using Kafka and Spark Structured Streaming
截至目前,超过 971+ 人们已经注册了这门课程,而且已经结束了 56+ 评论.
课程内容
Introduction Getting Started with Kafka Data Ingestion using Kafka Connect Overview of Spark Structured Streaming Kafka and Spark Structured Streaming Integration Incremental Loads using Spark Structured Streaming Setting up Environment using AWS Cloud9 Setting up Environment – Overview of GCP and Provision Ubuntu VM Setup Single Node Hadoop Cluster Setup Hive and Spark Setup Single Node Kafka Cluster
Learn the skills to become a Data Scientist [ Data Science A – Z ]
截至目前,超过 351+ 人们已经注册了这门课程,而且已经结束了 44+ 评论.
课程内容
Setting up Python Python Theory Software Design Python Tutorials Setting up the Environment for Machine Learning Understanding Data With Statistics & Data Pre-processing Data Visualization with Python Artificial Neural Networks [Comprehensive Sessions]Naive Bayes Classifier with Python [Lecture & Demo]Linear regression Logistic regression Introduction to clustering [K – Means Clustering ]Extra Reading