Home Page/Posts/Use PySpark for Data Clean up/Use PySpark for Data Clean up2024-06-06·1 min· loading · loading Naci SimsekDocker Hadoop Data Engineering Tutorial Hdfs Hive Mapreduce Postgres CatalogTable of ContentsUse PySpark for Data Clean upTable of ContentsUse PySpark for Data Clean upUse PySpark for Data Clean up#In this article, we will be cleaning up a dirty data by using PySpark RelatedAirflow Introduction Pipeline1 min· loading · loading Naci SimsekDocker Hadoop Data Engineering Tutorial Hdfs Hive Mapreduce Postgres CatalogChange Data Capture (CDC) Pipeline Implementation1 min· loading · loading Naci SimsekDocker Hadoop Data Engineering Tutorial Hdfs Hive Mapreduce Postgres CatalogElasticsearch Indexing and Kibana Dashboard with PySpark1 min· loading · loading Naci SimsekDocker Hadoop Data Engineering Tutorial Hdfs Hive Mapreduce Postgres CatalogOptimizing Spark Applications1 min· loading · loading Naci SimsekDocker Hadoop Data Engineering Tutorial Hdfs Hive Mapreduce Postgres CatalogProcessing Complex Nested JSON File with Spark1 min· loading · loading Naci SimsekDocker Hadoop Data Engineering Tutorial Hdfs Hive Mapreduce Postgres CatalogSpark Streaming Hands On from/to Kafka1 min· loading · loading Naci SimsekDocker Hadoop Data Engineering Tutorial Hdfs Hive Mapreduce Postgres Catalog
Airflow Introduction Pipeline1 min· loading · loading Naci SimsekDocker Hadoop Data Engineering Tutorial Hdfs Hive Mapreduce Postgres Catalog
Change Data Capture (CDC) Pipeline Implementation1 min· loading · loading Naci SimsekDocker Hadoop Data Engineering Tutorial Hdfs Hive Mapreduce Postgres Catalog
Elasticsearch Indexing and Kibana Dashboard with PySpark1 min· loading · loading Naci SimsekDocker Hadoop Data Engineering Tutorial Hdfs Hive Mapreduce Postgres Catalog
Optimizing Spark Applications1 min· loading · loading Naci SimsekDocker Hadoop Data Engineering Tutorial Hdfs Hive Mapreduce Postgres Catalog
Processing Complex Nested JSON File with Spark1 min· loading · loading Naci SimsekDocker Hadoop Data Engineering Tutorial Hdfs Hive Mapreduce Postgres Catalog
Spark Streaming Hands On from/to Kafka1 min· loading · loading Naci SimsekDocker Hadoop Data Engineering Tutorial Hdfs Hive Mapreduce Postgres Catalog