Use PySpark for Data Clean up
2024-06-06 · 1 min · Naci Simsek
Docker Hadoop Data Engineering Tutorial Hdfs Hive Mapreduce Postgres Catalog

In this article, we will clean up dirty data using PySpark.

Related
Airflow Introduction Pipeline (2024-06-09)
Change Data Capture (CDC) Pipeline Implementation (2024-06-09)
Elasticsearch Indexing and Kibana Dashboard with PySpark (2024-06-09)
Optimizing Spark Applications (2024-06-09)
Processing Complex Nested JSON File with Spark (2024-06-09)
Spark Streaming Hands On from/to Kafka (2024-06-09)