Home Page/Posts/Use PySpark for Data Clean up/Use PySpark for Data Clean up2024-06-06·1 min· loading · loading · LikeNaci SimsekDocker Hadoop Data Engineering tutorial hdfs hive mapreduce postgres catalogUse PySpark for Data Clean up#In this article, we will be cleaning up a dirty data by using PySpark RelatedAirflow Introduction Pipeline2024-06-09·1 min· loading · loading Naci SimsekDocker Hadoop Data Engineering tutorial hdfs hive mapreduce postgres catalogChange Data Capture (CDC) Pipeline Implementation2024-06-09·1 min· loading · loading Naci SimsekDocker Hadoop Data Engineering tutorial hdfs hive mapreduce postgres catalogElasticsearch Indexing and Kibana Dashboard with PySpark2024-06-09·1 min· loading · loading Naci SimsekDocker Hadoop Data Engineering tutorial hdfs hive mapreduce postgres catalogOptimizing Spark Applications2024-06-09·1 min· loading · loading Naci SimsekDocker Hadoop Data Engineering tutorial hdfs hive mapreduce postgres catalogProcessing Complex Nested JSON File with Spark2024-06-09·1 min· loading · loading Naci SimsekDocker Hadoop Data Engineering tutorial hdfs hive mapreduce postgres catalogSpark Streaming Hands On from/to Kafka2024-06-09·1 min· loading · loading Naci SimsekDocker Hadoop Data Engineering tutorial hdfs hive mapreduce postgres catalog