Pathway reposted this
Now you can do Kafka ETL in Python! Kudos to this explainer and easy-to-run explainer with Pathway by Olivier Ruas, PhD. 🚀 Imagine you’ve been hired by a fraud-detection company monitoring logs from servers in New York and Paris. The logs have different time zones: You need to unify these different time zones into a single format to maintain data integrity. This is where ETL comes in. Kafka is a popular tool to build ETL pipelines that many companies use. But, it’s mainly used by Java and Scala developers, making it tricky for data scientists and ML engineers Enters Kafka ETL with Python using Pathway: Pathway, a faster framework built on Rust, can be used via a Python interface. As a Python developer, you can build ETL pipelines over Kafka in pure Python without compromising on performance. ✔️ Extract (E): Extract data streams from Kafka using Pathway Kafka input connectors. ✔️ Transform (T): Convert times with varying time zones into unified timestamps using Pathway’s datetime module. ✔️ Load (L): Load the final data stream back into Kafka. The entire script is available as a Pathway App Template, which can be run via Docker in minutes. Read the full tutorial and access the GitHub repository here: https://lnkd.in/gYB-NZwj #Pathway #Kafka #ETL #Python #DataEngineering #OpenSource
gg Saksham Goel
Informative
Enlightening
Good one!
Indeed helpful!
Quite impressive
Seems an amazing tool
Very interesting
Data Scientist @Neuron7.ai | Ex-Futures First Intern | IIT Delhi '24
3moSeems like a great tool for python developers