data-flow

Here are 21 public repositories matching this topic...

frankframework

Kafka consumer application that reads messages from Kafka based on the schema registered in a schema registry and enriches the data. This is a prototype showing how to evolve a schema efficiently and read messages with BACKWARD compatibility without breaking the consumers. This application can be used as a base for a stateless stream p…

  • Updated Apr 6, 2018
  • Java
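
The BACKWARD compatibility idea in the description above can be sketched without any Kafka or schema-registry dependency: a consumer on a newer schema can still read an older message as long as every added field carries a default. The snippet below is a minimal illustration in plain Java (16+), not the repository's code. `BackwardCompatSketch`, `Field`, and `resolve` are hypothetical names, and a real consumer would normally let its serialization library perform this resolution.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class BackwardCompatSketch {
    /** Hypothetical reader-schema field: a name plus the default used when the writer omitted it. */
    record Field(String name, Object defaultValue) {}

    /**
     * Projects a decoded message onto the reader schema.
     * Fields missing from the message take the reader's default;
     * fields the reader does not know about are dropped.
     */
    static Map<String, Object> resolve(Map<String, Object> written, List<Field> readerSchema) {
        Map<String, Object> out = new LinkedHashMap<>();
        for (Field f : readerSchema) {
            out.put(f.name(), written.containsKey(f.name()) ? written.get(f.name()) : f.defaultValue());
        }
        return out;
    }

    public static void main(String[] args) {
        // Version 2 of the (assumed) schema adds "country" with a default value.
        List<Field> v2 = List.of(new Field("id", null), new Field("country", "unknown"));
        // A message produced under version 1, before "country" existed.
        Map<String, Object> oldMessage = Map.of("id", 42);
        System.out.println(resolve(oldMessage, v2)); // prints {id=42, country=unknown}
    }
}
```

Because the new field has a default, the upgraded consumer keeps working on messages written before the schema changed, which is the property BACKWARD compatibility is meant to guarantee.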

The Data Pulse pipeline processes and transforms web-scraped pageviews using Apache Beam and Google Cloud Dataflow. It reads JSON lines, parses them into PageView objects, filters for "product" post types, enriches with country info, and writes to Google BigQuery. Robust logging and error handling ensure data integrity.

  • Updated Jul 2, 2024
  • Java
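
The stages listed in the description above (parse, filter, enrich) can be sketched in plain Java (16+) with streams standing in for Beam transforms. This is an illustration under stated assumptions, not the repository's pipeline: the JSON field names `url`, `post_type`, and `country_code`, the `COUNTRIES` lookup table, and the regex-based field extractor are all hypothetical, and the BigQuery write is replaced by returning a list.

```java
import java.util.List;
import java.util.Map;
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;
import java.util.stream.Collectors;

public class PageViewSketch {
    /** Hypothetical shape of a page view; countryName is null until enrichment. */
    record PageView(String url, String postType, String countryCode, String countryName) {}

    /** Assumed lookup table standing in for the real country-info source. */
    static final Map<String, String> COUNTRIES = Map.of("US", "United States", "DE", "Germany");

    /** Naive extractor for flat JSON string fields; a real pipeline would use a JSON library. */
    static Optional<String> field(String json, String key) {
        Matcher m = Pattern.compile("\"" + Pattern.quote(key) + "\"\\s*:\\s*\"([^\"]*)\"").matcher(json);
        return m.find() ? Optional.of(m.group(1)) : Optional.empty();
    }

    /** Parses one JSON line; malformed lines are logged and skipped rather than failing the run. */
    static Optional<PageView> parse(String line) {
        Optional<String> url = field(line, "url");
        Optional<String> type = field(line, "post_type");
        Optional<String> code = field(line, "country_code");
        if (url.isEmpty() || type.isEmpty() || code.isEmpty()) {
            System.err.println("Skipping malformed line: " + line);
            return Optional.empty();
        }
        return Optional.of(new PageView(url.get(), type.get(), code.get(), null));
    }

    /** Adds the country name, falling back to "Unknown" for codes missing from the table. */
    static PageView enrich(PageView pv) {
        String name = COUNTRIES.getOrDefault(pv.countryCode(), "Unknown");
        return new PageView(pv.url(), pv.postType(), pv.countryCode(), name);
    }

    /** Parse, keep only "product" post types, then enrich. */
    static List<PageView> run(List<String> lines) {
        return lines.stream()
                .map(PageViewSketch::parse)
                .flatMap(Optional::stream)
                .filter(pv -> "product".equals(pv.postType()))
                .map(PageViewSketch::enrich)
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        List<String> lines = List.of(
                "{\"url\":\"/a\",\"post_type\":\"product\",\"country_code\":\"US\"}",
                "{\"url\":\"/b\",\"post_type\":\"blog\",\"country_code\":\"DE\"}",
                "not json");
        run(lines).forEach(System.out::println);
    }
}
```

In a Beam pipeline each of these steps would typically be its own transform, with malformed records routed to a separate output instead of standard error, so that bad input never silently disappears.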
