How to Create A Cloud Dataflow Pipeline Using Java and Apache Maven
This post will explain how to create a simple Maven project with the Apache Beam SDK in order to run a pipeline on Google Cloud Dataflow service. One advantage to use Maven, is that this tool will let you manage external dependencies for the Java project, making it ideal for automation processes.
This project execute a very simple example where two strings "Hello" and "World" are the inputs and transformed to upper case on GCP Dataflow, the output is presented on console log.
Disclaimer: Purpose of this post is to present steps to create a Data pipeline using Dataflow on GCP, Java code syntax is not going to be discussed and is beyond this scope. Hope to make in the future some specific tutorials on this.
Read full article from How to Create A Cloud Dataflow Pipeline Using Java and Apache Maven
No comments:
Post a Comment