This repo contains examples in Java for developers who want to know how to manipulate data. It was part of a talk named Data engineering for Java Developers (or vice versa)
$ mvn clean compile
$ mvn exec:java -Dexec.mainClass="br.nom.martinelli.ricardo.DataProcessingWithStreams"
$ mvn clean compile
$ mvn exec:java -Dexec.mainClass="br.nom.martinelli.ricardo.DataProcessingWithSpark"
$ mvn clean compile
$ mvn exec:java -Dexec.mainClass="br.nom.martinelli.ricardo.DataProcessingWithCassandra"
$ podman run -u $(id -u) -p 8080:8080 --rm -v $PWD/logs:/logs -v $PWD/notebooks:/notebook:U -v $PWD:/data:/data:U \
-e ZEPPELIN_LOG_DIR='/logs' -e ZEPPELIN_NOTEBOOK_DIR='/notebook' --name zeppelin apache/zeppelin:0.10.0