This archetype supports writing Scray-compatible jobs by creating:
- the directory structure
- a pom.xml that builds an uber-jar for the job
- a bin directory with a shell script to start the jobs
- a minimal job skeleton
| Java | Scala | Spark | Hadoop |
|------|-------|-------|--------|
| 17   | 2.13  | 3.5.0 | 3.4    |
To use the archetype artifacts, they must either be pulled from an archetype repository or installed in the local Maven repository:
```shell
mvn clean install
```
Archetypes are Maven plugins that can generate new projects. To use them, the usual Maven coordinates of the archetype must be provided:
```shell
mvn archetype:generate \
  -DarchetypeGroupId=org.scray \
  -DarchetypeArtifactId=scray-archetype \
  -DarchetypeVersion=1.1.6-SNAPSHOT
```
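If the coordinates of the project to be generated are known in advance, the interactive prompts can be skipped by running the plugin in batch mode (`-B`). A sketch; the `groupId`, `artifactId`, `version`, and `package` values below are placeholders, not defaults supplied by the archetype:

```shell
# Generate a project non-interactively; adjust the -D values to your project.
mvn archetype:generate -B \
  -DarchetypeGroupId=org.scray \
  -DarchetypeArtifactId=scray-archetype \
  -DarchetypeVersion=1.1.6-SNAPSHOT \
  -DgroupId=com.example \
  -DartifactId=my-scray-job \
  -Dversion=0.1.0-SNAPSHOT \
  -Dpackage=com.example.jobs
```

Afterwards the new project is available in the `my-scray-job` directory.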
A local Spark node will be started and the job will be executed on this node.
Example:

```shell
./bin/submit-job.sh \
  --local-mode \
  --master spark://<YOUR_LOCAL_IP>:7077 \
  --total-executor-cores 4 \
  -b \
  -m spark://127.0.0.1:7077
```
The runner script requires the option `--master` with the Spark master URL and the option `--total-executor-cores` with the number of executor cores.
For the URL of the master there are several options:
- `spark://<IP>:<Port>` (standalone master; default port 7077)
- `yarn-client` (run the job with a local client but execute on a Hadoop YARN cluster of Spark workers)
- `yarn-cluster` (run both the client and the workers of the Spark job on a Hadoop YARN cluster)
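The accepted URL shapes can be checked before submission. A minimal sketch in plain shell; the helper name `check_master_url` is hypothetical and not part of the archetype's runner script:

```shell
#!/bin/sh
# Return 0 if the given master URL matches one of the supported shapes:
# spark://<IP>:<Port>, yarn-client, or yarn-cluster.
check_master_url() {
  case "$1" in
    spark://*:[0-9]*)         return 0 ;;  # standalone master, e.g. spark://127.0.0.1:7077
    yarn-client|yarn-cluster) return 0 ;;
    *)                        return 1 ;;
  esac
}

# Example checks:
check_master_url "spark://127.0.0.1:7077" && echo "ok: standalone"
check_master_url "yarn-cluster"           && echo "ok: yarn-cluster"
check_master_url "mesos://host:5050"      || echo "rejected: unsupported"
```

Such a guard lets the wrapper fail fast with a readable message instead of handing a malformed URL to `spark-submit`.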
```shell
./bin/submit-job.sh \
  --local-mode \
  --master spark://127.0.0.1:7077 \
  --total-executor-cores 2 \
  -b \
  -m spark://127.0.0.1:7077
```
```shell
./bin/submit-job.sh \
  --local-mode \
  --master spark://127.0.0.1:7077 \
  --total-executor-cores 2 \
  -m spark://127.0.0.1:7077 \
  -t db-nifi \
  -k www.db.opendata.s-node.de:9092 \
  -p /tmp/
```