Spark Java access remote HDFS

Suppose we need to work with a different HDFS (clusterB, for instance) from our Spark Java application running on clusterA. Firstly, you need to add a --conf key to your run command; the exact key depends on your Spark version. Secondly, when creating Spark’s Java context, add the clusterB configuration to it. For that, you need to go to clusterB and gather core-site.xml and hdfs-site.xml from there (the default location for Cloudera is /etc/hadoop/conf) […]
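
The excerpt elides the actual snippets, but here is a minimal sketch of both steps, assuming a YARN deployment. The --conf key is presumably the remote-HDFS access property, whose name changed across Spark versions: spark.yarn.access.namenodes in older releases, spark.yarn.access.hadoopFileSystems from Spark 2.2, spark.kerberos.access.hadoopFileSystems from Spark 3.0. Class names and paths below are illustrative, not taken from the original post.

// Run command sketch (pick the property matching your Spark version):
//   spark-submit --conf spark.yarn.access.hadoopFileSystems=hdfs://clusterB ...

import org.apache.hadoop.fs.Path;
import org.apache.spark.SparkConf;
import org.apache.spark.api.java.JavaSparkContext;

public class RemoteHdfsSparkApp {
    public static void main(String[] args) {
        SparkConf conf = new SparkConf().setAppName("remote-hdfs-example");
        JavaSparkContext sc = new JavaSparkContext(conf);

        // Register clusterB's core-site.xml and hdfs-site.xml (copied next
        // to the app) with the Hadoop configuration of this Spark context.
        sc.hadoopConfiguration().addResource(new Path("core-site.xml"));
        sc.hadoopConfiguration().addResource(new Path("hdfs-site.xml"));

        // Fully qualified paths now resolve against clusterB's namenode.
        sc.textFile("hdfs://clusterB/path/to/data").take(10)
          .forEach(System.out::println);

        sc.stop();
    }
}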


Java access remote HDFS from current Hadoop cluster

Suppose we have our Java app running on Hadoop clusterA, and we want to access the remote HDFS of Hadoop clusterB. Let’s see how we can do it: you need to go to clusterB, gather core-site.xml and hdfs-site.xml from there (the default location for Cloudera is /etc/hadoop/conf), and put them next to your app running on clusterA. […]
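
As a sketch of how that access might look in plain Java (the hdfs://clusterB URI and the /user directory are illustrative; in practice use the fs.defaultFS value from clusterB’s core-site.xml):

import java.net.URI;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class RemoteHdfsClient {
    public static void main(String[] args) throws Exception {
        // Load clusterB's config files copied next to the app
        // (originally taken from /etc/hadoop/conf on clusterB).
        Configuration conf = new Configuration();
        conf.addResource(new Path("core-site.xml"));
        conf.addResource(new Path("hdfs-site.xml"));

        // Ask explicitly for clusterB's filesystem rather than the default.
        FileSystem remoteFs = FileSystem.get(URI.create("hdfs://clusterB"), conf);

        // List a remote directory as a smoke test.
        for (FileStatus status : remoteFs.listStatus(new Path("/user"))) {
            System.out.println(status.getPath());
        }
    }
}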
