cloudera Archives - Java / Cloud / BigData

November 2, 2022November 8, 2022
Hadoop, Java, Spark
Leave a comment

Spark Java access remote HDFS

Suppose we need to work with different HDFS (clusterB, for instance) from our Spark Java application, running on clusterA. Firstly, you need to add –conf key to your run command. Depends on Spark version: Secondly, when you creating Spark’s Java context, add that: You need to go to clusterB and gather core-site.xml and hdfs-site.xml from there (default location for Cloudera is /etc/hadoop/conf) […]

November 2, 2022November 2, 2022
Hadoop, Java
Leave a comment

Java access remote HDFS from current Hadoop cluster

Suppose we have our Java app running on Hadoop clusterA, and we want to access remote HDFS based on Hadoop clusterB. Let’s see how we can do it: You need to go to clusterB and gather core-site.xml and hdfs-site.xml from there (default location for Cloudera is /etc/hadoop/conf) and put near your app running in clusterA. […]

August 5, 2022August 5, 2022
Hadoop, Java
Leave a comment

HDFS DELEGATION TOKEN can’t be found in cache

The problem can be appears in Hadoop’s NodeManager logs. Usually it means that NodeManager is trying to use an expired / not renewed HDFS delegation token. For example, you can face this error while app log aggregation process. The timeline is: Your application pass HDFS delegation token to the NodeManager through the ContainerLaunchContext class, because […]

April 14, 2021April 14, 2021
Hadoop, Java, ZooKeeper
Leave a comment

ZooKeeper recursive watcher

If you need to set up a recursive watchers (watch on all nodes), the standard ZooKeeper’s Watcher class will not help much – it is installed on only 1 node (or one-level-forward when you calling getChildren()), and is also a one-time event. This means that after each watch trigger, you need to install a new […]

March 25, 2021August 10, 2021
Hadoop, Java, Spring Boot
Leave a comment

How to immediately terminate the Spring Boot Yarn container with an error

Imagine an error or exception occurs while running the Spring Boot Yarn container, and we need to kill container from itself and return an error code. You can use @OnContainerStart annotation as mentioned in this article: https://mchesnavsky.tech/how-to-set-up-exit-code-on-spring-boot-yarn-container. But if we need to stop the container immediately, we just need to call: – where parameter is […]

March 21, 2021November 23, 2021
Hadoop, HBase, ZooKeeper
2 Comments

KeeperErrorCode = ConnectionLoss for /hbase/hbaseid

IMPORTANT! If you trying to install Apache Atlas and receiving this error, there is a separate article: https://mchesnavsky.tech/apache-atlas-building-installing/ Suppose that we are faced with these exceptions. The first: The second: The third: The hbase-client cannot connect to the Zookeeper. You need to pay attention to the address: If there is a real Zookeeper instance at […]

March 11, 2021August 10, 2021
Hadoop, Java, ZooKeeper
Leave a comment

How to find out what is taking up disk space in Zookeeper

If you came to this page, it means that ZooKeeper began to take up a lot of space, and you need to find the reason. Mostly ephemeral data is stored in ZooKeeper. So at some time you asks the question – why the server began to consume so much disk space? The fact is that […]

March 5, 2021August 10, 2021
Hadoop, Ignite, Java
Leave a comment

IgniteException: Work directory does not exist

Suppose that when starting the application, you encounter the following exception: It is assumed, that the /tmp/ignite/work directory is located on the local file system. If the application is running on a cluster, then Ignite will try to create such a folder on each node of the cluster. Exception reasons: The account under which the […]

March 5, 2021August 10, 2021
Hadoop, Java, Spring Boot
Leave a comment

Why the AppMaster receives a different error code from the container Spring Boot Yarn

In the previous article, we already figured out how to terminate a container with a specific exit code. To begin with, the return value must be smaller then 256. You can notice that: If you terminate the container with a code of 0 or some other positive number, then the same code will be sent […]

March 5, 2021August 10, 2021
Hadoop, Java, Spring Boot
Leave a comment

How to set up exit code on Spring Boot Yarn container

You need to mark container’s start class using the @YarnComponent and mark method with the @OnContainerStart annotations: Despite the fact that the description of the annotation says that it cannot return anything, the return value works. We make the return value of type int. 0 – means successful shutdown of the container, non-zero values – […]