Let’s imagine that you need to know the actual distribution of keys by regions of the HBase table. There is no direct way to find it. But you can use this workaround: you need to send scan queries with region boundaries:
private AtomicLong count = new AtomicLong(0L);
private void printRegionStats() throws IOException {
List<HRegionInfo> regions = connection.getAdmin().getTableRegions(tableName);
try (Table table = connection.getTable(tableName)) {
regions.forEach(r -> {
Scan scan = new Scan();
scan.setStartRow(r.getStartKey());
scan.setStopRow(r.getEndKey());
try (ResultScanner resultScanner = table.getScanner(scan)) {
resultScanner.forEach(result -> { count.incrementAndGet() });
} catch (IOException e) {
e.printStackTrace();
}
log.error("found new region. keys count: {}", counter);
counter.set(0L);
});
}
}
This code will display line by line the number of keys in each region of the table.
If you still have any questions, feel free to ask me in the comments under this article, or write me on promark33@gmail.com.