Hi, not sure if this is the best place to ask but let me give it a try:
We are running a 9 node cluster in our own DC. Every day we batch-insert a couple of million rows in about 30 different tables. We use it sort of as a front-end to our Hive/Hadoop platform.
Also, all the queries are simple
selects on a single table .
Now, besides our daily batch-inserts we recently also started some import jobs from other datastores. And after running the cluster for a while we noticed the following “Problem Ranges Report”:
I am now wondering what can we do about it? Would it make sense to restart the problematic nodes in this case?