6 nodes with 2 nodes in each DC, of size 1TB.
Replica count 3, with restriction that 1 copy of replica is present in each DC.
Question: For a node N1 of size 1TB, how much storage should be provisioned for the user data?
Considering the fact that,
- the other node N2 in the same DC might go down, and data from N2 will be migrated to N1.
- data in N2(which is going down) might be in compressed form & would required additional storage in N1 for the rebalancing.
As per my understanding, a rough guess considering point(1) above will be 50% of 1TB, but again considering (2), it will be less than 50%.