Abstract
We design and implement LEGOStore, an erasure coding (EC) based linearizable data store over geo-distributed public cloud data centers (DCs). For such a data store, the confluence of the following factors opens up opportunities for EC to be latency-competitive with replication: (a) the necessity of communicating with remote DCs to tolerate entire DC failures and implement linearizability; and (b) the emergence of DCs near most large population centers. LEGOStore employs an optimization framework that, for a given object, carefully chooses among replication and EC, as well as among various DC placements to minimize overall costs. To handle workload dynamism, LEGOStore employs a novel agile reconfiguration protocol. Our evaluation using a LEGOStore prototype spanning 9 Google Cloud Platform DCs demonstrates the efficacy of our ideas. We observe cost savings ranging from moderate (5-20%) to significant (60%) over baselines representing the state of the art while meeting tail latency SLOs. Our reconfiguration protocol is able to transition key placements in 3 to 4 inter-DC RTTs (< 1s in our experiments), allowing for agile adaptation to dynamic conditions.
| Original language | English (US) |
|---|---|
| Pages (from-to) | 2201-2215 |
| Number of pages | 15 |
| Journal | Proceedings of the VLDB Endowment |
| Volume | 15 |
| Issue number | 10 |
| DOIs | |
| State | Published - 2022 |
| Event | 48th International Conference on Very Large Data Bases, VLDB 2022 - Sydney, Australia Duration: Sep 5 2022 → Sep 9 2022 |
All Science Journal Classification (ASJC) codes
- Computer Science (miscellaneous)
- General Computer Science