TY - GEN
T1 - One for all and all for one: Scalable consensus in a hybrid communication model
AU - Raynal, Michel
AU - Cao, Jiannong
PY - 2019/7
Y1 - 2019/7
N2 - This paper addresses consensus in an asynchronous model where the processes are partitioned into clusters. Inside each cluster, processes can communicate through a shared memory, which favors efficiency. Moreover, any pair of processes can also communicate through a message-passing communication system, which favors scalability. In such a 'hybrid communication' context, the paper presents two simple binary consensus algorithms (one based on local coins, the other one based on a common coin). These algorithms are straightforward extensions of existing message-passing randomized round-based consensus algorithms. At each round, the processes of each cluster first agree on the same value (using an underlying shared memory consensus algorithm), and then use a message-passing algorithm to converge on the same decided value. The algorithms are such that, if all except one processes of a cluster crash, the surviving process acts as if all the processes of its cluster were alive (hence the motto 'one for all and all for one'). As a consequence, the hybrid communication model allows us to obtain simple, efficient, and scalable fault-tolerant consensus algorithms. As an important side effect, according to the size of each cluster, consensus can be obtained even if a majority of processes crash.
AB - This paper addresses consensus in an asynchronous model where the processes are partitioned into clusters. Inside each cluster, processes can communicate through a shared memory, which favors efficiency. Moreover, any pair of processes can also communicate through a message-passing communication system, which favors scalability. In such a 'hybrid communication' context, the paper presents two simple binary consensus algorithms (one based on local coins, the other one based on a common coin). These algorithms are straightforward extensions of existing message-passing randomized round-based consensus algorithms. At each round, the processes of each cluster first agree on the same value (using an underlying shared memory consensus algorithm), and then use a message-passing algorithm to converge on the same decided value. The algorithms are such that, if all except one processes of a cluster crash, the surviving process acts as if all the processes of its cluster were alive (hence the motto 'one for all and all for one'). As a consequence, the hybrid communication model allows us to obtain simple, efficient, and scalable fault-tolerant consensus algorithms. As an important side effect, according to the size of each cluster, consensus can be obtained even if a majority of processes crash.
KW - Asynchronous system
KW - Atomic register
KW - Binary consensus
KW - Cluster
KW - Common coin
KW - Compare and swap
KW - Hybrid communication
KW - Local coin
KW - Message-passing
KW - Modularity
KW - Process crash failure
KW - Scalability
UR - https://www.scopus.com/pages/publications/85074870818
U2 - 10.1109/ICDCS.2019.00053
DO - 10.1109/ICDCS.2019.00053
M3 - Conference article published in proceeding or book
AN - SCOPUS:85074870818
T3 - Proceedings - International Conference on Distributed Computing Systems
SP - 464
EP - 471
BT - Proceedings - 2019 39th IEEE International Conference on Distributed Computing Systems, ICDCS 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 39th IEEE International Conference on Distributed Computing Systems, ICDCS 2019
Y2 - 7 July 2019 through 9 July 2019
ER -