StackOverflow2013


plurals

POCalculating the probability of system failure in a distributed network
primarykey
Id
3092694
data
AcceptedAnswerId
3134599
AnswerCount
2
ClosedDate
CommentCount
6
CommunityOwnedDate
CreationDate
2010-06-22T11:39:03.817
FavoriteCount
4
LastActivityDate
2013-04-15T10:28:43.860
LastEditDate
2010-06-24T14:46:58.523
LastEditorUserId
71354
OwnerUserId
71354
ParentId
0
PostTypeId
1
Score
7
ViewCount
2994
LastEditorDisplayName
text
Body
I am trying to build a mathematical model of the availability of a file in a distributed file system. I posted this question at MathOverflow, but it might as well be classified as a CS question, so I am giving it a shot here as well.
The system works like this: a node stores a file (encoded using erasure codes) at r*b remote nodes, where r is the replication factor and b is an integer constant. Erasure-coded files have the property that the file can be restored iff at least b of the remote nodes are available and return their part of the file.
The simplest approach is to assume that all remote nodes are independent of each other and have the same availability p. With these assumptions, the number of available nodes follows the binomial distribution, so the file's availability is the probability that at least b of the r*b nodes are available, i.e. <a href="http://bit.ly/dyJwwE" rel="noreferrer">Binomial distribution http://bit.ly/dyJwwE</a>
Unfortunately, these two assumptions can introduce a non-negligible error, as shown by this paper: <a href="http://deim.urv.cat/~lluis.pamies/uploads/Main/icpp09-paper.pdf" rel="noreferrer">http://deim.urv.cat/~lluis.pamies/uploads/Main/icpp09-paper.pdf</a>.
One way to overcome the assumption that all nodes have the same availability is to calculate the probability of each possible combination of available/unavailable nodes and sum the probabilities of the combinations in which the file can be restored (which is sort of what they suggest in the paper above, just more formally than what I just described). You can see this approach as a binary tree of depth r*b in which each leaf is one possible combination of available/unavailable nodes. The file's availability is the probability that you arrive at a leaf with >=b available nodes. This approach is more correct but has a computational cost of <a href="http://bit.ly/cEZcAP" rel="noreferrer">Ordo http://bit.ly/cEZcAP</a> (the tree has 2^(r*b) leaves). Also, it doesn't deal with the assumption of node independence.
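To make the two models described above concrete, here is a minimal Python sketch. It assumes that r*b is an integer n and that nodes are independent in both cases. The function names <code>binomial_availability</code> and <code>enumerated_availability</code> are illustrative only and are not taken from the paper.

```python
from itertools import product
from math import comb


def binomial_availability(p, r, b):
    """Availability when all r*b nodes share the same availability p.

    Assumption: nodes are independent and r*b is an integer.
    """
    n = r * b
    # Probability that at least b of the n nodes are available.
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(b, n + 1))


def enumerated_availability(ps, b):
    """Availability when node i has its own availability ps[i].

    Walks every leaf of the binary tree (2**len(ps) combinations), so it is
    only practical for small r*b. Nodes are still assumed independent.
    """
    total = 0.0
    for states in product((True, False), repeat=len(ps)):
        if sum(states) >= b:  # the file can be restored in this combination
            prob = 1.0
            for up, p in zip(states, ps):
                prob *= p if up else 1 - p
            total += prob
    return total
```

When every entry of <code>ps</code> equals p, both functions should agree up to floating-point error, which is a quick sanity check of the enumeration.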
Do you have any ideas for a good approximation that introduces less error than the binomial-distribution approximation but has a better computational cost than <a href="http://bit.ly/cEZcAP" rel="noreferrer">http://bit.ly/d52MM9 http://bit.ly/cEZcAP</a>? You can assume that the availability data of each node is a set of tuples consisting of <code>(measurement-date, node measuring, node being measured, success/failure-bit)</code>. With this data you could, for example, calculate the correlation of availability between nodes and the variance of availability.
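As an illustration of what could be derived from such tuples, here is a rough Python sketch. It assumes each tuple is <code>(date, measuring_node, measured_node, ok)</code> with <code>ok</code> truthy on success, and that two nodes can be compared whenever they were measured on the same date. The function names are hypothetical, and if several nodes measure the same node on the same date, the last observation simply wins.

```python
from collections import defaultdict
from statistics import mean, pvariance


def per_node_availability(measurements):
    """Fraction of successful measurements for each measured node."""
    bits = defaultdict(list)
    for _date, _measurer, measured, ok in measurements:
        bits[measured].append(1 if ok else 0)
    return {node: mean(values) for node, values in bits.items()}


def availability_correlation(measurements, node_a, node_b):
    """Pearson correlation of two nodes' success bits over shared dates.

    Returns None when there are fewer than two shared dates or when one
    node's bits never vary (the correlation is undefined in that case).
    """
    by_date = defaultdict(dict)
    for date, _measurer, measured, ok in measurements:
        by_date[date][measured] = 1 if ok else 0  # last observation wins
    pairs = [(d[node_a], d[node_b]) for d in by_date.values()
             if node_a in d and node_b in d]
    if len(pairs) < 2:
        return None
    xs, ys = zip(*pairs)
    var_x, var_y = pvariance(xs), pvariance(ys)
    if var_x == 0 or var_y == 0:
        return None
    mean_x, mean_y = mean(xs), mean(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in pairs) / len(pairs)
    return cov / (var_x * var_y) ** 0.5
```

The per-node values could feed a model with heterogeneous availabilities, while a correlation far from zero would indicate how badly the independence assumption is violated for that pair of nodes.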
Tags
<computer-science><distributed><time-complexity><high-availability><binomial-cdf>
Title
Calculating the probability of system failure in a distributed network
singulars
PostAcceptedAnswerId
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
PostParentId
1. This table or related slice is empty.
PostTypePostTypeId
1. PTQuestion
UserLastEditorUserId
1. USYrlec
UserOwnerUserId
1. USYrlec
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
2. PO
 singulars
 PostTypePostTypeId
 PTAnswer
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 POCalculating the probability of system failure in a distributed network
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 POCalculating the probability of system failure in a distributed network
 UserUserId
 USPete Kirkham
 VoteTypeVoteTypeId
 VTFavorite
3. VO
 singulars
 PostPostId
 POCalculating the probability of system failure in a distributed network
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
CommentsPostId
