StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

POWhy does the speed of this SOR solver depend on the input?
primarykey
Id
2392932
data
AcceptedAnswerId
2393038
AnswerCount
2
ClosedDate
CommentCount
4
CommunityOwnedDate
CreationDate
2010-03-06T15:15:35.177
FavoriteCount
3
LastActivityDate
2010-03-07T11:33:04.897
LastEditDate
2017-05-23T12:11:49.347
LastEditorUserId
-1
OwnerUserId
14637
ParentId
0
PostTypeId
1
Score
6
ViewCount
1366
LastEditorDisplayName
text
Body
Related to my <a href="https://stackoverflow.com/questions/2388196/how-to-speed-up-my-sparse-matrix-solver">other question</a>, I have now modified the sparse matrix solver to use the SOR (Successive Over-Relaxation) method. The code is now as follows: <pre><code>void SORSolver::step() { float const omega = 1.0f; float const *b = &d_b(1, 1), *w = &d_w(1, 1), *e = &d_e(1, 1), *s = &d_s(1, 1), *n = &d_n(1, 1), *xw = &d_x(0, 1), *xe = &d_x(2, 1), *xs = &d_x(1, 0), *xn = &d_x(1, 2); float *xc = &d_x(1, 1); for (size_t y = 1; y < d_ny - 1; ++y) { for (size_t x = 1; x < d_nx - 1; ++x) { float diff = *b - *xc - *e * *xe - *s * *xs - *n * *xn - *w * *xw; *xc += omega * diff; ++b; ++w; ++e; ++s; ++n; ++xw; ++xe; ++xs; ++xn; ++xc; } b += 2; w += 2; e += 2; s += 2; n += 2; xw += 2; xe += 2; xs += 2; xn += 2; xc += 2; } } </code></pre> Now the weird thing is: if I increase <code>omega</code> (the relaxation factor), the execution speed starts to depend dramatically on the values inside the various arrays! For <code>omega = 1.0f</code>, the execution time is more or less constant. For <code>omega = 1.8</code>, the first time, it will typically take, say, 5 milliseconds to execute this <code>step()</code> 10 times, but this will gradually increase to nearly 100 ms during the simulation. If I set <code>omega = 1.0001f</code>, I see an accordingly slight increase in execution time; the higher <code>omega</code> goes, the faster execution time will increase during the simulation. Since all this is embedded inside the fluid solver, it's hard to come up with a standalone example. But I have saved the initial state and rerun the solver on that state every time step, as well as solving for the actual time step. For the initial state it was fast, for the subsequent time steps incrementally slower. Since all else is equal, that proves that the execution speed of this code is dependent on the values in those six arrays. This is reproducible on Ubuntu with g++, as well as on 64-bit Windows 7 when compiling for 32-bits with VS2008. I heard that NaN and Inf values can be slower for floating point calculations, but no NaNs or Infs are present. Is it possible that the speed of float computations otherwise depends on the values of the input numbers?
Tags
<c++><optimization><floating-point><sparse-matrix>
Title
Why does the speed of this SOR solver depend on the input?
singulars
PostAcceptedAnswerId
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
PostParentId
1. This table or related slice is empty.
PostTypePostTypeId
1. PTQuestion
UserLastEditorUserId
1. USCommunity
UserOwnerUserId
1. USThomas
plurals
PostLinksPostIdRelatedPostId
1. PL
 singulars
 LinkTypeLinkTypeId
 LTLinked
2. PL
 singulars
 LinkTypeLinkTypeId
 LTLinked
PostLinksRelatedPostIdPostId
1. PL
 singulars
 LinkTypeLinkTypeId
 LTLinked
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
2. PO
 singulars
 PostTypePostTypeId
 PTAnswer
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 POWhy does the speed of this SOR solver depend on the input?
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 POWhy does the speed of this SOR solver depend on the input?
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
3. VO
 singulars
 PostPostId
 POWhy does the speed of this SOR solver depend on the input?
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
CommentsPostId

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.