StackOverflow2013

plurals

POIs indexing vectors in MATLAB inefficient?
primarykey
Id
13382155
data
AcceptedAnswerId
0
AnswerCount
2
ClosedDate
CommentCount
12
CommunityOwnedDate
CreationDate
2012-11-14T15:46:36.337
FavoriteCount
17
LastActivityDate
2012-11-15T10:04:09.890
LastEditDate
2012-11-14T16:46:41.330
LastEditorUserId
1644189
OwnerUserId
1644189
ParentId
0
PostTypeId
1
Score
48
ViewCount
4025
LastEditorDisplayName
text
Body
Background
My question is motivated by simple observations that somewhat undermine beliefs often held by experienced MATLAB users:
<ul>
<li>MATLAB is very well optimized when it comes to the built-in functions and the fundamental language features, such as indexing vectors and matrices.</li>
<li>Loops in MATLAB are slow (despite the JIT) and should generally be avoided if the algorithm can be expressed in a native, 'vectorized' manner.</li>
</ul>
The bottom line: core MATLAB functionality is efficient, and trying to outperform it using MATLAB code is hard, if not impossible.
Investigating performance of vector indexing
The code examples below are as fundamental as it gets: I assign a scalar value to all vector entries. First, I allocate a vector <code>x</code> of zeros:
<pre><code>tic; x = zeros(1e8,1); toc
Elapsed time is 0.260525 seconds.
</code></pre>
Given <code>x</code>, I would like to set all its entries to the same value. In practice you would do it differently, e.g., <code>x = value*ones(1e8,1)</code>, but the point here is to investigate the performance of vector indexing. The simplest way is to write:
<pre><code>tic; x(:) = 1; toc
Elapsed time is 0.094316 seconds.
</code></pre>
Let's call it method 1 (from the value assigned to <code>x</code>). It seems to be very fast (faster, at least, than the memory allocation). Because the only thing I do here is operate on memory, I can estimate the efficiency of this code by calculating the effective memory bandwidth it achieves and comparing that to the hardware memory bandwidth of my computer:
<pre><code>eff_bandwidth = numel(x) * 8 bytes per double * 2 / time
</code></pre>
In the above, I multiply by <code>2</code> because, unless SSE streaming is used, setting values in memory requires that the vector is both read from and written to memory.
In the above example:
<pre><code>eff_bandwidth(1) = 1e8*8*2/0.094316 = 17 GB/s
</code></pre>
The <a href="https://www.cs.virginia.edu/stream/" rel="noreferrer">STREAM-benchmarked memory bandwidth</a> of my computer is around 17.9 GB/s, so indeed - MATLAB delivers close to peak performance in this case! So far, so good.
Method 1 is suitable if you want to set all vector elements to some value. But if you want to access only every <code>step</code>-th entry, you need to replace the <code>:</code> with, e.g., <code>1:step:end</code>. Below is a direct speed comparison with method 1:
<pre><code>tic; x(1:end) = 2; toc
Elapsed time is 0.496476 seconds.
</code></pre>
While you would not expect it to perform any differently, method 2 is clearly in big trouble: a factor of 5 slowdown for no apparent reason. My suspicion is that in this case MATLAB explicitly allocates the index vector (<code>1:end</code>). This is somewhat confirmed by using an explicit vector size instead of <code>end</code>:
<pre><code>tic; x(1:1e8) = 3; toc
Elapsed time is 0.482083 seconds.
</code></pre>
Methods 2 and 3 perform equally badly.
Another possibility is to explicitly create an index vector <code>id</code> and use it to index <code>x</code>. This gives you the most flexible indexing capabilities. In our case:
<pre><code>tic;
id = 1:1e8; % colon(1,1e8);
x(id) = 4;
toc
Elapsed time is 1.208419 seconds.
</code></pre>
Now that is really something - a 12-fold slowdown compared to method 1! I understand it should perform worse than method 1 because of the additional memory used for <code>id</code>, but why is it so much worse than methods 2 and 3?
Let's give loops a try - as hopeless as it may sound.
<pre><code>tic;
for i=1:numel(x)
    x(i) = 5;
end
toc
Elapsed time is 0.788944 seconds.
</code></pre>
A big surprise - a loop beats the <code>vectorized</code> method 4, but is still slower than methods 1, 2 and 3.
It turns out that in this particular case you can do better:
<pre><code>tic;
for i=1:1e8
    x(i) = 6;
end
toc
Elapsed time is 0.321246 seconds.
</code></pre>
And that is probably the most bizarre outcome of this study - a loop written in MATLAB significantly outperforms native vector indexing. That should certainly not be so. Note that the JIT'ed loop is still 3 times slower than the theoretical peak almost obtained by method 1, so there is still plenty of room for improvement. It is just surprising (a stronger word would be more suitable) that the usual 'vectorized' indexing (<code>1:end</code>) is even slower.
Questions
<ul>
<li>Is simple indexing in MATLAB very inefficient (methods 2, 3, and 4 are slower than method 1), or did I miss something?</li>
<li>Why is method 4 (so much) slower than methods 2 and 3?</li>
<li>Why does using <code>1e8</code> instead of <code>numel(x)</code> as a loop bound speed up the code by a factor of 2?</li>
</ul>
Edit
After reading Jonas's comment, here is another way to do that, using logical indices:
<pre><code>tic;
id = logical(ones(1, 1e8));
x(id) = 7;
toc
Elapsed time is 0.613363 seconds.
</code></pre>
Much better than method 4.
For convenience:
<pre><code>function test

tic; x = zeros(1,1e8); toc

tic; x(:) = 1; toc

tic; x(1:end) = 2; toc

tic; x(1:1e8) = 3; toc

tic;
id = 1:1e8; % colon(1,1e8);
x(id) = 4;
toc

tic;
for i=1:numel(x)
    x(i) = 5;
end
toc

tic;
for i=1:1e8
    x(i) = 6;
end
toc

end
</code></pre>
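The single <code>tic</code>/<code>toc</code> measurements above can be noisy, so here is a minimal sketch of how the comparison could be repeated and turned into bandwidth figures with the same formula. It is not part of the original measurements: the names <code>n</code>, <code>nrep</code>, <code>t</code> and <code>names</code> are made up for illustration, and <code>n</code> is reduced to 1e7 (an assumption, so that the script finishes quickly). It prints timings from whatever machine it runs on and reproduces none of the numbers quoted above.

```matlab
% Sketch only: repeat three of the indexing variants and keep the best time.
n    = 1e7;            % assumption: smaller than the 1e8 used in the question
nrep = 5;              % number of repetitions per method (hypothetical choice)
x    = zeros(n,1);
t    = inf(3,1);       % best observed time for each method

for r = 1:nrep
    tic; x(:)     = 1;            t(1) = min(t(1), toc);   % method 1
    tic; x(1:end) = 2;            t(2) = min(t(2), toc);   % method 2
    tic; id = 1:n; x(id) = 4;     t(3) = min(t(3), toc);   % method 4, id built inside the timed region
end

% Same estimate as in the question: 8 bytes per double, times 2 (read + write).
eff_bandwidth = n * 8 * 2 ./ t / 1e9;   % GB/s

names = {'x(:)', 'x(1:end)', 'x(id)'};
for k = 1:numel(names)
    fprintf('%-10s %8.4f s  %6.2f GB/s\n', names{k}, t(k), eff_bandwidth(k));
end
```

Taking the minimum over several runs is one common way to reduce the effect of first-run overhead and background load. It is a design choice, not the only option: MATLAB's <code>timeit</code> function would be an alternative for timing code wrapped in a function handle.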
Tags
<arrays><performance><matlab><loops><vectorization>
Title
Is indexing vectors in MATLAB inefficient?
singulars
PostAcceptedAnswerId
1. This table or related slice is empty.
PostParentId
1. This table or related slice is empty.
PostTypePostTypeId
1. PTQuestion
UserLastEditorUserId
1. USangainor
UserOwnerUserId
1. USangainor
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. PL
 singulars
 LinkTypeLinkTypeId
 LTLinked
2. PL
 singulars
 LinkTypeLinkTypeId
 LTLinked
3. PL
 singulars
 LinkTypeLinkTypeId
 LTLinked
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
2. PO
 singulars
 PostTypePostTypeId
 PTAnswer
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 POIs indexing vectors in MATLAB inefficient?
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 POIs indexing vectors in MATLAB inefficient?
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
3. VO
 singulars
 PostPostId
 POIs indexing vectors in MATLAB inefficient?
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
CommentsPostId
