StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

PO
primarykey
Id
19524187
data
AcceptedAnswerId
0
AnswerCount
0
ClosedDate
CommentCount
0
CommunityOwnedDate
CreationDate
2013-10-22T17:09:19.023
FavoriteCount
0
LastActivityDate
2013-10-22T17:09:19.023
LastEditDate
LastEditorUserId
0
OwnerUserId
325565
ParentId
19521493
PostTypeId
2
Score
2
ViewCount
0
LastEditorDisplayName
text
Body
<h2>Summary</h2> In a nutshell, just move the view before the slicing. Instead of: <pre><code>ar2 = zeros((1000,2000),dtype=uint16) ar2 = ar2[:,1000:] ar2 = ar2.view(dtype=uint8) </code></pre> Do: <pre><code>ar2 = zeros((1000,2000),dtype=uint16) ar2 = ar2.view(dtype=uint8) # ar2 is now a 1000x4000 array... ar2 = ar2[:,2000:] # Note the 2000 instead of 1000! </code></pre> What's happening is that the sliced array isn't contiguous (as @Craig noted) and <code>view</code> errs on the conservative side and doesn't try to re-interpret non-contiguous memory buffers. (It happens to be possible in this exact case, but in some cases it would result in a non-evenly-strided array, which numpy doesn't allow.) <hr> If you're not very familiar with <code>numpy</code>, it's possible that you're misunderstanding <code>view</code>, and you actually want <code>astype</code> instead. <hr> <h2>What does <code>view</code> do?</h2> First off, let's take a detailed look at what <code>view</code> does. In this case, it re-interprets the memory buffer of a numpy array as a new datatype, if possible. That means that the number of elements in the array will often change when you use view. (You can also use it to view the array as a different subclass of <code>ndarray</code>, but we'll skip that part for now.) You may already be aware of the following (your problem is a bit more subtle), but if not, here's an explanation. As an example: <pre><code>In [1]: import numpy as np In [2]: x = np.zeros(2, dtype=np.uint16) In [3]: x Out[3]: array([0, 0], dtype=uint16) In [4]: x.view(np.uint8) Out[4]: array([0, 0, 0, 0], dtype=uint8) In [5]: x.view(np.uint32) Out[5]: array([0], dtype=uint32) </code></pre> If you want to make a copy of the array with the new datatype instead, use <code>astype</code>: <pre><code>In [6]: x Out[6]: array([0, 0], dtype=uint16) In [7]: x.astype(np.uint8) Out[7]: array([0, 0], dtype=uint8) In [8]: x.astype(np.uint32) Out[8]: array([0, 0], dtype=uint32) </code></pre> <hr> Now let's take a look at what happens with when viewing a 2D array. <pre><code>In [9]: y = np.arange(4, dtype=np.uint16).reshape(2, 2) In [10]: y Out[10]: array([[0, 1], [2, 3]], dtype=uint16) In [11]: y.view(np.uint8) Out[12]: array([[0, 0, 1, 0], [2, 0, 3, 0]], dtype=uint8) </code></pre> Notice that the shape of the array has changed, and that the changes have happened along the last axis (in this case, extra columns have been added). At first glance it may appear that extra zeros have been added. It's not that extra zeros are being inserted, it's that the <code>uint16</code> representation of <code>2</code> is equivalent to two <code>uint8</code>s, one with a value of <code>2</code> and one with a value of <code>0</code>. Therefore, any <code>uint16</code> under 255 will result in the value and a zero, while any value over that will result in two smaller <code>uint8</code>s. As an example: <pre><code>In [13]: y * 100 Out[14]: array([[ 0, 100], [200, 300]], dtype=uint16) In [15]: (y * 100).view(np.uint8) Out[15]: array([[ 0, 0, 100, 0], [200, 0, 44, 1]], dtype=uint8) </code></pre> <hr> <h2>What's happening behind the scenes</h2> Numpy arrays consist of a "raw" memory buffer that's interpreted through a shape, a dtype, and strides (and an offset, but let's ignore that for now). For more detail, there are several good overviews: <a href="http://docs.scipy.org/doc/numpy/reference/arrays.ndarray.html" rel="nofollow">the official documentation</a>, <a href="http://csc.ucdavis.edu/~chaos/courses/nlp/Software/NumPyBook.pdf" rel="nofollow">the numpy book</a>, or <a href="http://scipy-lectures.github.io/advanced/advanced_numpy/" rel="nofollow">scipy-lectures</a>. This allows numpy to be very memory efficient and "slice and dice" the underlying memory buffer in many different ways without making a copy. Strides tell numpy how many bytes to jump within the memory buffer to go one increment along a particular axis. For example: <pre><code>In [17]: y Out[17]: array([[0, 1], [2, 3]], dtype=uint16) In [18]: y.strides Out[18]: (4, 2) </code></pre> So, to go one row deeper in the array, numpy needs to step forward 4 bytes in the memory buffer, while to go one column farther in the array, numpy needs to step 2 bytes. Transposing the array just amounts to reversing the strides (and shape, but in this case, <code>y</code> is 2x2): <pre><code>In [19]: y.T.strides Out[19]: (2, 4) </code></pre> When we view the array as <code>uint8</code>, the strides change. We still step forward 4 bytes per row, but only one byte per column: <pre><code>In [20]: y.view(np.uint8).strides Out[20]: (4, 1) </code></pre> However, numpy arrays have to have the one stride length per dimension. This is what "evenly-strided" means. In other words, do move forward one row/column/whatever, numpy needs to be able to step the same amount through the underlying memory buffer each time. In other words, there's no way to tell numpy to step different amounts for each row/column/whatever. For that reason, <code>view</code> takes a very conservative route. If the array isn't contiguous, and the view would change the shape and strides of the array, it doesn't try to handle it. As @Craig noted, it's because the slice of <code>y</code> isn't contiguous that <code>view</code> isn't working. There are plenty of cases (yours is one) where the resulting array would be valid, but the <code>view</code> method doesn't try to be too smart about it. To really play around with what's possible, you can use <code>numpy.lib.stride_tricks.as_strided</code> or directly use the <a href="http://docs.scipy.org/doc/numpy/reference/arrays.interface.html#__array_interface__" rel="nofollow"><code>__array_interface__</code></a>. It's a good learning tool to experiment with it, but you have to really understand what you're doing to use it effectively. Hopefully that helps a bit, anyway! Sorry for the long-winded answer!
Tags
Title
singulars
PostAcceptedAnswerId
1. This table or related slice is empty.
PostParentId
1. POnumpy.view gives valueerror
 singulars
 PostTypePostTypeId
 PTQuestion
PostTypePostTypeId
1. PTAnswer
UserLastEditorUserId
1. This table or related slice is empty.
UserOwnerUserId
1. USJoe Kington
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. POnumpy.view gives valueerror
 singulars
 PostTypePostTypeId
 PTQuestion
PostsParentIdCreationDate
1. This table or related slice is empty.
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
3. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTAcceptedByOriginator
CommentsPostId
1. This table or related slice is empty.

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.