StackOverflow2013

plurals

PO
primarykey
Id
3380229
data
AcceptedAnswerId
0
AnswerCount
0
ClosedDate
CommentCount
1
CommunityOwnedDate
CreationDate
2010-07-31T23:56:21.563
FavoriteCount
0
LastActivityDate
2010-07-31T23:56:21.563
LastEditDate
LastEditorUserId
0
OwnerUserId
95810
ParentId
3380146
PostTypeId
2
Score
4
ViewCount
0
LastEditorDisplayName
text
Body
If the "lines" your file is divided into are of reasonable lengths, and there are no binary sequences in it that "reading as text" would break, you can use <code>fileinput</code>'s handy "make believe I'm rewriting a file in place" functionality:
<pre><code>
import re
import fileinput

tagre = re.compile(r"<o:p>.*?</o:p>")

def sub(mo):
    return mo.group().replace(r"'", r"\'")

for line in fileinput.input('thefilename', inplace=True):
    # Python 2 syntax: the trailing comma suppresses print's extra newline
    print tagre.sub(sub, line),
</code></pre>
If not, you'll have to simulate the "in-place rewriting" yourself, e.g. (oversimplified...):
<pre><code>
with open('thefilename', 'rb') as inf:
    with open('fixed', 'wb') as ouf:
        while True:
            b = inf.read(1024*1024)
            if not b: break
            ouf.write(tagre.sub(sub, b))
</code></pre>
and then move <code>'fixed'</code> to take the place of <code>'thefilename'</code> (either in code, or manually) if you need that filename to remain after the fixing.

This is oversimplified because one of the crucial <code><o:p> ... </o:p></code> parts might end up split between two successive megabyte "blocks" and therefore not be identified (in the first example, I'm assuming each such part is always fully contained within a "line" -- if that's not the case then you should not use that code, but the following, anyway). Fixing this requires, alas, more complicated code...:
<pre><code>
with open('thefilename', 'rb') as inf:
    with open('fixed', 'wb') as ouf:
        while True:
            b = getblock(inf)
            if not b: break
            ouf.write(tagre.sub(sub, b))
</code></pre>
with e.g.
<pre><code>
partsofastartag = '<', '<o', '<o:', '<o:p'

def getblock(inf):
    b = ''
    while True:
        newb = inf.read(1024 * 1024)
        if not newb: return b
        b += newb
        # keep reading if the block ends in what may be a truncated opening tag
        if any(b.endswith(p) for p in partsofastartag): continue
        # keep reading until opening and closing tags balance
        if b.count('<o:p>') != b.count('</o:p>'): continue
        return b
</code></pre>
As you see, this is pretty delicate code, and therefore, what with it being untested, I can't know that it is correct for your problem.
In particular, can there be cases of <code><o:p></code> that are NOT matched by a closing <code></o:p></code> or vice versa? If so, then a call to <code>getblock</code> could end up returning the whole file in quite a costly way, and even the RE matching and substitution might backfire (the latter would also occur if SOME of the single-quotes in such tags are already properly escaped, but not all).

If you have at least a GB or so, avoiding the delicate issues with block division, at least, IS feasible, since everything should fit in memory, making the code much simpler:
<pre><code>
with open('thefilename', 'rb') as inf:
    with open('fixed', 'wb') as ouf:
        b = inf.read()
        ouf.write(tagre.sub(sub, b))
</code></pre>
However, the other issues mentioned above (possible unbalanced opening/closing tags, etc) might remain -- only you can study your existing defective data and see if it affords such a reasonably simple approach at fixing!
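For readers on Python 3, the block-wise idea above could be sketched roughly as follows. This is not the code from the answer but a variant under stated assumptions: the names <code>fix_stream</code>, <code>safe_split_point</code> and <code>escape_quotes</code> are made up for illustration, the pattern is a bytes pattern because the file is read in binary mode, <code>re.DOTALL</code> is added so a tag may span line breaks, and the tags are assumed to be balanced and not nested. Instead of counting tags, it holds back any unfinished <code><o:p></code> span until its closing tag arrives, so an unclosed tag would still cause the rest of the file to pile up in memory.

```python
import io
import re

# Bytes pattern because the data is read in binary mode.
# re.DOTALL is an assumption of this sketch: it lets a tag span line breaks.
TAG_RE = re.compile(rb"<o:p>.*?</o:p>", re.DOTALL)
OPEN_TAG = b"<o:p>"
CLOSE_TAG = b"</o:p>"


def escape_quotes(match):
    """Backslash-escape every single quote inside one matched span."""
    return match.group().replace(b"'", b"\\'")


def safe_split_point(buf):
    """Return how many leading bytes of buf can be processed right now.

    Assumes balanced, non-nested tags (hypothetical helper, not from the answer).
    """
    last_open = buf.rfind(OPEN_TAG)
    if last_open != -1 and buf.find(CLOSE_TAG, last_open) == -1:
        # The last opening tag has no closing tag yet: hold it back.
        return last_open
    # Hold back a trailing fragment that might be the start of "<o:p>".
    for n in range(len(OPEN_TAG) - 1, 0, -1):
        if buf.endswith(OPEN_TAG[:n]):
            return len(buf) - n
    return len(buf)


def fix_stream(inf, ouf, blocksize=1024 * 1024):
    """Copy inf to ouf, escaping quotes inside <o:p>...</o:p> spans."""
    pending = b""
    while True:
        block = inf.read(blocksize)
        if not block:
            break
        pending += block
        cut = safe_split_point(pending)
        ouf.write(TAG_RE.sub(escape_quotes, pending[:cut]))
        pending = pending[cut:]
    # At end of input, whatever is still pending is processed as it stands.
    ouf.write(TAG_RE.sub(escape_quotes, pending))


if __name__ == "__main__":
    # Tiny in-memory demonstration with a deliberately small block size.
    src = io.BytesIO(b"keep 'this' <o:p>it's here</o:p> and 'that'")
    dst = io.BytesIO()
    fix_stream(src, dst, blocksize=4)
    print(dst.getvalue())
```

With real files you would pass objects opened with <code>open('thefilename', 'rb')</code> and <code>open('fixed', 'wb')</code>, as in the examples above. The caveat about already-escaped quotes applies here too: running it twice would escape them twice.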
Tags
Title
singulars
PostAcceptedAnswerId
1. This table or related slice is empty.
PostParentId
1. POEscape quotes contained within certain html tags
 singulars
 PostTypePostTypeId
 PTQuestion
PostTypePostTypeId
1. PTAnswer
UserLastEditorUserId
1. This table or related slice is empty.
UserOwnerUserId
1. USAlex Martelli
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. POEscape quotes contained within certain html tags
 singulars
 PostTypePostTypeId
 PTQuestion
PostsParentIdCreationDate
1. This table or related slice is empty.
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
3. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
CommentsPostId
1. COThanks, that worked great, all I need to do now is find all the other queries with unescaped quotes that don't have such a handy way of finding them...
 singulars
 PostPostId
 PO
 UserUserId
 USfredley
