StackOverflow2013


plurals

POHow do I cycle through a csv in python, writing lines to a new file that meet new criteria
primarykey
Id
14928626
data
AcceptedAnswerId
14929419
AnswerCount
1
ClosedDate
CommentCount
0
CommunityOwnedDate
CreationDate
2013-02-18T02:28:43.440
FavoriteCount
0
LastActivityDate
2013-02-18T04:17:54.013
LastEditDate
LastEditorUserId
0
OwnerUserId
437350
ParentId
0
PostTypeId
1
Score
0
ViewCount
623
LastEditorDisplayName
text
Body
I've been at this a while now, and I think it's in my best interest to ask the experts for advice. I know I'm not writing this the best way possible, and I've gone down a rabbit hole and confused myself.

I have a CSV. A bunch, actually. That part is not the problem. The lines at the top of the CSV are not really CSV data, but they do contain an important piece of info: the date for which the data is valid. For certain kinds of report it is on one line, and for others on another. My data starts on some line down from the top, usually 10 or 11, but I can't always be certain. I do know that the first column always has the same info (the header of the table of data).

I want to pull the report date from the preceding lines, and for file type A, do stuffA, and for file type B, do stuffB, then write out that row to a new file. I'm having a problem incrementing the row and I have no idea what I'm doing wrong.

Sample data:

<pre><code>"Attribute ""OPSURVEYLEVEL2_O"" [Category = ""Retail v1""]"
Date exported: 2/16/13
Exported by user: William
Project:
Classification: Online Retail v1
Report type: Attributes
Date range: from 12/14/12 to 12/14/12
"Filter OpSurvey Level 2(mine): [ LEVEL:SENTENCE TYPE:KEYWORD {OPSURVEYLEVEL2_O:""gift certificate redemption"", OPSURVEYLEVEL2_O:""combine accounts"", OPSURVEYLEVEL2_O:""cancel account"", OPSURVEYLEVEL2_O:""saved project moved to purchased project"", OPSURVEYLEVEL2_O:""unlock account"", OPSURVEYLEVEL2_O:""affiliate promotions"", OPSURVEYLEVEL2_O:""print to store coupons"", OPSURVEYLEVEL2_O:""disclaimer not clear"", OPSURVEYLEVEL2_O:""prepaid issue"", OPSURVEYLEVEL2_O:""customer wants to use coupons for print to store"", OPSURVEYLEVEL2_O:""customer received someone else's order"", OPSURVEYLEVEL2_O:""hi-res images unavailable"", OPSURVEYLEVEL2_O:""how to re-order"", OPSURVEYLEVEL2_O:""missing items"", OPSURVEYLEVEL2_O:""missing envelopes: print to store"", OPSURVEYLEVEL2_O:""missing envelopes: mail order"", OPSURVEYLEVEL2_O:""group rooms"", 
OPSURVEYLEVEL2_O:""print to store"", OPSURVEYLEVEL2_O:""print to store coupons"", OPSURVEYLEVEL2_O:""publisher: card not available for print to store"", OPSURVEYLEVEL2_O:publisher}]"
Total: 905
OPSURVEYLEVEL2_O,Distinct Document,% of Document,Sentiment Score
PRINT TO STORE,297,32.82,-0.1
...
</code></pre>

Sample code:

<pre><code>#!/usr/bin/python
import csv, os, glob, sys, errno

path = '/path/to/Downloads'

for infile in glob.glob(os.path.join(path,'report_ATTRIBUTE_OP*.csv')):
    if 'OPSURVEYLEVEL2' in infile:
        prime_column = 'ops2'
    elif 'OPSURVEYLEVEL3' in infile:
        prime_column = 'ops3'
    else:
        sys.exit(errno.ENOENT)
    with open(infile, "r") as csvfile:
        reader = csv.reader(csvfile)
        report_date = 'DATE NOT FOUND'
        # import pdb; pdb.set_trace()
        for row in reader:
            foo = 0
            while foo < 1:
                if row[0][0:].find('OPSURVEYLEVEL') == 0:
                    foo = 1
                if "Date range" in row:
                    report_date = row[0][-8:]
                    break
            if foo >= 1:
                if row[0][0:].find('OPSURVEYLEVEL') == 0:
                    break
                if 'ops2' in prime_column:
                    dup_col = row[0]
                    row.insert(0,dup_col)
                    row.append(report_date)
                elif 'ops3' in prime_column:
                    row.append(report_date)
                with open('report_merge.csv', 'a') as outfile:
                    outfile.write(row)
            reader.next()
</code></pre>
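For reference, the flow described above (read the preamble, pick up the report date, then handle every row after the header) could be sketched roughly like this. It is only a sketch in Python 3, not the code from the question, and it rests on a few assumptions: the date is the last 8 characters of the "Date range" line, the header row is the first row whose first cell starts with OPSURVEYLEVEL, and `extract_rows`, `duplicate_first_column` and `SAMPLE` are made-up names for illustration. It leans on the fact that a `csv.reader` is an iterator, so a second `for` loop over the same reader resumes where the first one stopped and no manual row increment is needed.

```python
import csv
import io

# First cell of the table's header row, taken from the sample data.
HEADER_PREFIX = "OPSURVEYLEVEL"

# Shortened, hypothetical stand-in for one report file.
SAMPLE = '''"Attribute ""OPSURVEYLEVEL2_O"" [Category = ""Retail v1""]"
Date exported: 2/16/13
Date range: from 12/14/12 to 12/14/12
"Filter OpSurvey Level 2(mine): [ {OPSURVEYLEVEL2_O:""group rooms"",
OPSURVEYLEVEL2_O:publisher}]"
Total: 905
OPSURVEYLEVEL2_O,Distinct Document,% of Document,Sentiment Score
PRINT TO STORE,297,32.82,-0.1
'''


def extract_rows(lines, duplicate_first_column):
    """Return (report_date, data_rows) for one report.

    lines: any iterable of text lines, e.g. an open file.
    duplicate_first_column: hypothetical flag standing in for the
    'ops2' case, where the first column is copied to the front.
    """
    reader = csv.reader(lines)
    report_date = "DATE NOT FOUND"

    # Phase 1: walk the preamble until the header row is consumed.
    for row in reader:
        if not row:
            continue  # skip blank lines
        first_cell = row[0]
        if first_cell.startswith("Date range"):
            # Assumption: the date is the last 8 characters (MM/DD/YY).
            report_date = first_cell[-8:]
        elif first_cell.startswith(HEADER_PREFIX):
            break  # header found; the data rows follow

    # Phase 2: the same reader continues right after the header row.
    data_rows = []
    for row in reader:
        if not row:
            continue
        if duplicate_first_column:
            row.insert(0, row[0])
        row.append(report_date)
        data_rows.append(row)
    return report_date, data_rows


if __name__ == "__main__":
    date, rows = extract_rows(io.StringIO(SAMPLE), duplicate_first_column=True)
    print(date)
    for r in rows:
        print(r)
```

With real files, the result could be appended to the merged file with `csv.writer(outfile).writerows(rows)`, opening both input and output with `newline=''` as the `csv` module documentation recommends.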
Tags
<python><for-loop><while-loop>
Title
How do I cycle through a csv in python, writing lines to a new file that meet new criteria
singulars
PostAcceptedAnswerId
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
PostParentId
1. This table or related slice is empty.
PostTypePostTypeId
1. PTQuestion
UserLastEditorUserId
1. This table or related slice is empty.
UserOwnerUserId
1. USgreenwar
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
VotesPostIdCreationDate
1. This table or related slice is empty.
CommentsPostId
1. This table or related slice is empty.
