StackOverflow2013


plurals

PO
primarykey
Id
15516238
data
AcceptedAnswerId
0
AnswerCount
0
ClosedDate
CommentCount
0
CommunityOwnedDate
CreationDate
2013-03-20T05:47:36.257
FavoriteCount
0
LastActivityDate
2013-03-20T05:47:36.257
LastEditDate
LastEditorUserId
0
OwnerUserId
257449
ParentId
15501978
PostTypeId
2
Score
1
ViewCount
0
LastEditorDisplayName
text
Body
I have the following snippets. The first solution uses <code>groupBy</code> to group the entrances that belong to the same station. It does not assume that the rows are sorted. Although it reads the file only once, it effectively makes three passes over the data: one to read everything into memory, one for <code>groupBy</code>, and one to create the stations. The code for the <code>Row</code> extractor is at the end.
<pre><code>val stations = {
  val file = new java.io.File("StationEntrances.csv")
  val reader = com.github.tototoshi.csv.CSVReader.open(file)
  val byStation = reader
    .all      // read all in memory
    .drop(1)  // drop header
    .groupBy {
      // key each row by its first three columns
      case List(division, line, station, _*) => (division, line, station)
    }
  reader.close
  byStation.values.toList map { rows =>
    // every row of a group contributes one entrance
    val entrances = rows map { case Row(_, _, _, _, entrance) => entrance }
    // the station-level fields are taken from the first row of the group
    rows.head match {
      case Row(division, line, station, routes, _) =>
        Station(
          division, line, station,
          routes.toList.filter(_ != ""),
          entrances)
    }
  }
}
</code></pre>
The second solution assumes that the rows are sorted and should be faster, as it makes only one pass and builds the result list as it reads the file.
<pre><code>val stations2 = {
  import collection.mutable.ListBuffer

  def processByChunk(iter: Iterator[Seq[String]], acc: ListBuffer[Station])
      : List[Station] = {
    if (!iter.hasNext) acc.toList
    else {
      val head = iter.next
      // the first three columns identify the station
      val marker = head.take(3)
      // consecutive rows with the same marker belong to the same station
      val (rows, rest) = iter.span(_ startsWith marker)
      val entrances = (head :: rows.toList) map {
        case Row(_, _, _, _, entrance) => entrance
      }
      val station = head match {
        case Row(division, line, station, routes, _) =>
          Station(
            division, line, station,
            routes.toList.filter(_ != ""),
            entrances)
      }
      processByChunk(rest, acc += station)
    }
  }

  val file = new java.io.File("StationEntrances.csv")
  val reader = com.github.tototoshi.csv.CSVReader.open(file)
  val stations = processByChunk(reader.iterator.drop(1), ListBuffer())
  reader.close
  stations
}
</code></pre>
I have created a dedicated extractor to get the routes/entrances from a given line.
I think it makes the code more readable. Also, if you are dealing with a list, calling <code>fields(0)</code> through <code>fields(25)</code> is not optimal, since each call has to traverse the list; the extractor avoids this. Most Java CSV parsers return an <code>Array[String]</code>, so indexing is usually not an issue there. Finally, CSV parsing usually doesn't return null strings, so you may want to use <code>if (adaNotes == "") None else Some(adaNotes)</code> instead of <code>Option(adaNotes)</code>.
<pre><code>object Row {
  def unapply(s: Seq[String]) = s match {
    case List(division, line, station, rest @ _*) =>
      // the first 11 remaining columns are routes; the other 12 describe the entrance
      val (routes,
        List(ada, adaNotes, freeCrossover, entranceType,
          entry, exitOnly, entranceStaffing, northSouthStreet, eastWestStreet,
          corner, latitude, longitude)) = rest splitAt 11 // 11 routes
      Some((
        division, line, station, routes,
        Entrance(
          ada.toBoolean, Option(adaNotes),
          freeCrossover.toBoolean, entranceType,
          entry == "YES", exitOnly == "YES",
          entranceStaffing, northSouthStreet, eastWestStreet, corner,
          latitude.toInt, longitude.toInt)))
    case _ => None
  }
}
</code></pre>
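As a small illustration of the note about empty strings, here is a minimal sketch of a helper that maps both <code>null</code> and <code>""</code> to <code>None</code>. The names <code>EmptyToNone</code> and <code>nonEmptyOption</code> are hypothetical and not part of the code above; the sketch only assumes that missing notes arrive as empty strings.

```scala
object EmptyToNone {
  // Hypothetical helper (an assumption for illustration, not from the original answer).
  // Option(s) turns null into None; filter(_.nonEmpty) also turns "" into None.
  def nonEmptyOption(s: String): Option[String] =
    Option(s).filter(_.nonEmpty)
}
```

Under that assumption, <code>Option(adaNotes)</code> in the extractor could be swapped for <code>EmptyToNone.nonEmptyOption(adaNotes)</code>, which behaves like the <code>if (adaNotes == "") None else Some(adaNotes)</code> expression for non-null input.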
Tags
Title
singulars
PostAcceptedAnswerId
1. This table or related slice is empty.
PostParentId
1. POOptimizing a denormalization of a csv file
 singulars
 PostTypePostTypeId
 PTQuestion
PostTypePostTypeId
1. PTAnswer
UserLastEditorUserId
1. This table or related slice is empty.
UserOwnerUserId
1. UShuynhjl
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. POOptimizing a denormalization of a csv file
 singulars
 PostTypePostTypeId
 PTQuestion
PostsParentIdCreationDate
1. This table or related slice is empty.
VotesPostIdCreationDate
1. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTUpMod
2. VO
 singulars
 PostPostId
 PO
 UserUserId
 This table or related slice is empty.
 VoteTypeVoteTypeId
 VTAcceptedByOriginator
CommentsPostId
1. This table or related slice is empty.
