StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

POpreserve formatting when updating xml file with groovy
primarykey
Id
20690526
data
AcceptedAnswerId
20720694
AnswerCount
2
ClosedDate
CommentCount
5
CommunityOwnedDate
CreationDate
2013-12-19T19:56:15.400
FavoriteCount
4
LastActivityDate
2015-05-05T07:01:54.520
LastEditDate
LastEditorUserId
0
OwnerUserId
701303
ParentId
0
PostTypeId
1
Score
12
ViewCount
4518
LastEditorDisplayName
text
Body
<p>I have a large number of XML files that contain URLs. I'm writing a groovy utility to find each URL and replace it with an updated version.</p> <p>Given example.xml:</p> <pre><code><?xml version="1.0" encoding="UTF-8"?> <page> <content> <section> <link> <url>/some/old/url</url> </link> <link> <url>/some/old/url</url> </link> </section> <section> <link> <url> /a/different/old/url?with=specialChars&amp;escaped=true </url> </link> </section> </content> </page> </code></pre> <p>Once the script has run, example.xml should contain:</p> <pre><code><?xml version="1.0" encoding="UTF-8"?> <page> <content> <section> <link> <url>/a/new/and/improved/url</url> </link> <link> <url>/a/new/and/improved/url</url> </link> </section> <section> <link> <url> /a/different/new/and/improved/url?with=specialChars&amp;stillEscaped=true </url> </link> </section> </content> </page> </code></pre> <p>This is easy to do using groovy's excellent xml support, except that I want to <strong>change the URLs and nothing else</strong> about the file.</p> <p>By that I mean:</p> <ul> <li>whitespace must not change (files might contain spaces, tabs, or both)</li> <li>comments must be preserved</li> <li>windows vs. unix-style line separators must be preserved</li> <li>the xml declaration at the top must not be added or removed</li> <li>attributes in tags must retain their order</li> </ul> <p>So far, after trying many combinations of XmlParser, DOMBuilder, XmlNodePrinter, XmlUtil.serialize(), and so on, I've landed on reading each file line-by-line and applying an ugly hybrid of the xml utilities and regular expressions.</p> <p>Reading and writing each file:</p> <pre><code>files.each { File file -> def lineEnding = file.text.contains('\r\n') ? '\r\n' : '\n' def newLineAtEof = file.text.endsWith(lineEnding) def lines = file.readLines() file.withWriter { w -> lines.eachWithIndex { line, index -> line = update(line) w.write(line) if (index < lines.size-1) w.write(lineEnding) else if (newLineAtEof) w.write(lineEnding) } } } </code></pre> <p>Searching for and updating URLs within a line:</p> <pre><code>def matcher = (line =~ urlTagRegexp) //matches a <url> element and its contents matcher.each { groups -> def urlNode = new XmlParser().parseText(line) def url = urlNode.text() def newUrl = translate(url) if (newUrl) { urlNode.value = newUrl def replacement = nodeToString(urlNode) line = matcher.replaceAll(replacement) } } def nodeToString(node) { def writer = new StringWriter() writer.withPrintWriter { printWriter -> def printer = new XmlNodePrinter(printWriter) printer.preserveWhitespace = true printer.print(node) } writer.toString().replaceAll(/[\r\n]/, '') } </code></pre> <p>This mostly works, except it can't handle a tag split over multiple lines, and messing with newlines when writing the files back out is cumbersome.</p> <p>I'm new to groovy, but I feel like there must be a groovier way of doing this.</p>
Tags
<xml><regex><groovy>
Title
preserve formatting when updating xml file with groovy
singulars
PostAcceptedAnswerId
1. PO
  singulars
  PostTypePostTypeId
  PTAnswer
PostParentId
1. This table or related slice is empty.
PostTypePostTypeId
1. PTQuestion
UserLastEditorUserId
1. This table or related slice is empty.
UserOwnerUserId
1. USAlex Wittig
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. PO
  singulars
  PostTypePostTypeId
  PTAnswer
VotesPostIdCreationDate
1. VO
  singulars
  PostPostId
  POpreserve formatting when updating xml file with groovy
  UserUserId
  This table or related slice is empty.
  VoteTypeVoteTypeId
  VTUpMod
2. VO
  singulars
  PostPostId
  POpreserve formatting when updating xml file with groovy
  UserUserId
  This table or related slice is empty.
  VoteTypeVoteTypeId
  VTUpMod
3. VO
  singulars
  PostPostId
  POpreserve formatting when updating xml file with groovy
  UserUserId
  USakhikhl
  VoteTypeVoteTypeId
  VTFavorite
CommentsPostId

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.