StackOverflow2013

Note that there are some explanatory texts on larger screens.

plurals

POHow to read an XML input file, manipulate some nodes (remove and rename some) and write the output to a new XML output file?
primarykey
Id
8732868
data
AcceptedAnswerId
28120844
AnswerCount
3
ClosedDate
CommentCount
0
CommunityOwnedDate
CreationDate
2012-01-04T19:34:40.520
FavoriteCount
5
LastActivityDate
2018-06-04T09:31:39.353
LastEditDate
2018-06-04T09:31:39.353
LastEditorUserId
573546
OwnerUserId
984532
ParentId
0
PostTypeId
1
Score
4
ViewCount
3190
LastEditorDisplayName
text
Body
I need to read an XML file from internet and re-shape it. Here is the XML file and the code I have so far. <pre><code>library(XML) url='http://ClinicalTrials.gov/show/NCT00001400?displayxml=true' doc = xmlParse(url,useInternalNode=TRUE) </code></pre> I was able to use some functions within the XML package with sucess(e.g., getNodeSet), but I am not an expert and there are some examples on the internet but I was not able to crack this problem myself. I also know some XPath but this was 4 years ago and I am not an expert on sapply and similar functions. But my goal is this: <ol> <li>I need to remove a whole set of XML children branches about location, for example: <code><location> ... anything </location></code>. There can be multiple nodes with location data. I simply don't need that detail in the output. The XML file above always complies to an XSD schema. The root node is called <code><clinical_study></code>.</li> <li>The resulted simplified file should be written into a new XML file called "data-changed.xml".</li> <li>I also need to rename and move one branch from old nested place of <code><eligibility> <criteria> <textblock> Inclusion criteria are xyz </textblock/>...</code></li> <li>In new output ("data-changed.xml") the structure should say a different XML node and be directly under root node: <code><eligibility_criteria> Inclusion criteria are xyz </eligibility_criteria></code></li> </ol> So I need to: <ul> <li>read the XML into memory</li> <li>manipulate the tree (prune it somewhere)</li> <li>move some XML nodes to a new place and under a new name and </li> <li>write the resulting XML output file. </li> </ul> Any ideas are greatly appreciated? Also, if you know about a nice (recent !) tutorial on XML parsing within R (or book chapter which tackles it, please share the reference). (I read the vignettes by Duncan and these are too advanced (too concise)). 
Tags
<xml><r>
Title
How to read an XML input file, manipulate some nodes (remove and rename some) and write the output to a new XML output file?
singulars
PostAcceptedAnswerId
1. This table or related slice is empty.
PostParentId
1. This table or related slice is empty.
PostTypePostTypeId
1. PTQuestion
UserLastEditorUserId
1. USJ. Win.
UserOwnerUserId
1. USuserJT
plurals
PostLinksPostIdRelatedPostId
1. This table or related slice is empty.
PostLinksRelatedPostIdPostId
1. This table or related slice is empty.
PostsAcceptedAnswerId
1. This table or related slice is empty.
PostsParentIdCreationDate
1. PO
 singulars
 PostTypePostTypeId
 PTAnswer
2. PO
 singulars
 PostTypePostTypeId
 PTAnswer
VotesPostIdCreationDate
1. This table or related slice is empty.
CommentsPostId
1. This table or related slice is empty.

Querying!

Guidance

A row detail

Detail views are divided into sections. All the information in the data section comes from columns in the selected row. The other sections display data from other, related rows.

Related data can be related in a to-one or a to-many fashion. Captions of data related in a to-many fashion link to a list view showing a filtered view of the table.

Try moving around until you find a non-empty to-many entry and click on the label to get to one. You can move back to the root by clicking on the database name in the header.