
Extracting head tag
<p>I have tried many things, but I couldn't extract the content of the <code>head</code> elements. Can anyone help?</p>
<p><strong>Original XML:</strong> <a href="https://dl.dropbox.com/u/3482709/English_sense_induction.xml.zip" rel="nofollow">https://dl.dropbox.com/u/3482709/English_sense_induction.xml.zip</a></p>
<p>Here is the text:</p>
<pre><code>&lt;?xml version="1.0" encoding="UTF-8"?&gt;
&lt;!DOCTYPE corpus SYSTEM "sense-induction.dtd"&gt;
&lt;corpus lang="en"&gt;
&lt;lexelt item="explain.v"&gt;
&lt;instance id="explain.v.4" corpus="wsj"&gt;
For OPEC , that 's ideal . The resulting firm prices and stability `` will allow both producers and consumers to plan confidently , '' says Saudi Arabian Oil Minister Hisham Nazer . OPEC Secretary-General Subroto &lt;head&gt; explains &lt;/head&gt; : Consumers offer security of markets , while OPEC provides security of supply . `` This is an opportune time to find mutual ways { to prevent } price shocks from happening again , '' he says . To promote this balance , OPEC now is finally confronting a long-simmering internal problem .
&lt;/instance&gt;
&lt;instance id="explain.v.10" corpus="wsj"&gt;
and given the right conditions , sympathetic to creating some form of life . Surely at some other cosmic address a Gouldoid creature would have risen out of the ooze to &lt;head&gt; explain &lt;/head&gt; why , paleontologically speaking , `` it is , indeed , a wonderful life . '' Mr. Holt is a columnist for the Literary Review in London .
&lt;/instance&gt;
&lt;instance id="explain.v.76" corpus="wsj"&gt;
`` You ca n't build on your hit-and-miss five-seventeen '' . `` What are you playing '' ? ? Owen asked . `` I 'm just logging '' , the cowboy &lt;head&gt; explained &lt;/head&gt; . `` I keep all these plays in this little black book , and I watch over a twelve-hour period to find out what numbers are repeating . But roulette 's not my game .
&lt;/instance&gt;
&lt;/lexelt&gt;
&lt;lexelt item="position.n"&gt;
&lt;instance id="position.n.288" corpus="wsj"&gt;
But not everybody was making money . The carnage on the Chicago Board Options Exchange , the nation 's major options market , was heavy after the trading in S&amp;amp;P 100 stock-index options was halted Friday . Many market makers in the S&amp;amp;P 100 index options contract had bullish &lt;head&gt; positions &lt;/head&gt; Friday ,
&lt;/instance&gt;
&lt;instance id="position.n.123" corpus="wsj"&gt;
An explosion at the Microbiology and Virology Institute in Sverdlovsk released anthrax germs that caused a significant number of deaths . Since Mr. Shevardnadze did not address this topic before the Supreme Soviet , the Soviet Union 's official &lt;head&gt; position &lt;/head&gt; remains that the anthrax deaths were caused by
&lt;/instance&gt;
&lt;/lexelt&gt;
&lt;/corpus&gt;
</code></pre>
<p><strong>EDIT</strong></p>
<p>The problem was that I forgot to pass <code>'xml'</code> as the second argument. The solution is <code>soup = BeautifulSoup(xml_data, 'xml')</code>.</p>
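<p>For comparison, here is a minimal sketch that pulls out the same <code>head</code> text with Python's standard-library <code>xml.etree.ElementTree</code> instead of BeautifulSoup. This is a swapped-in technique, not the fix described above. The sample data is a shortened copy of the corpus, and the helper name <code>extract_heads</code> is hypothetical.</p>

```python
import xml.etree.ElementTree as ET

# Shortened sample in the same shape as the corpus in the question
# (assumption: the full file follows this structure throughout).
xml_data = """<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE corpus SYSTEM "sense-induction.dtd">
<corpus lang="en">
<lexelt item="explain.v">
<instance id="explain.v.4" corpus="wsj">
OPEC Secretary-General Subroto <head> explains </head> : Consumers offer security of markets .
</instance>
</lexelt>
<lexelt item="position.n">
<instance id="position.n.123" corpus="wsj">
the Soviet Union 's official <head> position </head> remains that the anthrax deaths were caused by
</instance>
</lexelt>
</corpus>
"""


def extract_heads(xml_text):
    """Return (instance id, head word) pairs in document order."""
    # Parse from bytes so the encoding declaration in the prolog is honoured.
    root = ET.fromstring(xml_text.encode("utf-8"))
    results = []
    for instance in root.iter("instance"):
        head = instance.find("head")
        # Skip instances with no <head> child or an empty one.
        if head is not None and head.text:
            results.append((instance.get("id"), head.text.strip()))
    return results


for instance_id, word in extract_heads(xml_data):
    print(instance_id, word)
```

<p>Because the document is parsed as XML rather than HTML, <code>head</code> is treated as an ordinary element name, which is the same effect the <code>'xml'</code> argument has in the BeautifulSoup call.</p>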
 
