Note that there are some explanatory texts on larger screens.

plurals
  1. POUnable to parse value from HTML using jsoup
    text
    copied!<p>I'm relatively new to using jsoup, and I can't seem to find the correct query to parse out the value I'm looking for. The HTML is as follows.</p> <p></p> <pre><code> &lt;img src='http://rootzwiki.com/public/style_images/ginger/t_unread.png' alt='New Replies' /&gt;&lt;br /&gt; &lt;/a&gt; &lt;/td&gt; &lt;td class='col_f_content '&gt; &lt;h4&gt;&lt;a id="tid-link-12251" href="http://rootzwiki.com/topic/12251-romlte-rootzboat-403-v61/" title='View topic, started 17 December 2011 - 09:32 AM' class='topic_title'&gt;[ROM][LTE] RootzBoat 4.0.3 V6.1&lt;/a&gt;&lt;/h4&gt; &lt;br /&gt; &lt;span class='desc lighter blend_links'&gt; Started by &lt;a hovercard-ref="member" hovercard-id="5" class="_hovertrigger url fn " href='http://rootzwiki.com/user/5-birdman/'&gt;birdman&lt;/a&gt;, 17 Dec 2011 &lt;/span&gt; &lt;ul class='mini_pagination'&gt; &lt;li&gt;&lt;a href="http://rootzwiki.com/topic/12251-romlte-rootzboat-403-v61/" title='Go to page 1'&gt;1&lt;/a&gt;&lt;/li&gt; &lt;li&gt;&lt;a href="http://rootzwiki.com/topic/12251-romlte-rootzboat-403-v61/page__st__10" title='Go to page 2'&gt;2&lt;/a&gt;&lt;/li&gt; &lt;li&gt;&lt;a href="http://rootzwiki.com/topic/12251-romlte-rootzboat-403-v61/page__st__20" title='Go to page 3'&gt;3&lt;/a&gt;&lt;/li&gt; &lt;li&gt;&lt;a href="http://rootzwiki.com/topic/12251-romlte-rootzboat-403-v61/page__st__1990" title='Go to page 200'&gt;200 &amp;rarr;&lt;/a&gt;&lt;/li&gt; &lt;/ul&gt; &lt;/td&gt; &lt;td class='col_f_preview __topic_preview'&gt; &lt;a href='http://rootzwiki.com/topic/12251-romlte-rootzboat-403-v61/' class='expander closed' title='Preview this topic'&gt;&amp;nbsp;&lt;/a&gt; &lt;/td&gt; &lt;td class='col_f_views desc blend_links'&gt; &lt;ul&gt; &lt;li&gt; &lt;span class='ipsBadge ipsBadge_orange'&gt;Hot&lt;/span&gt;&amp;nbsp; &lt;a href="http://rootzwiki.com/index.php?app=forums&amp;amp;module=extras&amp;amp;section=stats&amp;amp;do=who&amp;amp;t=12251" onclick="return ipb.forums.retrieveWhoPosted( 12251 );"&gt;1,999 replies&lt;/a&gt; &lt;/li&gt; &lt;li class='views desc'&gt;180,213 views&lt;/li&gt; &lt;/ul&gt; &lt;/td&gt; &lt;td class='col_f_post'&gt; &lt;a href='http://rootzwiki.com/user/49940-jakeday/' class='ipsUserPhotoLink left'&gt; &lt;img src='http://rootzwiki.com/uploads/profile/photo-thumb-49940.jpg' class='ipsUserPhoto ipsUserPhoto_mini' /&gt; &lt;/a&gt; &lt;ul class='last_post ipsType_small'&gt; &lt;li&gt;&lt;a hovercard-ref="member" hovercard-id="49940" class="_hovertrigger url fn " href='http://rootzwiki.com/user/49940-jakeday/'&gt;jakeday&lt;/a&gt;&lt;/li&gt; &lt;li&gt; &lt;a href='http://rootzwiki.com/topic/12251-romlte-rootzboat-403-v61/page__view__getlastpost' title='Go to last post'&gt;Today, 04:20 AM&lt;/a&gt; &lt;/li&gt; &lt;/ul&gt; &lt;/td&gt; </code></pre> <p></p> <p>I need to parse out <code>birdman</code> from there. I know that once I've defined the element, I can get "birdman" out with <code>author.text();</code>, but I cant figure out how to define the author element. I thought perhaps the following block of code would work, but as I mentioned, I'm pretty new to jsoup and html and it obviously didnt work. Theres nothing wrong with the connection, and jsoup is working for the other values I parsed out.</p> <pre><code> TitleResults titleArray = new TitleResults(); Document doc = null; try { doc = Jsoup.connect(Constants.FORUM).get(); } catch (IOException e) { e.printStackTrace(); } Elements threads = doc.select(".topic_title"); for (Element thread : threads) { titleArray = new TitleResults(); //Thread title threadTitle = thread.text(); titleArray.setItemName(threadTitle); //Thread link String threadStr = thread.attr("abs:href"); String endTag = "/page__view__getnewpost"; //trim link threadStr = new String(threadStr.replace(endTag, "")); threadArray.add(threadStr); titleArray.setAuthorDate("Author/Date"); results.add(titleArray); } Elements authors = doc.select("a[hovercard-ref]"); for (Element author : authors) { if (author.attr("abs:href").contains("/user/")){ Log.d("POC", "SUCCESS " + author.attr("abs:href")); } else { Log.d("POC", "FAILURE " + author.text()); } } } </code></pre>
 

Querying!

 
Guidance

SQuiL has stopped working due to an internal error.

If you are curious you may find further information in the browser console, which is accessible through the devtools (F12).

Reload