Note that there are some explanatory texts on larger screens.

plurals
  1. PONeed some clarification regarding getting started with HTML Agility Pack
    text
    copied!<p><strong>My background</strong>:</p> <p>I am a newbie when it comes to HTML scrubbing. It has been about four years since I did my only work coding for with C# for html. My other coding with C# equally a while back was for forms to manipulate data in SQL Server databases.</p> <p><strong>What I have done to try to get started with HTML Agility Pack (HAP)</strong>:</p> <p>I have spent several days trying to make sense of instructions found from various online sources about how to get started with HTML Agility Pack. Some of what I have found so far is listed below:</p> <ul> <li>www.4guysfromrolla.com/articles/011211-1.aspx </li> <li>olussier.net/2010/03/30/easily-parse-html-documents-in-csharp/ </li> <li>stackoverflow.com/questions/846994/how-to-use-html-agility-pack </li> <li>shatalov.su/en/articles/web/parser_1.php </li> <li>still more referred to below...</li> </ul> <p><strong>My Results so far</strong>:</p> <p>I have found the material to be quite confusing with each source seeming to tell me something different. All my attempts have come to dead ends.</p> <hr> <p>So that you can efficiently sort out my confusion and reply to my specific situation I will describe in three sections below my project, my environment and my questions;</p> <p><strong>My Project</strong></p> <p>I am tasked with creating a process to scrub data from html files. I know the files well. The files will reside on the file system on local on the machine. The html file(s) will be created elsewhere by a process we do not own and will be placed in the local folder I just referred to above. (FYI - Though it is not a part of my question, I expect to create a project or app that will be run on a schedule to perform the scrubbing task and then input the collected data into a database table.)</p> <p><strong>My Environment</strong></p> <p>As stated above the html file(s) to be processed will reside on the local machine. I have newly installed Visual Studio 2010 Professional on this machine to code for this project. The HTML Agility Pack is now accessible to this machine on a file share.</p> <p>Under REGEIT: HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\NET Framework Setup\NDP are listed the following indicating the version of .NET framework installed on this machine;</p> <ul> <li>CDF</li> <li>V2.0.50727</li> <li>V3.0</li> <li>V3.5</li> <li>V4</li> <li>V4.0</li> </ul> <p><strong>My Questions</strong></p> <p>1.) I am told by some sites to download HTML Agility Pack and to use the file "HtmlAgilityPack.dll," however the zip file contains nine folders, each with a different copy of this file. Which one do I want?</p> <p>Here are the names of the folders;</p> <ul> <li>Net20</li> <li>Net40</li> <li>Net40-client</li> <li>Net45</li> <li>sl3-wp</li> <li>sl4</li> <li>sl4-windowsphone71</li> <li>sl5</li> <li>winrt45</li> </ul> <p>2.) An answer to a forum question “How to I use the HTML Agility Pack” at stackoverflow.com/questions/846994/how-to-use-html-agility-pack instructs the questioner to “Download and build the HTML Agility Pack Solution”, and directs the questioner to the site htmlagilitypack.codeplex.com which then has a link to nuget.org/packages/HtmlAgilityPack which says to ‘install’ the HTMLAgilityPack by running the command “PM> Install-Package HtmlAgilityPack” in the “Package Manager Console”</p> <p>What does all this mean? Other sites say to bout the dll in the bin folder. What is that telling me to do? Please explain with more detailed to get me started.</p> <p>3.) Assuming I am using C# what kind of project should I create?</p> <p>4.) Please direct me to any other resources that you believe is applicable to my project.</p>
 

Querying!

 
Guidance

SQuiL has stopped working due to an internal error.

If you are curious you may find further information in the browser console, which is accessible through the devtools (F12).

Reload