Here is a very good class for parsing HTML.
It saved me a lot of time.
Project address
For example, here is how you would fix all hrefs in an HTML file:
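HtmlDocument doc = new HtmlDocument();
doc.Load("file.htm"); // placeholder file name; the document must be loaded before it is queried
foreach (HtmlNode link in doc.DocumentElement.SelectNodes("//a[@href]"))
{
    HtmlAttribute att = link["href"];
    att.Value = FixLink(att); // FixLink is your own method that returns the corrected link
}
doc.Save("file.htm"); // write the modified HTML back to disk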
If you want to participate in the project (that's the whole purpose of putting the source there, right?), use the forums or drop me a note (simon underscore mourier at hotmail dot com)!
Happy coding, scraping, scanning, html-ing, xhtml-ing, etc... :^)
Simon Mourier.