HtmlNode Class

The HtmlNode class represents a single DOM element in an HTML or XML document.
Inheritance Hierarchy
System.Object
  IronWebScraper.HtmlNode

Namespace:  IronWebScraper
Assembly:  IronWebScraper (in IronWebScraper.dll) Version: 4.0.4.25470 (4.0.4.3)
Syntax
C#: public class HtmlNode
VB: Public Class HtmlNode

The HtmlNode type exposes the following members.

Properties
Attributes
  Gets the attributes of the HTML node (such as href, class, style, etc.).
ChildNodes
  An array of nodes nested within the HtmlNode.
InnerHtml
  Gets the HtmlNode's inner HTML as a string. InnerHtml does not include the current tag's own markup, only the markup nested inside it.
InnerText
  Gets the text content of the HtmlNode as might be shown to a user. Text will be trimmed, with all groups of whitespace turned into a single space character.
InnerTextClean
  Gets the text content of the HtmlNode as might be shown to a user.
NodeName
  Name of the HtmlNode, such as "div", "p", or "#text".
NodeType
  Type of the HtmlNode. Possible values are: "ELEMENT_NODE", "TEXT_NODE", "CDATA_SECTION_NODE", "COMMENT_NODE", "DOCUMENT_NODE", "DOCUMENT_TYPE_NODE", "DOCUMENT_FRAGMENT_NODE".
OuterHtml
  Gets the HtmlNode's outer HTML as a string. OuterHtml includes the tag's own markup.
ParentNode
  Gets the parent (enclosing) tag for this HtmlNode.
TextContent
  Gets the text content of the HtmlNode and all of its descendants.
TextContentClean
  Gets the text content of the HtmlNode, trimmed, with all groups of whitespace turned into a single space character. TextContent includes the content of non-printable elements such as style and script nodes.
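
The "trimmed, with all groups of whitespace turned into a single space character" behaviour described above can be illustrated without the library. The following is a minimal sketch, assuming the normalization is exactly a trim plus a collapse of whitespace runs; the WhitespaceDemo class and its Collapse helper are hypothetical names used for illustration and are not part of IronWebScraper.

```csharp
using System;
using System.Text.RegularExpressions;

public static class WhitespaceDemo
{
    // Hypothetical helper (not an IronWebScraper API): replaces every run of
    // whitespace (spaces, tabs, newlines) with one space, then trims the ends.
    public static string Collapse(string raw)
    {
        return Regex.Replace(raw, @"\s+", " ").Trim();
    }

    public static void Main()
    {
        // Raw text as it might appear between tags in an HTML source file.
        string raw = "  Hello \n\t  world  ";

        // Brackets make leading and trailing whitespace visible.
        Console.WriteLine("[" + raw + "]");
        Console.WriteLine("[" + Collapse(raw) + "]"); // prints "[Hello world]"
    }
}
```

Under that assumption, a raw text property would correspond to the first line printed and a trimmed one to the second.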
Methods
Css
  Uses CSS selectors to find all child nodes matching the selector. This works in the same way as $('.ClassName') in jQuery or querySelectorAll() in JavaScript.
CssExists
  Uses CSS selectors to find if there are any matching nodes within the Response Document. This works in the same way as $('.ClassName').length > 0 in jQuery or querySelectorAll().length > 0 in JavaScript.
Equals
  (Inherited from Object.)
GetAttribute
  Gets a single attribute value from the HtmlNode. Returns null if that attribute is not present.
GetElementById
  Synonym of JavaScript's getElementById function. Searches inside the current HtmlNode.
GetElementsByTagName
  Synonym of JavaScript's getElementsByTagName function. Searches inside the current HtmlNode.
GetHashCode
  (Inherited from Object.)
GetType
  (Inherited from Object.)
HasAtribute
  Tests if the HtmlNode has a given attribute such as "class" or "href".
QuerySelector
  Synonym of JavaScript's querySelector function. Searches inside the current HtmlNode.
QuerySelectorAll
  Synonym of JavaScript's querySelectorAll function. Searches inside the current HtmlNode.
ToString
  (Inherited from Object.)
XPath
  Queries the HtmlNode to return all descendant nodes matching an XPath expression.
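
As a rough illustration of how some of these members might be combined, the sketch below walks the list items of a page and prints each link found inside them. It rests on several assumptions that this page does not confirm: that HtmlNode objects are obtained from Response.Css inside a WebScraper subclass (as in the library's published tutorials), that Css, CssExists, and GetAttribute each take a single string argument, and that Css returns an enumerable of HtmlNode. The class name, URL, and selectors are placeholders.

```csharp
using System;
using IronWebScraper;

// Hypothetical scraper: the class name, URL, and selectors are placeholders.
public class NodeDemoScraper : WebScraper
{
    public override void Init()
    {
        // Queue a single page; Parse is assumed to be called with its Response.
        this.Request("https://example.com/", Parse);
    }

    public override void Parse(Response response)
    {
        // Assumption: Response.Css yields HtmlNode objects for each match.
        foreach (var item in response.Css("li"))
        {
            // CssExists: skip list items that contain no anchor tags.
            if (!item.CssExists("a"))
            {
                continue;
            }

            foreach (var link in item.Css("a"))
            {
                // GetAttribute is documented to return null when absent.
                string href = link.GetAttribute("href");
                if (href == null)
                {
                    continue;
                }

                Console.WriteLine(link.TextContentClean + " -> " + href);
            }
        }
    }

    public static void Main()
    {
        // Assumption: Start() runs the scraper until its request queue is empty.
        new NodeDemoScraper().Start();
    }
}
```

Checking CssExists before calling Css is optional here; it is shown only to contrast the existence test with the query that returns nodes.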
See Also