class Nokogiri::XML::Node
Nokogiri::XML::Node is the primary API you’ll use to interact with your Document.
Attributes¶ ↑
A Nokogiri::XML::Node may be treated similarly to a hash with regard to attributes. For example:
node = Nokogiri::XML::DocumentFragment.parse("<a href='#foo' id='link'>link</a>").at_css("a") node.to_html # => "<a href=\"#foo\" id=\"link\">link</a>" node['href'] # => "#foo" node.keys # => ["href", "id"] node.values # => ["#foo", "link"] node['class'] = 'green' # => "green" node.to_html # => "<a href=\"#foo\" id=\"link\" class=\"green\">link</a>"
See the method group entitled Working With Node Attributes at Node for the full set of methods.
Navigation¶ ↑
Nokogiri::XML::Node also has methods that let you move around your tree:
- parent, children, next, previous
-
Navigate up, down, or through siblings.
See the method group entitled Traversing Document Structure at Node for the full set of methods.
Serialization¶ ↑
When printing or otherwise emitting a document or a node (and its subtree), there are a few methods you might want to use:
- content, text, inner_text, to_str
-
These methods will all **emit plaintext**, meaning that entities will be replaced (e.g., +<+ will be replaced with +<+), meaning that any sanitizing will likely be un-done in the output.
to_s,to_xml,to_html,inner_html-
These methods will all **emit properly-escaped markup**, meaning that it’s suitable for consumption by browsers, parsers, etc.
See the method group entitled Serialization and Generating Output at Node for the full set of methods.
Searching¶ ↑
You may search this node’s subtree using methods like xpath and css.
See the method group entitled Searching via XPath or CSS Queries at Node for the full set of methods.
Constants
- ATTRIBUTE_DECL
-
Attribute declaration type
- ATTRIBUTE_NODE
-
Attribute node type
- CDATA_SECTION_NODE
-
CDATAnode type, seeNokogiri::XML::Node#cdata? - COMMENT_NODE
-
Comment node type, see
Nokogiri::XML::Node#comment? - DOCB_DOCUMENT_NODE
-
DOCB document node type
- DOCUMENT_FRAG_NODE
-
Documentfragment node type - DOCUMENT_NODE
-
Documentnode type, seeNokogiri::XML::Node#xml? - DOCUMENT_TYPE_NODE
-
Documenttype node type - DTD_NODE
-
DTDnode type - ELEMENT_DECL
-
Element declaration type
- ELEMENT_NODE
-
Element node type, see
Nokogiri::XML::Node#element? - ENTITY_DECL
-
Entity declaration type
- ENTITY_NODE
-
Entity node type
- ENTITY_REF_NODE
-
Entity reference node type
- HTML_DOCUMENT_NODE
-
HTMLdocument node type, seeNokogiri::XML::Node#html? - IMPLIED_XPATH_CONTEXTS
- NAMESPACE_DECL
-
Namespacedeclaration type - NOTATION_NODE
-
Notationnode type - PI_NODE
-
PI node type
- TEXT_NODE
-
Textnode type, seeNokogiri::XML::Node#text? - XINCLUDE_END
-
XInclude end type
- XINCLUDE_START
-
XInclude start type
Public Class Methods
Source
# File lib/nokogiri/xml/node.rb, line 126 def initialize(name, document) # This is intentionally empty, and sets the method signature for subclasses. end
Create a new node with name that belongs to document.
If you intend to add a node to a document tree, it’s likely that you will prefer one of the Nokogiri::XML::Node methods like add_child, add_next_sibling, replace, etc. which will both create an element (or subtree) and place it in the document tree.
Another alternative, if you are concerned about performance, is Nokogiri::XML::Document#create_element which accepts additional arguments for contents or attributes but (like this method) avoids parsing markup.
- Parameters
-
name(String) -
document(Nokogiri::XML::Document) The document to which the the returned node will belong.
- Yields
- Returns
Public Instance Methods
Source
Source
# File lib/nokogiri/xml/node.rb, line 1330 def ==(other) return false unless other return false unless other.respond_to?(:pointer_id) pointer_id == other.pointer_id end
Test to see if this Node is equal to other
Source
# File lib/nokogiri/xml/node.rb, line 1324 def accept(visitor) visitor.visit(self) end
Accept a visitor. This method calls “visit” on visitor with self.
Source
# File lib/nokogiri/xml/node.rb, line 1293 def ancestors(selector = nil) return NodeSet.new(document) unless respond_to?(:parent) return NodeSet.new(document) unless parent parents = [parent] while parents.last.respond_to?(:parent) break unless (ctx_parent = parents.last.parent) parents << ctx_parent end return NodeSet.new(document, parents) unless selector root = parents.last search_results = root.search(selector) NodeSet.new(document, parents.find_all do |parent| search_results.include?(parent) end) end
Get a list of ancestor Node for this Node. If selector is given, the ancestors must match selector
Source
# File lib/nokogiri/xml/node.rb, line 1214 def cdata? type == CDATA_SECTION_NODE end
Returns true if this is a CDATA
Source
# File lib/nokogiri/xml/node.rb, line 162 def clone(level = 1, new_parent_doc = document) super().initialize_copy_with_args(self, level, new_parent_doc) end
Clone this node.
- Parameters
-
level(optional Integer). 0 is a shallow copy, 1 (the default) is a deep copy. -
new_parent_docThe new node’s parentDocument. Defaults to the theDocumentof the current node.
- Returns
-
The new
Nokogiri::XML::Node
Source
# File lib/nokogiri/xml/node.rb, line 1209 def comment? type == COMMENT_NODE end
Returns true if this is a Comment
Source
# File lib/nokogiri/xml/node.rb, line 1284 def css_path path.split(%r{/}).filter_map do |part| part.empty? ? nil : part.gsub(/\[(\d+)\]/, ':nth-of-type(\1)') end.join(" > ") end
Get the path to this node as a CSS expression
Source
# File lib/nokogiri/xml/node.rb, line 168 def decorate! document.decorate(self) end
Decorate this node with the decorators set up in this node’s Document
Source
# File lib/nokogiri/xml/node.rb, line 1251 def description return if document.xml? Nokogiri::HTML4::ElementDescription[name] end
Fetch the Nokogiri::HTML4::ElementDescription for this node. Returns nil on XML documents and on unknown tags.
Source
# File lib/nokogiri/xml/node.rb, line 1229 def document? is_a?(XML::Document) end
Returns true if this is a Document
Source
# File lib/nokogiri/xml/node.rb, line 144 def dup(level = 1, new_parent_doc = document) super().initialize_copy_with_args(self, level, new_parent_doc) end
Duplicate this node.
- Parameters
-
level(optional Integer). 0 is a shallow copy, 1 (the default) is a deep copy. -
new_parent_doc(optionalNokogiri::XML::Document) The new node’s parentDocument. Defaults to the theDocumentof the current node.
- Returns
-
The new
Nokogiri::XML::Node
Source
# File lib/nokogiri/xml/node.rb, line 1265 def element? type == ELEMENT_NODE end
Returns true if this is an Element node
Source
# File lib/nokogiri/xml/node.rb, line 1097 def fragment(tags) document.related_class("DocumentFragment").new(document, tags, self) end
Create a DocumentFragment containing tags that is relative to this context node.
Source
# File lib/nokogiri/xml/node.rb, line 1244 def fragment? type == DOCUMENT_FRAG_NODE end
Returns true if this is a DocumentFragment
Source
# File lib/nokogiri/xml/node.rb, line 1224 def html? type == HTML_DOCUMENT_NODE end
Returns true if this is an HTML4::Document or HTML5::Document node
Source
# File lib/nokogiri/xml/node.rb, line 1279 def inner_html(*args) children.map { |x| x.to_html(*args) }.join end
Get the inner_html for this node’s Node#children
Source
# File lib/nokogiri/xml/node.rb, line 1090 def matches?(selector) ancestors.last.search(selector).include?(self) end
Returns true if this Node matches selector
Source
# File lib/nokogiri/xml/node.rb, line 1200 def namespaces namespace_scopes.each_with_object({}) do |ns, hash| prefix = ns.prefix key = prefix ? "xmlns:#{prefix}" : "xmlns" hash[key] = ns.href end end
Fetch all the namespaces on this node and its ancestors.
Note that the keys in this hash XML attributes that would be used to define this namespace, such as “xmlns:prefix”, not just the prefix.
The default namespace for this node will be included with key “xmlns”.
See also namespace_scopes
- Returns
-
Hash containing all the namespaces on this node and its ancestors. The hash keys are the namespace prefix, and the hash value for each key is the namespace URI.
Example:
doc = Nokogiri::XML(<<~EOF) <root xmlns="http://example.com/root" xmlns:in_scope="http://example.com/in_scope"> <first/> <second xmlns="http://example.com/child"/> <third xmlns:foo="http://example.com/foo"/> </root> EOF doc.at_xpath("//root:first", "root" => "http://example.com/root").namespaces # => {"xmlns"=>"http://example.com/root", # "xmlns:in_scope"=>"http://example.com/in_scope"} doc.at_xpath("//child:second", "child" => "http://example.com/child").namespaces # => {"xmlns"=>"http://example.com/child", # "xmlns:in_scope"=>"http://example.com/in_scope"} doc.at_xpath("//root:third", "root" => "http://example.com/root").namespaces # => {"xmlns:foo"=>"http://example.com/foo", # "xmlns"=>"http://example.com/root", # "xmlns:in_scope"=>"http://example.com/in_scope"}
Source
# File lib/nokogiri/xml/node.rb, line 1105 def parse(string_or_io, options = nil) ## # When the current node is unparented and not an element node, use the # document as the parsing context instead. Otherwise, the in-context # parser cannot find an element or a document node. # Document Fragments are also not usable by the in-context parser. if !element? && !document? && (!parent || parent.fragment?) return document.parse(string_or_io, options) end options ||= (document.html? ? ParseOptions::DEFAULT_HTML : ParseOptions::DEFAULT_XML) options = Nokogiri::XML::ParseOptions.new(options) if Integer === options yield options if block_given? contents = if string_or_io.respond_to?(:read) string_or_io.read else string_or_io end return Nokogiri::XML::NodeSet.new(document) if contents.empty? error_count = document.errors.length node_set = in_context(contents, options.to_i) if document.errors.length > error_count raise document.errors[error_count] unless options.recover? # TODO: remove this block when libxml2 < 2.13 is no longer supported if node_set.empty? # libxml2 < 2.13 does not obey the +recover+ option after encountering errors during # +in_context+ parsing, and so this horrible hack is here to try to emulate recovery # behavior. # # (Note that HTML4 fragment parsing seems to have been fixed in abd74186, and XML # fragment parsing is fixed in 1c106edf. Both are in 2.13.) # # Unfortunately, this means we're no longer parsing "in context" and so namespaces that # would have been inherited from the context node won't be handled correctly. This hack # was written in 2010, and I regret it, because it's silently degrading functionality in # a way that's not easily prevented (or even detected). # # I think preferable behavior would be to either: # # a. add an error noting that we "fell back" and pointing the user to turning off the # +recover+ option # b. don't recover, but raise a sensible exception # # For context and background: # - https://github.com/sparklemotion/nokogiri/issues/313 # - https://github.com/sparklemotion/nokogiri/issues/2092 fragment = document.related_class("DocumentFragment").parse(contents) node_set = fragment.children end end node_set end
Parse string_or_io as a document fragment within the context of this node. Returns a XML::NodeSet containing the nodes parsed from string_or_io.
Source
# File lib/nokogiri/xml/node.rb, line 1234 def processing_instruction? type == PI_NODE end
Returns true if this is a ProcessingInstruction node
Source
# File lib/nokogiri/xml/node.rb, line 1259 def read_only? # According to gdome2, these are read-only node types [NOTATION_NODE, ENTITY_NODE, ENTITY_DECL].include?(type) end
Is this a read only node?
Source
# File lib/nokogiri/xml/node.rb, line 1239 def text? type == TEXT_NODE end
Returns true if this is a Text node
Source
Source
# File lib/nokogiri/xml/node.rb, line 1317 def traverse(&block) children.each { |j| j.traverse(&block) } yield(self) end
Yields self and all children to block recursively.
Source
# File lib/nokogiri/xml/node.rb, line 1219 def xml? type == DOCUMENT_NODE end
Returns true if this is an XML::Document node
Protected Instance Methods
Source
# File lib/nokogiri/xml/node.rb, line 1567 def coerce(data) case data when XML::NodeSet return data when XML::DocumentFragment return data.children when String return fragment(data).children when Document, XML::Attr # unacceptable when XML::Node return data end raise ArgumentError, <<~EOERR Requires a Node, NodeSet or String argument, and cannot accept a #{data.class}. (You probably want to select a node from the Document with at() or search(), or create a new Node via Node.new().) EOERR end
Manipulating Document Structure
↑ topPublic Instance Methods
Source
# File lib/nokogiri/xml/node.rb, line 292 def <<(node_or_tags) add_child(node_or_tags) self end
Add node_or_tags as a child of this Node.
node_or_tags can be a Nokogiri::XML::Node, a ::DocumentFragment, a ::NodeSet, or a String containing markup.
Returns self, to support chaining of calls (e.g., root << child1 << child2)
Also see related method add_child.
Source
# File lib/nokogiri/xml/node.rb, line 184 def add_child(node_or_tags) node_or_tags = coerce(node_or_tags) if node_or_tags.is_a?(XML::NodeSet) node_or_tags.each { |n| add_child_node_and_reparent_attrs(n) } else add_child_node_and_reparent_attrs(node_or_tags) end node_or_tags end
Add node_or_tags as a child of this Node.
node_or_tags can be a Nokogiri::XML::Node, a ::DocumentFragment, a ::NodeSet, or a String containing markup.
Returns the reparented node (if node_or_tags is a Node), or NodeSet (if node_or_tags is a DocumentFragment, NodeSet, or String).
Also see related method +<<+.
Source
# File lib/nokogiri/xml/node.rb, line 324 def add_next_sibling(node_or_tags) raise ArgumentError, "A document may not have multiple root nodes." if parent&.document? && !(node_or_tags.comment? || node_or_tags.processing_instruction?) add_sibling(:next, node_or_tags) end
Insert node_or_tags after this Node (as a sibling).
node_or_tags can be a Nokogiri::XML::Node, a ::DocumentFragment, a ::NodeSet, or a String containing markup.
Returns the reparented node (if node_or_tags is a Node), or NodeSet (if node_or_tags is a DocumentFragment, NodeSet, or String).
Also see related method after.
Source
# File lib/nokogiri/xml/node.rb, line 307 def add_previous_sibling(node_or_tags) raise ArgumentError, "A document may not have multiple root nodes." if parent&.document? && !(node_or_tags.comment? || node_or_tags.processing_instruction?) add_sibling(:previous, node_or_tags) end
Insert node_or_tags before this Node (as a sibling).
node_or_tags can be a Nokogiri::XML::Node, a ::DocumentFragment, a ::NodeSet, or a String containing markup.
Returns the reparented node (if node_or_tags is a Node), or NodeSet (if node_or_tags is a DocumentFragment, NodeSet, or String).
Also see related method before.
Source
# File lib/nokogiri/xml/node.rb, line 354 def after(node_or_tags) add_next_sibling(node_or_tags) self end
Insert node_or_tags after this node (as a sibling).
node_or_tags can be a Nokogiri::XML::Node, a Nokogiri::XML::DocumentFragment, or a String containing markup.
Returns self, to support chaining of calls.
Also see related method add_next_sibling.
Source
# File lib/nokogiri/xml/node.rb, line 340 def before(node_or_tags) add_previous_sibling(node_or_tags) self end
Insert node_or_tags before this node (as a sibling).
node_or_tags can be a Nokogiri::XML::Node, a ::DocumentFragment, a ::NodeSet, or a String containing markup.
Returns self, to support chaining of calls.
Also see related method add_previous_sibling.
Source
# File lib/nokogiri/xml/node.rb, line 385 def children=(node_or_tags) node_or_tags = coerce(node_or_tags) children.unlink if node_or_tags.is_a?(XML::NodeSet) node_or_tags.each { |n| add_child_node_and_reparent_attrs(n) } else add_child_node_and_reparent_attrs(node_or_tags) end end
Set the content for this Node node_or_tags
node_or_tags can be a Nokogiri::XML::Node, a Nokogiri::XML::DocumentFragment, or a String containing markup.
Also see related method inner_html=
Source
# File lib/nokogiri/xml/node.rb, line 487 def content=(string) self.native_content = encode_special_chars(string.to_s) end
Set the content of this node to input.
- Parameters
-
input(String) The new content for this node. Input is considered to be raw content, and so will be entity-escaped in the final DOM string.
- Example
-
Note how entities are handled:
doc = Nokogiri::HTML::Document.parse(<<~HTML) <html> <body> <div id="first">asdf</div> <div id="second">asdf</div> HTML text_node = doc.at_css("div#first").children.first div_node = doc.at_css("div#second") value = "You & Me" text_node.content = value div_node.content = value doc.css("div").to_html # => "<div id=\"first\">You &amp; Me</div> # <div id=\"second\">You &amp; Me</div>"
For content that is already entity-escaped, use CGI::unescapeHTML to decode it:
text_node.content = CGI::unescapeHTML(value) div_node.content = CGI::unescapeHTML(value) doc.css("div").to_html # => "<div id=\"first\">You & Me</div> # <div id=\"second\">You & Me</div>"
See also: native_content=
Source
# File lib/nokogiri/xml/node.rb, line 503 def default_namespace=(url) add_namespace_definition(nil, url) end
Adds a default namespace supplied as a string url href, to self. The consequence is as an xmlns attribute with supplied argument were present in parsed XML. A default namespace set with this method will now show up in attributes, but when this node is serialized to XML an “xmlns” attribute will appear. See also namespace and namespace=
Source
# File lib/nokogiri/xml/node.rb, line 530 def do_xinclude(options = XML::ParseOptions::DEFAULT_XML) options = Nokogiri::XML::ParseOptions.new(options) if Integer === options yield options if block_given? # call c extension process_xincludes(options.to_i) end
Do xinclude substitution on the subtree below node. If given a block, a Nokogiri::XML::ParseOptions object initialized from options, will be passed to it, allowing more convenient modification of the parser options.
Source
# File lib/nokogiri/xml/node.rb, line 374 def inner_html=(node_or_tags) self.children = node_or_tags end
Set the content for this Node to node_or_tags.
node_or_tags can be a Nokogiri::XML::Node, a Nokogiri::XML::DocumentFragment, or a String containing markup.
⚠ Please note that despite the name, this method will not always parse a String argument as HTML. A String argument will be parsed with the DocumentFragment parser related to this node’s document.
For example, if the document is an HTML4::Document then the string will be parsed as HTML4 using HTML4::DocumentFragment; but if the document is an XML::Document then it will parse the string as XML using XML::DocumentFragment.
Also see related method children=
Source
# File lib/nokogiri/xml/node.rb, line 513 def namespace=(ns) return set_namespace(ns) unless ns unless Nokogiri::XML::Namespace === ns raise TypeError, "#{ns.class} can't be coerced into Nokogiri::XML::Namespace" end if ns.document != document raise ArgumentError, "namespace must be declared on the same document" end set_namespace(ns) end
Set the default namespace on this node (as would be defined with an “xmlns=” attribute in XML source), as a Namespace object ns. Note that a Namespace added this way will NOT be serialized as an xmlns attribute for this node. You probably want default_namespace= instead, or perhaps add_namespace_definition with a nil prefix argument.
Source
Source
# File lib/nokogiri/xml/node.rb, line 204 def prepend_child(node_or_tags) if (first = children.first) # Mimic the error add_child would raise. raise "Document already has a root node" if document? && !(node_or_tags.comment? || node_or_tags.processing_instruction?) first.__send__(:add_sibling, :previous, node_or_tags) else add_child(node_or_tags) end end
Add node_or_tags as the first child of this Node.
node_or_tags can be a Nokogiri::XML::Node, a ::DocumentFragment, a ::NodeSet, or a String containing markup.
Returns the reparented node (if node_or_tags is a Node), or NodeSet (if node_or_tags is a DocumentFragment, NodeSet, or String).
Also see related method add_child.
Source
# File lib/nokogiri/xml/node.rb, line 405 def replace(node_or_tags) raise("Cannot replace a node with no parent") unless parent # We cannot replace a text node directly, otherwise libxml will return # an internal error at parser.c:13031, I don't know exactly why # libxml is trying to find a parent node that is an element or document # so I can't tell if this is bug in libxml or not. issue #775. if text? replacee = Nokogiri::XML::Node.new("dummy", document) add_previous_sibling_node(replacee) unlink return replacee.replace(node_or_tags) end node_or_tags = parent.coerce(node_or_tags) if node_or_tags.is_a?(XML::NodeSet) node_or_tags.each { |n| add_previous_sibling(n) } unlink else replace_node(node_or_tags) end node_or_tags end
Replace this Node with node_or_tags.
node_or_tags can be a Nokogiri::XML::Node, a ::DocumentFragment, a ::NodeSet, or a String containing markup.
Returns the reparented node (if node_or_tags is a Node), or NodeSet (if node_or_tags is a DocumentFragment, NodeSet, or String).
Also see related method swap.
Source
# File lib/nokogiri/xml/node.rb, line 439 def swap(node_or_tags) replace(node_or_tags) self end
Swap this Node for node_or_tags
node_or_tags can be a Nokogiri::XML::Node, a ::DocumentFragment, a ::NodeSet, or a String Containing markup.
Returns self, to support chaining of calls.
Also see related method replace.
Source
# File lib/nokogiri/xml/node.rb, line 259 def wrap(node_or_tags) case node_or_tags when String context_node = parent || document new_parent = context_node.coerce(node_or_tags).first if new_parent.nil? raise "Failed to parse '#{node_or_tags}' in the context of a '#{context_node.name}' element" end when Node new_parent = node_or_tags.dup else raise ArgumentError, "Requires a String or Node argument, and cannot accept a #{node_or_tags.class}" end if parent add_next_sibling(new_parent) else new_parent.unlink end new_parent.add_child(self) self end
Wrap this Node with the node parsed from markup or a dup of the node.
- Parameters
-
markup (String) Markup that is parsed and used as the wrapper. This node’s parent, if it exists, is used as the context node for parsing; otherwise the associated document is used. If the parsed fragment has multiple roots, the first root node is used as the wrapper.
-
node (
Nokogiri::XML::Node) An element that is ‘#dup`ed and used as the wrapper.
- Returns
-
self, to support chaining.
Also see NodeSet#wrap
Example with a String argument:
doc = Nokogiri::HTML5(<<~HTML) <html><body> <a>asdf</a> </body></html> HTML doc.at_css("a").wrap("<div></div>") doc.to_html # => <html><head></head><body> # <div><a>asdf</a></div> # </body></html>
Example with a Node argument:
doc = Nokogiri::HTML5(<<~HTML) <html><body> <a>asdf</a> </body></html> HTML doc.at_css("a").wrap(doc.create_element("div")) doc.to_html # <html><head></head><body> # <div><a>asdf</a></div> # </body></html>
Serialization and Generating Output
↑ topPublic Instance Methods
Source
# File lib/nokogiri/xml/node.rb, line 1492 def canonicalize(mode = XML::XML_C14N_1_0, inclusive_namespaces = nil, with_comments = false) c14n_root = self document.canonicalize(mode, inclusive_namespaces, with_comments) do |node, parent| tn = node.is_a?(XML::Node) ? node : parent tn == c14n_root || tn.ancestors.include?(c14n_root) end end
Source
# File lib/nokogiri/xml/node.rb, line 1553 def deconstruct_keys(keys) requested_keys = DECONSTRUCT_KEYS & keys {}.tap do |values| requested_keys.each do |key| method = DECONSTRUCT_METHODS[key] || key values[key] = send(method) end end end
Returns a hash describing the Node, to use in pattern matching.
Valid keys and their values:
-
name→ (String) The name of this node, or “text” if it is aTextnode. -
namespace→ (Namespace, nil) The namespace of this node, or nil if there is no namespace. -
attributes→ (Array<Attr>) The attributes of this node. -
children→ (Array<Node>) The children of this node. 💡 Note this includes text nodes. -
elements→ (Array<Node>) The child elements of this node. 💡 Note this does not include text nodes. -
content→ (String) The contents of all the text nodes in this node’s subtree. See content. -
inner_html→ (String) The inner markup for the children of this node. Seeinner_html.
Example
doc = Nokogiri::XML.parse(<<~XML) <?xml version="1.0"?> <parent xmlns="http://nokogiri.org/ns/default" xmlns:noko="http://nokogiri.org/ns/noko"> <child1 foo="abc" noko:bar="def">First</child1> <noko:child2 foo="qwe" noko:bar="rty">Second</noko:child2> </parent> XML doc.root.deconstruct_keys([:name, :namespace]) # => {:name=>"parent", # :namespace=> # #(Namespace:0x35c { href = "http://nokogiri.org/ns/default" })} doc.root.deconstruct_keys([:inner_html, :content]) # => {:content=>"\n" + " First\n" + " Second\n", # :inner_html=> # "\n" + # " <child1 foo=\"abc\" noko:bar=\"def\">First</child1>\n" + # " <noko:child2 foo=\"qwe\" noko:bar=\"rty\">Second</noko:child2>\n"} doc.root.elements.first.deconstruct_keys([:attributes]) # => {:attributes=> # [#(Attr:0x370 { name = "foo", value = "abc" }), # #(Attr:0x384 { # name = "bar", # namespace = #(Namespace:0x398 { # prefix = "noko", # href = "http://nokogiri.org/ns/noko" # }), # value = "def" # })]}
Since v1.14.0
Source
# File lib/nokogiri/xml/node.rb, line 1364 def serialize(*args, &block) # TODO: deprecate non-hash options, see 46c68ed 2009-06-20 for context options = if args.first.is_a?(Hash) args.shift else { encoding: args[0], save_with: args[1], } end options[:encoding] ||= document.encoding encoding = Encoding.find(options[:encoding] || "UTF-8") io = StringIO.new(String.new(encoding: encoding)) write_to(io, options, &block) io.string end
Serialize Node using options. Save options can also be set using a block.
See also Nokogiri::XML::Node::SaveOptions and Serialization and Generating Output at Node.
These two statements are equivalent:
node.serialize(encoding: 'UTF-8', save_with: FORMAT | AS_XML)
or
node.serialize(encoding: 'UTF-8') do |config| config.format.as_xml end
Source
# File lib/nokogiri/xml/node.rb, line 1391 def to_html(options = {}) to_format(SaveOptions::DEFAULT_HTML, options) end
doc.to_html
See Node#write_to for a list of options. For formatted output, use Node#to_xhtml instead.
Source
# File lib/nokogiri/xml/node.rb, line 1412 def to_xhtml(options = {}) to_format(SaveOptions::DEFAULT_XHTML, options) end
Serialize this Node to XHTML using options
doc.to_xhtml(indent: 5, encoding: 'UTF-8')
See Node#write_to for a list of options
Source
# File lib/nokogiri/xml/node.rb, line 1401 def to_xml(options = {}) options[:save_with] ||= SaveOptions::DEFAULT_XML serialize(options) end
Serialize this Node to XML using options
doc.to_xml(indent: 5, encoding: 'UTF-8')
See Node#write_to for a list of options
Source
# File lib/nokogiri/xml/node.rb, line 1469 def write_html_to(io, options = {}) write_format_to(SaveOptions::DEFAULT_HTML, io, options) end
Write Node as HTML to io with options
See Node#write_to for a list of options
Source
# File lib/nokogiri/xml/node.rb, line 1440 def write_to(io, *options) options = options.first.is_a?(Hash) ? options.shift : {} encoding = options[:encoding] || options[0] || document.encoding if Nokogiri.jruby? save_options = options[:save_with] || options[1] indent_times = options[:indent] || 0 else save_options = options[:save_with] || options[1] || SaveOptions::FORMAT indent_times = options[:indent] || 2 end indent_text = options[:indent_text] || " " # Any string times 0 returns an empty string. Therefore, use the same # string instead of generating a new empty string for every node with # zero indentation. indentation = indent_times.zero? ? "" : (indent_text * indent_times) config = SaveOptions.new(save_options.to_i) yield config if block_given? encoding = encoding.is_a?(Encoding) ? encoding.name : encoding native_write_to(io, encoding, indentation, config.options) end
Serialize this node or document to io.
- Parameters
-
io(IO) An IO-like object to which the serialized content will be written. -
options(Hash) See below
- Options
-
:encoding(String or Encoding) specify the encoding of the output (defaults to document encoding) -
:indent_text(String) the indentation text (defaults to" ") -
:indent(Integer) the number of:indent_textto use (defaults to2) -
:save_with(Integer) a combination ofSaveOptionsconstants
To save with UTF-8 indented twice:
node.write_to(io, encoding: 'UTF-8', indent: 2)
To save indented with two dashes:
node.write_to(io, indent_text: '-', indent: 2)
Source
# File lib/nokogiri/xml/node.rb, line 1477 def write_xhtml_to(io, options = {}) write_format_to(SaveOptions::DEFAULT_XHTML, io, options) end
Write Node as XHTML to io with options
See Node#write_to for a list of options
Source
# File lib/nokogiri/xml/node.rb, line 1487 def write_xml_to(io, options = {}) options[:save_with] ||= SaveOptions::DEFAULT_XML write_to(io, options) end
Write Node as XML to io with options
doc.write_xml_to io, :encoding => 'UTF-8'
See Node#write_to for a list of options
Working With Node Attributes
↑ topPublic Instance Methods
Source
# File lib/nokogiri/xml/node.rb, line 587 def [](name) get(name.to_s) end
Fetch an attribute from this node.
⚠ Note that attributes with namespaces cannot be accessed with this method. To access namespaced attributes, use attribute_with_ns.
- Returns
-
(String, nil) value of the attribute
name, ornilif no matching attribute exists
Example
doc = Nokogiri::XML("<root><child size='large' class='big wide tall'/></root>") child = doc.at_css("child") child["size"] # => "large" child["class"] # => "big wide tall"
Example: Namespaced attributes will not be returned.
⚠ Note namespaced attributes may be accessed with attribute or attribute_with_ns
doc = Nokogiri::XML(<<~EOF) <root xmlns:width='http://example.com/widths'> <child width:size='broad'/> </root> EOF doc.at_css("child")["size"] # => nil doc.at_css("child").attribute("size").value # => "broad" doc.at_css("child").attribute_with_ns("size", "http://example.com/widths").value # => "broad"
Source
# File lib/nokogiri/xml/node.rb, line 625 def []=(name, value) set(name.to_s, value.to_s) end
Update the attribute name to value, or create the attribute if it does not exist.
⚠ Note that attributes with namespaces cannot be accessed with this method. To access namespaced attributes for update, use attribute_with_ns. To add a namespaced attribute, see the example below.
- Returns
-
value
Example
doc = Nokogiri::XML("<root><child/></root>") child = doc.at_css("child") child["size"] = "broad" child.to_html # => "<child size=\"broad\"></child>"
Example: Add a namespaced attribute.
doc = Nokogiri::XML(<<~EOF) <root xmlns:width='http://example.com/widths'> <child/> </root> EOF child = doc.at_css("child") child["size"] = "broad" ns = doc.root.namespace_definitions.find { |ns| ns.prefix == "width" } child.attribute("size").namespace = ns doc.to_html # => "<root xmlns:width=\"http://example.com/widths\">\n" + # " <child width:size=\"broad\"></child>\n" + # "</root>\n"
Source
# File lib/nokogiri/xml/node.rb, line 790 def add_class(names) kwattr_add("class", names) end
Ensure HTML CSS classes are present on self. Any CSS classes in names that already exist in the “class” attribute are not added. Note that any existing duplicates in the “class” attribute are not removed. Compare with append_class.
This is a convenience function and is equivalent to:
node.kwattr_add("class", names)
See related: kwattr_add, classes, append_class, remove_class
- Parameters
-
names(String, Array<String>)CSSclass names to be added to the Node’s “class” attribute. May be a string containing whitespace-delimited names, or an Array of String names. Any class names already present will not be added. Any class names not present will be added. If no “class” attribute exists, one is created.
- Returns
-
self(Node) for ease of chaining method calls.
Example: Ensure that the node has CSS class “section”
node # => <div></div> node.add_class("section") # => <div class="section"></div> node.add_class("section") # => <div class="section"></div> # duplicate not added
Example: Ensure that the node has CSS classes “section” and “header”, via a String argument
Note that the CSS class “section” is not added because it is already present. Note also that the pre-existing duplicate CSS class “section” is not removed.
node # => <div class="section section"></div> node.add_class("section header") # => <div class="section section header"></div>
Example: Ensure that the node has CSS classes “section” and “header”, via an Array argument
node # => <div></div> node.add_class(["section", "header"]) # => <div class="section header"></div>
Source
# File lib/nokogiri/xml/node.rb, line 834 def append_class(names) kwattr_append("class", names) end
Add HTML CSS classes to self, regardless of duplication. Compare with add_class.
This is a convenience function and is equivalent to:
node.kwattr_append("class", names)
See related: kwattr_append, classes, add_class, remove_class
- Parameters
-
names(String, Array<String>)CSSclass names to be appended to the Node’s “class” attribute. May be a string containing whitespace-delimited names, or an Array of String names. All class names passed in will be appended to the “class” attribute even if they are already present in the attribute value. If no “class” attribute exists, one is created.
- Returns
-
self(Node) for ease of chaining method calls.
Example: Append “section” to the node’s CSS “class” attribute
node # => <div></div> node.append_class("section") # => <div class="section"></div> node.append_class("section") # => <div class="section section"></div> # duplicate added!
Example: Append “section” and “header” to the noded’s CSS “class” attribute, via a String argument
Note that the CSS class “section” is appended even though it is already present
node # => <div class="section section"></div> node.append_class("section header") # => <div class="section section section header"></div>
Example: Append “section” and “header” to the node’s CSS “class” attribute, via an Array argument
node # => <div></div> node.append_class(["section", "header"]) # => <div class="section header"></div> node.append_class(["section", "header"]) # => <div class="section header section header"></div>
Source
# File lib/nokogiri/xml/node.rb, line 684 def attributes attribute_nodes.each_with_object({}) do |node, hash| hash[node.node_name] = node end end
Fetch this node’s attributes.
⚠ Because the keys do not include any namespace information for the attribute, in case of a simple name collision, not all attributes will be returned. In this case, you will need to use attribute_nodes.
- Returns
-
Hash containing attributes belonging to
self. The hash keys are String attribute names (without the namespace), and the hash values areNokogiri::XML::Attr.
Example with no namespaces:
doc = Nokogiri::XML("<root><child size='large' class='big wide tall'/></root>") doc.at_css("child").attributes # => {"size"=>#(Attr:0x550 { name = "size", value = "large" }), # "class"=>#(Attr:0x564 { name = "class", value = "big wide tall" })}
Example with a namespace:
doc = Nokogiri::XML("<root xmlns:desc='http://example.com/sizes'><child desc:size='large'/></root>") doc.at_css("child").attributes # => {"size"=> # #(Attr:0x550 { # name = "size", # namespace = #(Namespace:0x564 { # prefix = "desc", # href = "http://example.com/sizes" # }), # value = "large" # })}
Example with an attribute name collision:
⚠ Note that only one of the attributes is returned in the Hash.
doc = Nokogiri::XML(<<~EOF) <root xmlns:width='http://example.com/widths' xmlns:height='http://example.com/heights'> <child width:size='broad' height:size='tall'/> </root> EOF doc.at_css("child").attributes # => {"size"=> # #(Attr:0x550 { # name = "size", # namespace = #(Namespace:0x564 { # prefix = "height", # href = "http://example.com/heights" # }), # value = "tall" # })}
Source
# File lib/nokogiri/xml/node.rb, line 744 def classes kwattr_values("class") end
Fetch CSS class names of a Node.
This is a convenience function and is equivalent to:
node.kwattr_values("class")
See related: kwattr_values, add_class, append_class, remove_class
- Returns
-
The
CSSclasses (Array of String) present in the Node’s “class” attribute. If the attribute is empty or non-existent, the return value is an empty array.
Example
node # => <div class="section title header"></div> node.classes # => ["section", "title", "header"]
Source
# File lib/nokogiri/xml/node.rb, line 710 def each attribute_nodes.each do |node| yield [node.node_name, node.value] end end
Iterate over each attribute name and value pair for this Node.
Source
# File lib/nokogiri/xml/node.rb, line 704 def keys attribute_nodes.map(&:node_name) end
Get the attribute names for this Node.
Source
# File lib/nokogiri/xml/node.rb, line 967 def kwattr_add(attribute_name, keywords) keywords = keywordify(keywords) current_kws = kwattr_values(attribute_name) new_kws = (current_kws + (keywords - current_kws)).join(" ") set_attribute(attribute_name, new_kws) self end
Ensure that values are present in a keyword attribute.
Any values in keywords that already exist in the Node’s attribute values are not added. Note that any existing duplicates in the attribute values are not removed. Compare with kwattr_append.
A “keyword attribute” is a node attribute that contains a set of space-delimited values. Perhaps the most familiar example of this is the HTML “class” attribute used to contain CSS classes. But other keyword attributes exist, for instance the “rel” attribute.
See also add_class, kwattr_values, kwattr_append, kwattr_remove
- Parameters
-
attribute_name(String) The name of the keyword attribute to be modified. -
keywords(String, Array<String>) Keywords to be added to the attribute namedattribute_name. May be a string containing whitespace-delimited values, or an Array of String values. Any values already present will not be added. Any values not present will be added. If the named attribute does not exist, it is created.
- Returns
-
self(Nokogiri::XML::Node) for ease of chaining method calls.
Example: Ensure that a Node has “nofollow” in its rel attribute.
Note that duplicates are not added.
node # => <a></a> node.kwattr_add("rel", "nofollow") # => <a rel="nofollow"></a> node.kwattr_add("rel", "nofollow") # => <a rel="nofollow"></a>
Example: Ensure that a Node has “nofollow” and “noreferrer” in its rel attribute, via a String argument.
Note that "nofollow" is not added because it is already present. Note also that the
pre-existing duplicate "nofollow" is not removed.
node # => <a rel="nofollow nofollow"></a>
node.kwattr_add("rel", "nofollow noreferrer") # => <a rel="nofollow nofollow noreferrer"></a>
Example: Ensure that a Node has “nofollow” and “noreferrer” in its rel attribute, via an Array argument.
node # => <a></a> node.kwattr_add("rel", ["nofollow", "noreferrer"]) # => <a rel="nofollow noreferrer"></a>
Since v1.11.0
Source
# File lib/nokogiri/xml/node.rb, line 1020 def kwattr_append(attribute_name, keywords) keywords = keywordify(keywords) current_kws = kwattr_values(attribute_name) new_kws = (current_kws + keywords).join(" ") set_attribute(attribute_name, new_kws) self end
Add keywords to a Node’s keyword attribute, regardless of duplication. Compare with kwattr_add.
A “keyword attribute” is a node attribute that contains a set of space-delimited values. Perhaps the most familiar example of this is the HTML “class” attribute used to contain CSS classes. But other keyword attributes exist, for instance the “rel” attribute.
See also append_class, kwattr_values, kwattr_add, kwattr_remove
- Parameters
-
attribute_name(String) The name of the keyword attribute to be modified. -
keywords(String, Array<String>) Keywords to be added to the attribute namedattribute_name. May be a string containing whitespace-delimited values, or an Array of String values. All values passed in will be appended to the named attribute even if they are already present in the attribute. If the named attribute does not exist, it is created.
- Returns
-
self(Node) for ease of chaining method calls.
Example: Append “nofollow” to the rel attribute.
Note that duplicates are added.
node # => <a></a> node.kwattr_append("rel", "nofollow") # => <a rel="nofollow"></a> node.kwattr_append("rel", "nofollow") # => <a rel="nofollow nofollow"></a>
Example: Append “nofollow” and “noreferrer” to the rel attribute, via a String argument.
Note that “nofollow” is appended even though it is already present.
node # => <a rel="nofollow"></a> node.kwattr_append("rel", "nofollow noreferrer") # => <a rel="nofollow nofollow noreferrer"></a>
Example: Append “nofollow” and “noreferrer” to the rel attribute, via an Array argument.
node # => <a></a> node.kwattr_append("rel", ["nofollow", "noreferrer"]) # => <a rel="nofollow noreferrer"></a>
Since v1.11.0
Source
# File lib/nokogiri/xml/node.rb, line 1063 def kwattr_remove(attribute_name, keywords) if keywords.nil? remove_attribute(attribute_name) return self end keywords = keywordify(keywords) current_kws = kwattr_values(attribute_name) new_kws = current_kws - keywords if new_kws.empty? remove_attribute(attribute_name) else set_attribute(attribute_name, new_kws.join(" ")) end self end
Remove keywords from a keyword attribute. Any matching keywords that exist in the named attribute are removed, including any multiple entries.
If no keywords remain after this operation, or if keywords is nil, the attribute is deleted from the node.
A “keyword attribute” is a node attribute that contains a set of space-delimited values. Perhaps the most familiar example of this is the HTML “class” attribute used to contain CSS classes. But other keyword attributes exist, for instance the “rel” attribute.
See also remove_class, kwattr_values, kwattr_add, kwattr_append
- Parameters
-
attribute_name(String) The name of the keyword attribute to be modified. -
keywords(String, Array<String>) Keywords to be removed from the attribute namedattribute_name. May be a string containing whitespace-delimited values, or an Array of String values. Any keywords present in the named attribute will be removed. If no keywords remain, or ifkeywordsis nil, the attribute is deleted.
- Returns
-
self(Node) for ease of chaining method calls.
Example:
Note that the rel attribute is deleted when empty.
node # => <a rel="nofollow noreferrer">link</a> node.kwattr_remove("rel", "nofollow") # => <a rel="noreferrer">link</a> node.kwattr_remove("rel", "noreferrer") # => <a>link</a>
Since v1.11.0
Source
# File lib/nokogiri/xml/node.rb, line 913 def kwattr_values(attribute_name) keywordify(get_attribute(attribute_name) || []) end
Fetch values from a keyword attribute of a Node.
A “keyword attribute” is a node attribute that contains a set of space-delimited values. Perhaps the most familiar example of this is the HTML “class” attribute used to contain CSS classes. But other keyword attributes exist, for instance the “rel” attribute.
See also classes, kwattr_add, kwattr_append, kwattr_remove
- Parameters
-
attribute_name(String) The name of the keyword attribute to be inspected.
- Returns
-
(Array<String>) The values present in the Node’s
attribute_nameattribute. If the attribute is empty or non-existent, the return value is an empty array.
Example:
node # => <a rel="nofollow noopener external">link</a> node.kwattr_values("rel") # => ["nofollow", "noopener", "external"]
Since v1.11.0
Source
# File lib/nokogiri/xml/node.rb, line 718 def remove_attribute(name) attr = attributes[name].remove if key?(name) clear_xpath_context if Nokogiri.jruby? attr end
Remove the attribute named name
Source
# File lib/nokogiri/xml/node.rb, line 884 def remove_class(names = nil) kwattr_remove("class", names) end
Remove HTML CSS classes from this node. Any CSS class names in css_classes that exist in this node’s “class” attribute are removed, including any multiple entries.
If no CSS classes remain after this operation, or if css_classes is nil, the “class” attribute is deleted from the node.
This is a convenience function and is equivalent to:
node.kwattr_remove("class", css_classes)
Also see kwattr_remove, classes, add_class, append_class
- Parameters
-
css_classes(String, Array<String>)CSSclass names to be removed from the Node’s “class” attribute. May be a string containing whitespace-delimited names, or an Array of String names. Any class names already present will be removed. If noCSSclasses remain, the “class” attribute is deleted.
- Returns
-
self(Nokogiri::XML::Node) for ease of chaining method calls.
Example: Deleting a CSS class
Note that all instances of the class “section” are removed from the “class” attribute.
node # => <div class="section header section"></div> node.remove_class("section") # => <div class="header"></div>
Example: Deleting the only remaining CSS class
Note that the attribute is removed once there are no remaining classes.
node # => <div class="section"></div> node.remove_class("section") # => <div></div>
Example: Deleting multiple CSS classes
Note that the “class” attribute is deleted once it’s empty.
node # => <div class="section header float"></div> node.remove_class(["section", "float"]) # => <div class="header"></div>
Source
# File lib/nokogiri/xml/node.rb, line 698 def value?(value) values.include?(value) end
Does this Node’s attributes include <value>