19. Structured Markup Processing Tools
Python provides a variety of modules for working with structured data markup. These include modules for the Standard Generalized Markup Language (SGML) and the Hypertext Markup Language (HTML), as well as several interfaces for the Extensible Markup Language (XML).
Note that the modules in the xml package require at least one SAX-compliant XML parser to be available. Because the Expat parser is included with Python, the xml.parsers.expat module is always available.
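As a minimal sketch of using the bundled Expat parser directly, the following collects element names in document order. The helper name `collect_start_tags` and the sample document are illustrative assumptions, not part of the library.

```python
import xml.parsers.expat


def collect_start_tags(xml_text):
    """Return element names in document order (hypothetical helper)."""
    names = []
    parser = xml.parsers.expat.ParserCreate()
    # Expat calls this handler with the tag name and a dict of attributes.
    parser.StartElementHandler = lambda name, attrs: names.append(name)
    # The second argument marks this as the final chunk of input.
    parser.Parse(xml_text, True)
    return names


# Sample input invented for illustration.
tags = collect_start_tags("<root><item id='1'/><item id='2'/></root>")
print(tags)  # ['root', 'item', 'item']
```

Malformed input raises `xml.parsers.expat.ExpatError`, so callers handling untrusted data would typically wrap the `Parse` call in a `try` block.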
The documentation for the xml.dom and xml.sax packages defines the Python bindings for the DOM and SAX interfaces.
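To illustrate how the two interfaces differ in practice, here is a small sketch that reads the same document once through a DOM tree (xml.dom.minidom) and once through SAX events (xml.sax). The sample document, the `TitleCollector` class, and the function names are assumptions made up for this example.

```python
import xml.dom.minidom
import xml.sax

# Sample document invented for illustration.
SAMPLE = "<library><book title='Dune'/><book title='Emma'/></library>"


def titles_via_dom(xml_text):
    """Build the whole tree in memory, then query it."""
    document = xml.dom.minidom.parseString(xml_text)
    books = document.getElementsByTagName("book")
    return [book.getAttribute("title") for book in books]


class TitleCollector(xml.sax.ContentHandler):
    """Hypothetical handler that records titles as events stream past."""

    def __init__(self):
        super().__init__()
        self.titles = []

    def startElement(self, name, attrs):
        if name == "book":
            self.titles.append(attrs.getValue("title"))


def titles_via_sax(xml_text):
    """Process the document as a stream of events, without building a tree."""
    handler = TitleCollector()
    xml.sax.parseString(xml_text.encode("utf-8"), handler)
    return handler.titles


print(titles_via_dom(SAMPLE))  # ['Dune', 'Emma']
print(titles_via_sax(SAMPLE))  # ['Dune', 'Emma']
```

The DOM version is usually more convenient for small documents that need random access, while the SAX version tends to suit large inputs because it never holds the full tree in memory.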
- 19.1. html — HyperText Markup Language support
- 19.2. html.parser — Simple HTML and XHTML parser
- 19.3. html.entities — Definitions of HTML general entities
- 19.4. xml.parsers.expat — Fast XML parsing using Expat
- 19.5. xml.dom — The Document Object Model API
  - 19.5.1. Module Contents
  - 19.5.2. Objects in the DOM
    - 19.5.2.1. DOMImplementation Objects
    - 19.5.2.2. Node Objects
    - 19.5.2.3. NodeList Objects
    - 19.5.2.4. DocumentType Objects
    - 19.5.2.5. Document Objects
    - 19.5.2.6. Element Objects
    - 19.5.2.7. Attr Objects
    - 19.5.2.8. NamedNodeMap Objects
    - 19.5.2.9. Comment Objects
    - 19.5.2.10. Text and CDATASection Objects
    - 19.5.2.11. ProcessingInstruction Objects
    - 19.5.2.12. Exceptions
  - 19.5.3. Conformance
- 19.6. xml.dom.minidom — Lightweight DOM implementation
- 19.7. xml.dom.pulldom — Support for building partial DOM trees
- 19.8. xml.sax — Support for SAX2 parsers
- 19.9. xml.sax.handler — Base classes for SAX handlers
- 19.10. xml.sax.saxutils — SAX Utilities
- 19.11. xml.sax.xmlreader — Interface for XML parsers
- 19.12. xml.etree.ElementTree — The ElementTree XML API