An XML / XHTML / HTML parser that aims to be as lenient as possible.
A HTML5 parser for Common Lisp
cl-sanitize is a whitelist-based HTML sanitizer. Given a list of acceptable elements and attributes, cl-sanitize will remove all unacceptable HTML from a string.Homepage: https://github.com/archimag/cl-sanitizeLicense: LLGPL
Topics: HTML parser
Closure-html is a portable HTML parsing library that understands malformed HTML.
chtml, Closure HTMLOriginal author:
HTML, parser, HTML parserHomepage:
Yes, see belowDownload (git clone):
(gitweb)This project is asdf-installable (release tarball) and available in clbuild (git version).
An HTML parser written in Common Lisp.Feature overview
Supports HTML 4 and understands malformed HTML.
Serialization implemented using an event API called HAX, inspired by SAX.
Can be used with XML representations designed for cxml
In-memory representations that can be used with Closure HTML:
Lisp lists in "LHTML" format
Closure's internal PT structuresany representation supported by its sister project cxml, including DOM Core and STP
Relation to Closure XML
chtml is an error-correcting HTML 4 parser; cxml an XML parserEtymologychtml is short for Closure HTML, in reference to the Closure web
browser it was originally written for.Note that the Closure web browser is not related to the Clozure implementation of Common Lisp and works on various Common Lisp implementations.
URL-REWRITE programmatically rewrites (X)HTML documents such that certain attributes values are replaced by others. It was originally written to rewrite URLs for cookie-less session handling.This one's by Edi Weitz and can be found at http://weitz.de/url-rewrite/.