Quickdocs

cl-sanitize-html

2026-01-01

OWASP-style HTML sanitization library for Common Lisp

Upstream URL

github.com/atgreen/cl-sanitize-html

Author

Anthony Green <green@moxielogic.com>

License

MIT

README

Provided Systems

cl-sanitize-html

OWASP-style HTML sanitization library for Common Lisp, designed for safely rendering untrusted HTML content (like HTML emails or user-generated content).

Features

Whitelist-based sanitization - Only explicitly allowed tags and attributes pass through
Multiple security policies - Default, Strict, and Email policies included
XSS prevention - Blocks script tags, event handlers, javascript: URLs, and other attack vectors
CSS sanitization - Optional CSS property filtering for email content
Safe defaults - Automatically adds rel="noopener noreferrer" and target="_blank" to links
Plump-based - Built on the robust Plump HTML parser
Well-tested - Comprehensive test suite covering OWASP attack vectors

Quick Start

(use-package :sanitize-html)

;; Basic usage with default policy
(sanitize "<script>alert('XSS')</script><p>Hello</p>")
;; => "<p>Hello</p>"

;; Remove event handlers
(sanitize "<a href='#' onclick='alert(1)'>Click me</a>")
;; => "<a href=\"#\" rel=\"noopener noreferrer\" target=\"_blank\">Click me</a>"

;; Use email policy for HTML emails
(sanitize "<table><tr><td bgcolor='red'>Cell</td></tr></table>" *email-policy*)
;; => "<table><tr><td bgcolor=\"red\">Cell</td></tr></table>"

Security Policies

Default Policy (default-policy)

Balanced security and usability for general web content:

Allowed tags: Common formatting and semantic tags (p, div, span, a, strong, em, lists, tables, etc.)
Allowed protocols: http, https, mailto, ftp
Inline styles: Blocked
Comments: Removed

Strict Policy (strict-policy)

Maximum security with minimal formatting:

Allowed tags: Only basic formatting (a, b, em, strong, ul, ol, li, p, br, code, pre)
Allowed protocols: https, mailto only
Very limited attributes: Only href, title, and class

Email Policy (email-policy)

Designed for HTML emails with legacy formatting:

Allowed tags: All email-safe tags including tables, font, center
Allowed protocols: http, https, mailto, cid (inline images), data (base64)
Inline styles: Allowed with filtered CSS properties
Table attributes: bgcolor, cellpadding, cellspacing, etc.

API

Main Functions

(sanitize html-string &optional policy)
(sanitize-html html-string &optional policy)

Sanitize HTML string according to policy. Returns sanitized HTML string.

Parameters:

html-string - String containing HTML to sanitize
policy - Security policy to apply (defaults to *default-policy*)

Returns: Sanitized HTML string

Example:

(sanitize "<script>bad</script><p>good</p>")
;; => "<p>good</p>"

Utility Functions

(safe-url-p url &optional policy)

Check if URL uses a safe protocol according to policy.

(sanitize-url url &optional policy)

Return URL if safe, nil otherwise.

Custom Policies

(make-policy &key allowed-tags allowed-attributes allowed-protocols
                  allowed-css-properties remove-comments escape-cdata)

Create a custom security policy.

Example:

(defparameter *my-policy*
  (make-policy
   :allowed-tags '("p" "br" "a" "strong" "em")
   :allowed-attributes '(("a" . ("href" "title")))
   :allowed-protocols '("https")
   :remove-comments t))

(sanitize html-string *my-policy*)

Security Features

XSS Prevention

✅ Script tags removed
✅ Event handlers (onclick, onload, etc.) removed
✅ javascript: protocol blocked
✅ data: protocol blocked (except in email policy with validation)
✅ Inline styles blocked (except in email policy with CSS filtering)
✅ Form elements blocked
✅ iframe/object/embed blocked
✅ meta/link/style/base blocked

CSS Injection Prevention

CSS properties filtered by whitelist (email policy only)
javascript:, expression(), @import blocked in CSS values
behavior: property blocked (IE-specific XSS vector)

Safe Defaults

Links automatically get rel="noopener noreferrer" (prevents tabnabbing)
Links automatically get target="_blank" (open in new tab)
Comments removed by default
CDATA sections escaped by default

Email HTML Example

(defun render-email-html (email-html-body)
  "Safely render HTML email content"
  (sanitize-html email-html-body *email-policy*))

;; Typical email HTML with inline styles and tables
(render-email-html "
  <table border='1' cellpadding='10'>
    <tr>
      <td bgcolor='#ff0000'>
        <p style='color: white; font-size: 16px'>
          Welcome to our newsletter!
        </p>
      </td>
    </tr>
  </table>
  <p>
    <a href='https://example.com'>Visit our site</a>
  </p>
")

Running Tests

(asdf:test-system :sanitize-html)

Or manually:

(asdf:load-system :sanitize-html/tests)
(fiveam:run! :sanitize-html-tests)

Dependencies

plump - Lenient HTML/XML parser
lquery - DOM manipulation
cl-ppcre - Regular expressions for CSS parsing
alexandria - Utilities library

Test dependencies:

fiveam - Unit testing framework

Architecture

Parser - Uses Plump to parse HTML into a DOM tree
Tree Walker - Recursively visits each node in the DOM
Policy Enforcer - Checks each element/attribute against whitelist
Sanitizer - Removes or modifies unsafe content
Serializer - Converts sanitized DOM back to HTML string

Comparison with Other Libraries

Feature	sanitize-html	bluemonday (Go)	ammonia (Rust)	bleach (Python)
Whitelist-based	✅	✅	✅	✅
Multiple policies	✅	✅	✅	❌
CSS sanitization	✅	✅	✅	✅
URL validation	✅	✅	✅	✅
Link safety	✅	✅	❌	❌
OWASP-aligned	✅	✅	✅	✅

References

Author and License

sanitize-html was written by Anthony Green and is distributed under the terms of the MIT license.

cl-sanitize-html

Upstream URL

Author

License

cl-sanitize-html

Features

Quick Start

Security Policies

Default Policy (default-policy)

Strict Policy (strict-policy)

Email Policy (email-policy)

API

Main Functions

Utility Functions

Custom Policies

Security Features

XSS Prevention

CSS Injection Prevention

Safe Defaults

Email HTML Example

Running Tests

Dependencies

Architecture

Comparison with Other Libraries

References

Author and License

Dependencies (5)

Dependents (0)