Rich-text import SSRF testing¶

Use this when a CMS, page builder, document importer, HTML-to-PDF renderer, or WYSIWYG widget accepts HTML and rewrites embedded media by fetching it server-side. The durable bug-hunting pattern is: user-controlled markup crosses into an image/import/render pipeline, the backend resolves relative URLs against a supplied base URL, validates the URL, fetches the result, then stores, re-hosts, or embeds fetched bytes.

Authorized testing only

Run these checks only on systems where you have explicit permission. Keep targets to your own callback host, owned lab services, and in-scope internal canaries provided by the program. Do not probe cloud metadata, loopback admin panels, or private services unless the rules of engagement explicitly allow it.

Why this matters¶

GitHub advisory GHSA-pr28-mf3q-qpg6 / CVE-2026-45012 showed this shape in ApostropheCMS: an authenticated rich-text widget validation endpoint accepted an import.html value containing an image, resolved the src with new URL(src, import.baseUrl || baseUrl), fetched it server-side, and imported image-compatible responses into Apostrophe's attachment pipeline.

GitHub advisory GHSA-983w-rhvv-gwmv / CVE-2025-68616 exposed the same operator seam in WeasyPrint's default_url_fetcher: an application-level url_fetcher could validate the initial URL, then the underlying fetch layer followed an HTTP redirect to a destination that was not revalidated. Treat server-side renderers as URL-fetch pipelines with per-hop authority decisions, not one-time string filters.

GitHub advisory GHSA-jhhc-3hcp-qhm5 / CVE-2026-49452 added another WeasyPrint renderer seam: when applications enable presentational_hints=True, unescaped HTML presentational attributes can be embedded into CSS and parsed as stylesheet declarations. For operators, the reusable pattern is HTML attribute value to CSS url() fetch, not generic styling injection. Prove it only with owned callback URLs or explicitly approved internal canaries.

GitHub advisory GHSA-pj8j-p4g4-4vw8 / CVE-2026-49865 showed the same server-side fetch boundary in Kimai invoice PDF rendering: user-controlled invoice Markdown, such as customer invoice text, was converted to HTML and handed to mPDF, which fetched Markdown image URLs from the application server during PDF preview or generation. Treat invoice notes, quote terms, customer messages, receipt templates, and billing comments as renderer input whenever they are later embedded in a generated PDF.

That is more useful than a one-off product advisory because the same pattern appears in many CMS/editor features:

paste-from-URL and paste-from-Word/HTML import;
page-builder widget validation or preview APIs;
avatar, OpenGraph, favicon, media-library, and attachment import helpers;
Markdown/HTML-to-PDF previewers that rewrite or embed images, CSS, fonts, and linked resources;
invoice, quote, receipt, timesheet, and billing PDF generators that render Markdown or rich-text customer-controlled fields;
HTML-to-PDF renderers that enable legacy presentational hints and convert attributes such as background into CSS;
migration tools that ingest remote HTML and localize assets.

Inputs¶

An authenticated low-privilege account that can create, edit, preview, validate, or import rich-text content.
A controlled callback endpoint that logs method, path, headers, timing, and request body size.
A tiny valid image file, such as a 1x1 PNG, hosted on the callback server.
A list of in-scope editor/import endpoints from normal application traffic.
Written authorization before attempting internal reachability checks.

Find candidate endpoints¶

Look for requests that contain both widget/editor metadata and imported HTML:

POST /api/*/validate-widget
POST /api/*/preview
POST /api/*/import
POST /admin/*/media/import
POST /graphql  # mutations that validate blocks, widgets, or rich text

In captured JSON or multipart bodies, prioritize fields named:

html
content
body
import.html
import.baseUrl
baseUrl
sourceUrl
src
url
widgets
blocks
area
attachments

The strongest candidates parse HTML and also return image IDs, attachment IDs, rewritten media URLs, thumbnail paths, or validation errors mentioning image processing.

Safe callback proof¶

Start with an external callback you control. Serve a known image and unique path per attempt:

mkdir -p /tmp/import-ssrf
base64 -d > /tmp/import-ssrf/proof.png <<'EOF'
iVBORw0KGgoAAAANSUhEUgAAAAEAAAABCAQAAAC1HAwCAAAAC0lEQVR42mP8/x8AAwMCAO+y1n0AAAAASUVORK5CYII=
EOF
python3 -m http.server 7777 --directory /tmp/import-ssrf

Submit HTML that references the image:

<p>proof</p><img src="https://callback.example/proof-<run-id>.png">

For Markdown-to-PDF renderers, use image syntax in the lowest-privilege field that reaches the rendered document, such as invoice notes, customer text, quote terms, or a document comment:

![proof](https://callback.example/proof-<run-id>.png)

If the importer supports a separate base URL, test both absolute and relative forms:

{
  "import": {
    "baseUrl": "https://callback.example/assets/",
    "html": "<img src=\"proof-<run-id>.png\">"
  }
}

Record proof only when your callback logs a server-side request that cannot be explained by your browser loading the image. Useful separators:

use a callback hostname that your browser never loads directly;
put the image URL only inside the API payload, not in rendered page HTML;
compare callback User-Agent, source IP/ASN, and timing to the submitted request;
repeat with a new random path and confirm the request follows the server action.

For billing or invoice renderers, trigger both preview and final generation if they are in scope. Capture which action caused the callback, the source field, the generated document type, and whether the renderer user-agent or source IP differs from the browser session. Do not test with real customer records, live payment data, or production invoice content.

Redirect revalidation harness¶

Use this when the target claims to block private or loopback destinations before server-side rendering/fetching.

Preconditions: an owned redirector, an owned external callback endpoint, and explicit authorization before using any internal canary. Do not aim redirects at cloud metadata, admin panels, Kubernetes APIs, databases, or production intranet hosts.

Set up two controlled URLs:

https://redirector.example/to-external-canary -> 302 https://callback.example/proof-<run-id>.png
https://redirector.example/to-approved-internal-canary -> 302 http://127.0.0.1:<approved-canary-port>/proof.png

Submit the first URL through the renderer/importer and confirm the callback proves server-side redirect following. Only then, if the rules of engagement allow it, use a program-provided internal canary for the second URL.

Positive evidence is a decision-table mismatch:

Input	Expected policy	Observed behavior
Direct blocked URL	rejected before fetch	rejected
Allowed URL that redirects to blocked URL	rejected after redirect revalidation	fetched or attempted
Allowed URL that redirects to owned external canary	fetched	callback hit

Capture the redirect chain, normalized destination, status code, and whether the renderer stored or exposed fetched bytes. Stop at canary reachability; do not enumerate ports or fetch real internal resources.

Presentational-hint CSS injection harness¶

Use this when an HTML-to-PDF or document-rendering path uses WeasyPrint or a similar renderer with legacy presentational hints enabled.

Preconditions: an owned callback endpoint, an authorized render/preview/upload path, and evidence that the application enables presentational hints. Do not test production metadata, loopback admin panels, or private services.

First submit harmless HTML containing a normal external image or CSS resource pointing to your callback and confirm server-side fetch behavior.
Then place a fixed canary URL in a presentational attribute value that the renderer may transform into CSS. The proof goal is a callback hit for the injected CSS resource, not data retrieval.
Capture the rendered-document request, callback path, source IP/UA, and whether the application returned a PDF, preview, or renderer error.
Run negative controls: presentational hints disabled, patched/fixed renderer behavior when available, and a literal attribute value that should not create a second CSS declaration.

Positive evidence is a decision-table mismatch:

Input location	Expected policy	Observed behavior
Normal image/CSS URL to owned callback	fetched if remote resources are allowed	callback hit
Presentational attribute with callback canary embedded as CSS resource	treated as a literal attribute or rejected	callback hit from renderer
Same attribute with presentational hints disabled	no CSS-derived fetch	no callback hit

Keep the report scoped to attribute-to-CSS resource fetch unless you separately prove a stronger impact in a lab. Do not include cloud metadata URLs or internal hostnames in the wiki or report artifacts.

Exfiltration check for image-compatible responses¶

Some importers store and re-host fetched images. This turns blind SSRF into response exfiltration for image-compatible content.

After the callback fires, inspect the API response for:

imageId
imageIds
attachmentId
url
src
thumbnail
rehosted media path

Then fetch the generated media URL from the application and compare the bytes or image dimensions to your controlled file. Keep the proof benign: a 1x1 PNG with a unique color or embedded non-sensitive marker is enough.

Report the distinction clearly:

Blind SSRF: callback received a request, but the response was not exposed.
Semi-blind SSRF: status, timing, size, or processing error leaked reachability.
Non-blind/exfiltrating SSRF: fetched bytes were stored or re-hosted and could be retrieved through the app.

Authorized internal reachability checks¶

Only do this when the scope explicitly permits internal SSRF validation. Prefer program-provided canaries over real infrastructure.

Use low-risk probes:

http://127.0.0.1:<approved-canary-port>/proof.png
http://localhost:<approved-canary-port>/proof.png
http://[::1]:<approved-canary-port>/proof.png
http://10.0.0.10:<approved-canary-port>/proof.png

For rough port discovery, use per-port random paths and measure differences in:

callback or collaborator hits;
API response time;
importer error class;
stored-media success/failure;
returned byte count or image-processing errors.

Avoid dangerous targets such as metadata services, admin APIs, unauthenticated databases, cloud credentials endpoints, or anything outside the rules of engagement.

Bypass variants worth testing¶

If the application claims to filter private destinations, validate the filter at the same layer that performs the fetch:

relative src resolved against attacker-controlled baseUrl;
redirects from an allowed host to a blocked range;
DNS rebinding or changed A/AAAA records between validation and fetch;
IPv6 loopback and IPv4-mapped IPv6 forms;
decimal, octal, hexadecimal, or shortened IPv4 notations;
mixed-case schemes and whitespace/control-character normalization;
protocol-relative URLs such as //host/path;
multiple images where only the first URL is validated;
thumbnail or metadata extraction paths that refetch after initial validation;
presentational HTML attributes that are converted into CSS without escaping.

Document which variants were tested and which were intentionally not attempted because they were out of scope.

Reporting heuristic¶

A strong report includes:

The exact role/permission required to reach the importer.
The endpoint and JSON/form fields that control imported HTML and base URL resolution.
Callback evidence proving server-side fetch.
Whether the response body is blind, semi-blind, or retrievable through stored media.
The lowest-risk internal reachability proof allowed by the scope.
The impact path: internal service discovery, image-compatible response exfiltration, or pivot into a more privileged server-side fetcher.
The regression cases: absolute image URL, relative image with attacker base URL, redirect, IPv6/IPv4 canonicalization, and multi-image validation parity.

Operator notes¶

Authenticated SSRF is still valuable when the permission is common, such as author/editor/contributor.
Widget validation and preview APIs are often less protected than publish paths because teams treat them as temporary drafts.
Image pipelines can hide response exfiltration behind normal media-library behavior. Always look for returned IDs and rewritten URLs, not just immediate response bodies.
The safest proof is a controlled image callback plus stored-media byte comparison. Escalate beyond that only when the engagement explicitly allows it.