Open Source Tool

Read RTF as Structure, Not Just Text

Sourcetrace RTF is powered by rtfstruct, a free open-source Python parser for converting Rich Text Format into a structured document AST.

It is designed for AI ingestion, archival processing, legal workflows, and legacy document conversion where plain-text extraction loses useful document meaning.

Powered by rtfstruct · Apache-2.0 · Free open source

What It Preserves

Document AST, Diagnostics, JSON, and Markdown Export

01

Paragraphs

02

Inline styles

03

Lists

04

Tables

05

Links

06

Fields

07

Footnotes and endnotes where supported

08

Annotations

09

Images

10

Metadata

11

Source spans where available

12

Recoverable diagnostics

13

JSON and Markdown export

Why It Exists

RTF Still Carries Business Structure

RTF still appears in legal, medical, government, publishing, and legacy business systems.

Treating those files as plain text removes structure before AI or search systems ever see the document.

Sourcetrace RTF exists to expose that structure in a machine-readable form.

Licence

Apache-2.0

Sourcetrace RTF is powered by rtfstruct, released under Apache-2.0.

Commercial use is permitted under the Apache-2.0 licence.

Relationship to Lumen & Lever

Maintained as Part of Document-Structure Control Work

Sourcetrace RTF is maintained as part of Lumen & Lever’s document-structure work.

It is free because its purpose is adoption, inspection, and technical credibility.

For organisations using document extraction inside AI systems, Lumen & Lever provides document-ingestion review and structural AI architecture work.

View Document Structure Review