Direct access to embedded file metadata - PDF

In my project I need to reliably extract the embedded metadata from uploaded files: Page Count, CreationDate, Author, Title.

Currently, my “ExtractDataFromUploadedFile “" integration only lets AI infer these values from read content, which can lead to inconsistencies:

  • 15 pages detected instead of the actual 16, because of a blank page?

  • A creation date listed as jun 01, instead of the actual june 19th, because it the doc itself only says june 2025

Feature Request:
Please approve the open-source npm package pdf-lib:

Create and modify PDF documents in any JavaScript environment.

Designed to work in any modern JavaScript runtime. Tested in Node, Browser, Deno, and React Native environments.

Impact:

  • Higher accuracy for compliance-sensitive workflows

  • Prevents “hallucinated” metadata from AI, ensures consistency with document properties.

  • Increases trust in document processing pipelines.

Please authenticate to join the conversation.

Upvoters
Status

In Review

Board
💡

Feature Request

Date

6 months ago

Author

Remco Komen

Subscribe to post

Get notified by email when there are changes.