PDF Tools

Overview

Contains multiple Smart Services and Functions for interacting with PDF Documents.

Key Features & Functionality

Smart Services:

  • Merge PDF - Merges multiple PDF documents into a single document.
  • Extract PDF Pages - Extracts a range of pages from an existing PDF into a new PDF.
  • Fill PDF - Populates the fields of a PDF Form and optionally flattens it disallow further changes.
  • Create PDF Content - Allows text to be added to a PDF with control over the style, position, and angle. An existing PDF can be updated or a new one created from scratch.
  • Convert PDF to Image - Creates an array of images or a multi-page tiff from a PDF.
  • Compress PDF - Compresses the images in the PDF to make it smaller.
  • Un-protect and Copy PDF - Using the document password, create an un-protected copy of a protected PDF.
  • Convert Image to PDF - Creates a PDF starting from one or many images. It also supports multi-page tiff images.
  • Encrypt PDF - Encrypts an existing PDF with a password.

Functions:

  • Get PDF Metadata - Retrieves metadata on the PDF: page count, title, author, security, encryption, etc.
  • Get PDF Text - Retrieves the text content from a PDF.
  • Get PDF Form Fields - Retrieves the populated form field values of an unflattened PDF.
  • Get PDF Signature Fields - Retrieves the populated signature field values of an unflattened PDF.
  • Get PDF Bookmarks - Retrieves the list of bookmarks and associated page number in the PDF

Anonymous
Parents
  • Hello,

    I was having issues parsing a pdf.

    The pdf contains data in a table format with some blank data cells. When using getpdftext(), the delimiter between 2 adjacent cells and between 2 cells with a missing cell value in between is the same (<space>). Because of this, I'm unable to map the cell data with the correct column header.


    Any help is much appreciated. Thanks!

Comment
  • Hello,

    I was having issues parsing a pdf.

    The pdf contains data in a table format with some blank data cells. When using getpdftext(), the delimiter between 2 adjacent cells and between 2 cells with a missing cell value in between is the same (<space>). Because of this, I'm unable to map the cell data with the correct column header.


    Any help is much appreciated. Thanks!

Children
No Data