PDFix SDK is must to have SDK for any organization which process PDF documents. New 5.0.27 version is focused on accessibility, PDF/UA compliance and custom configurations. A new PDFix SDK Enterprise version brings the ability to customize layout detection, data extraction, and tagging.

New functions allow developers to programmatically create content structure (PdsStructTree, PdsStructElement) and also manipulate and edit existing structure elements, marked content, properties, and attributes. With manual adjustments, you will have full control over the process of creating PDF/UA compliant files.

The automated data extraction process is now fully customizable via the configuration file. In addition to the general set of thresholds, a developer can use special query language to dramatically improve the quality of data extraction. These two major features in combination can streamline your remediation workflows.

New Objects and Methods

  • Pdfix – RegisterPlugin, GetPluginByName, GetEvent
  • PdfixPlugin – GetPdfixVersionMajor, GetPdfixVersionMinor, GetPdfixVersionPatch
  • PsEvent – GetType, GetDoc, GetPage, GetAnnot
  • PdsStructTree – GetObject, RemoveKid, CreateStructElement, GetDoc, UpdateParentTree
  • PdsStructElement – SetAlt, AddAttrObj, RemoveAttrObj, RemoveKid, CreateStructElement, SetParent, AddPageObject, AddAnnot, GetStructTree, SetType
  • PdfPageView – Release
  • PdePageMap – Release, AcquireElements, CreateElement, AddTags
  • PdfPage – Release, GetLogicalRotate, CreatePageMap, GetNumPageObjects, GetPageObject, GetResources, GetObject, FlattenFormXObjects, GetContentFlags, SetContent, GetDoc
  • PdfDocTemplate – GetProperty, SetProperty, GetRegex, SetRegex
  • PdfDoc – RemoveTags, CreateDictObject, CreateArrayObject, CreateNameObject, CreateStringObject, CreateIntObject, CreateNumberObject, CreateStreamObject, GetObjectById, CreateStructTree, RemoveStructTree, RemoveBookmarks, CreateBookmarks, AddFontMissingUnicode
  • PdfAnnot – GetStructObject
  • PdfAction – GetDestPageNum
  • PdeText – GetTextFlags, GetLabelLevel, SetLabelLevel
  • PdeTextLine – GetTextLineFlags
  • PdeWord – GetWordFlags
  • PdeElement – SetBBox, SetRender, SetData, GetData, SetAlt, SetActualText, GetFlags, SetFlags, PdsContentMark, GetNumTags, GetTagName, GetTagObject, GetTagMcid, GetTagArtifact, AddTag, RemoveTag
  • PdsForm – GetNumPageObjects, GetPageObject
  • PdsText – GetTextState
  • PdsPageObject – GetId, SetRender, GetStructObject, GetContentMark, GetPage
  • PdsStream – IsEof, GetSize, Read, GetPos
  • PdsDictionary – Put, GetDictionary, GetArray, GetStream, GetString, GetText, GetNumber, GetInteger, GetBoolean
    PdsArray – Put, Insert, GetDictionary, GetArray, GetStream, GetString, GetText, GetNumber, GetInteger
  • PdsName – GetText
  • PdsString – GetText
  • PdsObject – GetId
  • OcrTesseract – SetDataPath, SetEngine
  • TesseractDoc – OcrImageToPage

Modified Objects and Methods

  • PdsStructElement – GetType, GetID
  • PdfDoc – GetRootObject, GetInfoObject

Removed Objects and Methods

  • Pdfix – SetRegex
  • PdsStructTree – ReleaseStructElement
  • PdfPage – ReleasePageMap, ReleasePageView, GetNumMcidPageObjects, GetMcidPageObject
  • PdfDoc – ReleasePage, RemoveStructTree
  • PdeTextLine – GetFlags
  • PdeWord – GetFlags
  • PdeElement – SetRenderMode
  • PdsReference – GetObjectNumber
  • OcrTesseract – SetData
  • TesseractDoc – Save

PDFix SDK Version 5.0.27 is available for Windows, Mac and Linux.

You can learn more and download trial from PDFix API here.

Related Articles

  1. Extract data from PDF with API from PDFix
  2. Responsive PDF, or how to convert PDF to HTML with PDFix
  3. Print2PDF version 8.0.08.1007 has been released
  4. Print2PDF version 8.0.10.0125 has been released
  5. Print2PDF version 8.0.08.1124 has been released

Tags: , , , , ,