HelpBot Assistant

How can I help you?

Welcome to Syncfusion Data Extraction Library

12 Mar 20261 minute to read

Syncfusion® Smart Data Extractor is a high performance, deterministic C# library that extracts complete document structures from PDFs and images.

List of Data Extraction Libraries

  • SmartDataExtractor - analyzes visual layout lines, boxes, labels, and alignment to locate and extract elements such as table structure, text elements, images, headers, footers, and form fields with per-field confidence scores for immediate review, export, or integration.
  • SmartTableExtractor - detects table regions, header rows, columns, and cell spans (merged cells) and provides per-cell confidence scores and structured exports ready for downstream processing.
  • SmartFormRecognizer - analyzes visual layout cues—lines, boxes and circles—it locates form regions and extracts common controls (text fields, checkboxes, radio buttons, signatures), producing clean JSON with per-field confidence scores. Where form fields are detected, the library can also produce a fillable PDF with corresponding form fields added for immediate use.