How can I help you?
Welcome to Syncfusion Data Extraction Library
12 Mar 20261 minute to read
Syncfusion® Smart Data Extractor is a high performance, deterministic C# library that extracts complete document structures from PDFs and images.
List of Data Extraction Libraries
- SmartDataExtractor - analyzes visual layout lines, boxes, labels, and alignment to locate and extract elements such as table structure, text elements, images, headers, footers, and form fields with per-field confidence scores for immediate review, export, or integration.
- SmartTableExtractor - detects table regions, header rows, columns, and cell spans (merged cells) and provides per-cell confidence scores and structured exports ready for downstream processing.
- SmartFormRecognizer - analyzes visual layout cues—lines, boxes and circles—it locates form regions and extracts common controls (text fields, checkboxes, radio buttons, signatures), producing clean JSON with per-field confidence scores. Where form fields are detected, the library can also produce a fillable PDF with corresponding form fields added for immediate use.