How to use Data Integration Platform in Production Server


System Requirement

The amount of hardware and memory needed will depend on the size and nature of the dataflow involved. High configuration machine with 16 GB RAM is capable to run most of the use cases with single Data Integration Platform installation.

The data is stored on disk while Data Integration is processing it. So, Data Integration needs to have sufficient disk space allocated for its various repositories. Particularly the content repository, flowfile repository, and provenance repository.

Hardware

  • Recommended Hardware requirements for

    local machine:


  • RAM required – 16 GB
  • Disk required – 150 GB
  • Recommended Hardware requirements for

    Azure VM:


  • RAM required – 14 GB
  • Disk required – 135 GB
  • Core – 2 cores (in built)

Software

  • Windows 7 or later
  • .NET Framework 4.5 or later


Installation

For Windows

  • For Downloading and Installing Data Integration Platform, refer below link

https://help.syncfusion.com/data-integration/getting-started#downloading-and-installing-data-integration

User and Policy Management


Defined Roles (Groups)

Instead of giving permissions to each individual user, admin can create separate groups with required policies. Then users can be added to group as needed.

Example

  • Consider three groups such as Administrators, Developersand Viewers with defined policies set by admin.

  • If any user need administrator privileges, admin can simply add that user to Administrator group.

To create users, groups, assigning global and component level policies, refer https://help.syncfusion.com/data-integration/security/user-and-policy-management

Accessibility


Restrict root process group access

  • For Production environment, you can restrict accessibility to the root process group for all users except admin users (Administrator group). Admin should have root process group access permission.

Create process groups for various teams

  • Create separate process groups for various teams like Management, Development and Monitoring. Set required policies for each process group. Therefore, admin will have the full rights to access all process group including root process group.

  • For example: users belong to Viewers group, can able to access only Monitoring process group.

Access Policies

Refer below tabular column for defined group and Global Policies

Groups

Access Policies

Administrator

– should able to read and modify all process groups
  • View the user interface
  • Access the Controller - Read, Write
  • Query provenance
  • Access restricted components
  • Access all policies – Read, Write
  • Access user/user group – Read, Write
  • Access configuration files – Read, Write
  • Retrieve site-to-site details
  • View system diagnostics
  • Proxy user requests
  • Access Counters - Read, Write

Developers

– should able to read and modify data only for Development process group. Also, they can have access to read data in Monitoring process group. Root and Management process groups write access should be restricted for this group
  • View the user interface
  • Access the Controller - Read, Write
  • Query provenance
  • Access configuration files – Read
  • View system diagnostics
  • Proxy user requests
  • Access Counters - Read, Write

Viewers –

should able to access only Monitoring process group
  • View the user interface
  • Access the Controller – Read
  • View system diagnostics
  • NOTE

    Policies will be inherited automatically to new process group from parent(root) process group while creating
    Refer below tabular column for defined groups and assign Component Policies accordingly

    Groups

    Process Groups

    Root

    Management

    Development

    Monitoring

    Administrators

    • View the component
    • </li>Modify the component
      </li>
    • View the data
    • Modify the data
    • View the policies
    • Modify the policies
    • View the component
    • Modify the component
    • View the data
    • Modify the data
    • View the policies
    • Modify the policies
    • View the component
    • Modify the component
    • View the data
    • Modify the data
    • View the policies
    • Modify the policies
    • View the component
    • Modify the component
    • View the data
    • Modify the data
    • View the policies
    • Modify the policies

    Developers

    • View the component
    • Modify the component
    • View the data
    • Modify the data
    No Access (Restricted)
    • View the component
    • Modify the component
    • View the data
    • Modify the data
    • View the component
    • View the data

    Viewers

    • View the component
    • View the data
    No Access (Restricted) No Access (Restricted)
    • View the component
    • Modify the component
    • View the data
    • Modify the data

    For more information, refer this link: https://help.syncfusion.com/data-integration/security/user-and-policy-management#policy-details

    Below example illustrate how it will be accessible when different users logged in to Data Integration Platform once groups and policies defined based on above steps.

    Login as Administrators

    Administrator users have permission to read and modify all process groups.



    Login as Developers

    Developers have permission to read and modify the Development process group and able to view data in Monitoring process group. Please refer below screenshots.

    Inside Development process group, Developers will have all permission to read and write.

    Inside Monitoring process group, Developers will have only read permission.



    Login as Viewers

    Viewers can access only Monitoring process group. Please refer the below screenshot.



    Workflow Development

    Workflow development process involves developing, testing, quality check and production deployment. Workflow should be shareable to other servers for testing and production as well. It is also important to maintain Version Control for the workflow in each stage of development cycle. To understand the complete development process, look at the below pictorial illustration.

    Step 1: Develop a workflow in developing environment. After development, create template and download in local drive. Store template in Version Control System.

    Step 2: Share the workflow to testing server (staging) to check the quality of the workflow. After quality check, create it as template and update the template in Version Control System.

    Sharing workflow can be done in 2 ways. It will be explained in detail in deployment section.

    Step 3: After quality check and testing, workflow can publish it into Production Server. In case of any further workflow changes, you can create and update template in Version Control repository. So that the committed template will be reusable and sharable to any other Data Integration Server as per requirement.

    Template

    Templates are XML file. Template can be created for the workflow in the UI and it is downloadable. They can be share and run on other Data Integration Server by uploading the templates.

    For detailed explanation about the templates,(see Working with Templates)


    Configure Data Integration Platform in production

    Production environment port number

    Production environment mode

    Port number

    Secure Data Integration Platform 443
    Non-secure Data Integration Platform 80

    For configuring Data Integration Platform in production, change the hostname and port number details in four components.

    Syncfusion Data Integration Platform

    1. Open the nifi.properties file from C:\Syncfusion\DataIntegration\{build-version}\SDK\NIFI\conf.
    2. Change the nifi.web.https.host and nifi.web.https.port properties to your required host name and port number. Refer to the following screenshot.

    Syncfusion Data Integration agent

    1. Open the DataIntegrationAgent.xml file in notepad from C:\Syncfusion\DataIntegration\{build-version} \Utilities\SyncfusionDataIntegrationAgent
    2. Change the required port number in the corresponding property as shown in the following screenshot.


    3. Click Windows button and press CTRL + R. Enter the services.msc in run command prompt. Search Syncfusion Data Integration Agent in service.msc dialog and, then right-click it and restart.

    Syncfusion Data Integration service manager

    1. Open the DataIntegrationManager.xml file from C:\Syncfusion\DataIntegration\{build-version}\Utilities\SyncfusionDataIntegrationManager
    2. Change the Hostname property and Data Integration Service port number as you require. Refer to the following screenshot.


    3. In system tray, right-click the Syncfusion Data Integration Platform Service Manager and click Quit option. Refer to the following screenshot.


    4. Launch the Data Integration Service Manager by double clicking its exe from C:\Syncfusion\DataIntegration\{build-version}\Utilities\SyncfusionDataIntegrationManager\Syncfusion Data Integration Manager.exe


    5. Restart the service using Syncfusion Data Integration Service Manager to reflect the changes in the Data Integration service.


      Syncfusion User Management Server
    6. In User Management Server home page, click view Applications.


    7. In application page, click Edit option for the added Data Integration application.


    8. Add the changed URL using Add Url option and click Update.


    9. Now, URL will be added in application details and it will be accessible.



    Monitoring

    Data Integration Platform provides various processors, reporting tasks and notification services to monitor your workflow. It is easy to send mail along with the workflow status and logs as needed. Also, it is possible to monitor the memory usage and getting alert when reaches the configured threshold through reporting tasks. Bulletin board has a support to monitor all error messages of running processors in single dialog. Session view allow you to monitor the complete status of a workflow and its status in easy manner.

    For more information for Data Integration Platform monitoring solution, you can refer our user guide documentation https://help.syncfusion.com/data-integration/how-to/monitor-data-integration-platform

    Deployment

    In Data Integration Platform, there are two ways to deploy the workflow into the Production Server.

    Publish Option

    • Right click selected data flow and click Publish.

    • Templates can also be published to production server. Go to templates tab, click option menu for specific template and click Publish option.

    • Enter valid production server details in Publish Template dialog and click Publish button.

    • New process group with template will be published to Production Server.

    Download and Upload Template

    • Navigate to the required process group in which you need to create template.

    • Navigate to Template tab and click Create button.

    • In Create Template dialog, enter Template name and description if required.

    • You can find created template in left side template panel. Click Download in dropdown option to download it.

    • Once template downloaded, you need to move it to production server and then upload it using Upload option.

    • Once Upload operation is completed, then uploaded template will be listed in template list.

    • For version control, downloaded template can be saved in any storage repositories such as SVN, GitHub, GitLab, etc.

    Fine Tuning

    To improve overall performance of Data Integration Platform, you can increase Heap and PermGen memory configuration manually based on recommendation provided in user guide documentation

    https://help.syncfusion.com/data-integration/admin-guide#fine-tuning-memory