Description:

Fetches data from an HTTP or HTTPS URL and writes the data to the content of a FlowFile. Once the content has been fetched, the ETag and Last-Modified values are remembered (if the web server supports these concepts). This allows the Processor to fetch new data only when the remote data has changed or after the state has been cleared. That is, once the content has been fetched from the given URL, it will not be fetched again until the content on the remote server changes. Note that, due to limitations on state management, the stored Last-Modified and ETag values never expire. If the URL in GetHttp uses Expression Language that can evaluate to an unbounded number of distinct URLs, the stored state can grow without limit and Out of Memory errors may occur.
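The conditional-fetch behavior described above can be sketched in a few lines. This is a minimal illustration, not the Processor's actual implementation: the names `conditional_headers` and `handle_response`, the per-URL dictionary used as state, and the exact-case header lookup are all assumptions made for the example. It relies only on the standard HTTP validators (`If-None-Match`, `If-Modified-Since`) and the `304 Not Modified` status.

```python
from typing import Dict

# Hypothetical in-memory state: one entry per distinct URL, never expired.
State = Dict[str, Dict[str, str]]


def conditional_headers(state: State, url: str) -> Dict[str, str]:
    """Build the validator headers to send for a URL that was fetched before."""
    remembered = state.get(url, {})
    headers: Dict[str, str] = {}
    if "etag" in remembered:
        headers["If-None-Match"] = remembered["etag"]
    if "last_modified" in remembered:
        headers["If-Modified-Since"] = remembered["last_modified"]
    return headers


def handle_response(state: State, url: str, status: int,
                    response_headers: Dict[str, str]) -> bool:
    """Remember new validators; return True when the body should be emitted."""
    if status == 304:
        # Not Modified: the remote content is unchanged, so nothing is emitted.
        return False
    entry: Dict[str, str] = {}
    if "ETag" in response_headers:
        entry["etag"] = response_headers["ETag"]
    if "Last-Modified" in response_headers:
        entry["last_modified"] = response_headers["Last-Modified"]
    if entry:
        state[url] = entry
    else:
        # The server sent no validators, so nothing can be remembered.
        state.pop(url, None)
    return True
```

Because the sketch keeps one entry per distinct URL and never removes entries on its own, a URL expression that keeps producing new values makes the dictionary grow indefinitely, which mirrors the memory warning above.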

Tags:

get, fetch, poll, http, https, ingest, source, input

Properties:

In the list below, the names of required properties appear in bold. Any other properties (not in bold) are considered optional. The table also indicates any default values, whether a property supports the Expression Language, and whether a property is considered "sensitive", meaning that its value will be encrypted. Before entering a value in a sensitive property, ensure that the nifi.properties file has an entry for the property nifi.sensitive.props.key.

| Name | Default Value | Allowable Values | Description |
|---|---|---|---|
| URL | | | The URL to pull from. Supports Expression Language: true |
| Filename | | | The filename to assign to the file when pulled. Supports Expression Language: true |
| SSL Context Service | | Controller Service API: SSLContextService; Implementation: StandardSSLContextService | The Controller Service to use in order to obtain an SSL Context |
| Username | | | Username required to access the URL |
| Password | | | Password required to access the URL. Sensitive Property: true |
| Connection Timeout | 30 sec | | How long to wait when attempting to connect to the remote server before giving up |
| Data Timeout | 30 sec | | How long to wait between receiving segments of data from the remote server before giving up and discarding the partial file |
| User Agent | | | What to report as the User Agent when connecting to the remote server |
| Accept Content-Type | | | If specified, requests will only accept the provided Content-Type |
| Follow Redirects | false | true, false | If a 3xx HTTP Status Code is received from the server, indicates whether or not to follow the redirect that the server specifies |
| Proxy Host | | | The fully qualified hostname or IP address of the proxy server |
| Proxy Port | | | The port of the proxy server |
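To make the request-related properties more concrete, the sketch below shows one plausible way they could map onto an outgoing HTTP request, using Python's standard `urllib.request`. It is an illustration under assumptions, not the Processor's code: the helper `build_request`, the dictionary of property values, the use of a preemptive Basic `Authorization` header, and the mapping of Accept Content-Type to the `Accept` header are all choices made for this example. Timeouts, redirects, and the SSL Context Service are left out.

```python
import base64
import urllib.request
from typing import Dict, Optional


def build_request(props: Dict[str, Optional[str]]) -> urllib.request.Request:
    """Translate a dictionary of property values into a urllib Request."""
    request = urllib.request.Request(props["URL"])

    # User Agent: reported to the remote server in the User-Agent header.
    if props.get("User Agent"):
        request.add_header("User-Agent", props["User Agent"])

    # Accept Content-Type: assumed here to be sent as the Accept header.
    if props.get("Accept Content-Type"):
        request.add_header("Accept", props["Accept Content-Type"])

    # Username / Password: sent as HTTP Basic credentials (an assumption).
    if props.get("Username"):
        credentials = "{}:{}".format(props["Username"], props.get("Password") or "")
        token = base64.b64encode(credentials.encode("utf-8")).decode("ascii")
        request.add_header("Authorization", "Basic " + token)

    # Proxy Host / Proxy Port: route the request through the given proxy.
    if props.get("Proxy Host") and props.get("Proxy Port"):
        proxy = "{}:{}".format(props["Proxy Host"], props["Proxy Port"])
        request.set_proxy(proxy, "http")

    return request
```

A call such as `build_request({"URL": "http://example.com/data.csv", "Accept Content-Type": "text/csv"})` returns a request object that can be passed to `urllib.request.urlopen`; the URL and values shown are placeholders.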

Relationships:

| Name | Description |
|---|---|
| success | All files are transferred to the success relationship |

Reads Attributes:

None specified.

Writes Attributes:

| Name | Description |
|---|---|
| filename | The filename is set to the name of the file on the remote server |
| mime.type | The MIME Type of the FlowFile, as reported by the HTTP Content-Type header |