Description:
Fetches data from an HTTP or HTTPS URL and writes the data to the content of a FlowFile. Once the content has been fetched, the ETag and Last Modified dates are remembered (if the web server supports these concepts). This allows the Processor to fetch new data only if the remote data has changed or until the state is cleared. That is, once the content has been fetched from the given URL, it will not be fetched again until the content on the remote server changes. Note that due to limitations on state management, stored “last modified” and etag fields never expire. If the URL in GetHttp uses Expression Language that is unbounded, there is the potential for Out of Memory Errors to occur.
Tags:
get, fetch, poll, http, https, ingest, source, input
Properties:
In the list below, the names of required properties appear in bold. Any other properties (not in bold) are considered optional. The table also indicates any default values, whether a property supports the Expression Language Guide, and whether a property is considered “sensitive”, meaning that its value will be encrypted. Before entering a value in a sensitive property, ensure that the nifi.properties file has an entry for the property nifi.sensitive.props.key.
Name | Default Value | Allowable Values | Description |
URL |
The URL to pull from Supports Expression Language: true |
||
Filename |
The filename to assign to the file when pulled Supports Expression Language: true |
||
SSL Context Service |
Controller Service API: SSLContextService Implementation: StandardSSLContextService |
The Controller Service to use in order to obtain an SSL Context | |
Username | Username required to access the URL | ||
Password |
Password required to access the URL Sensitive Property: true |
||
Connection Timeout | 30 sec | How long to wait when attempting to connect to the remote server before giving up | |
Data Timeout | 30 sec | How long to wait between receiving segments of data from the remote server before giving up and discarding the partial file | |
User Agent | What to report as the User Agent when we connect to the remote server | ||
Accept Content-Type | If specified, requests will only accept the provided Content-Type | ||
Follow Redirects | false |
* true * false |
If we receive a 3xx HTTP Status Code from the server, indicates whether or not we should follow the redirect that the server specifies |
Proxy Host | The fully qualified hostname or IP address of the proxy server | ||
Proxy Port | The port of the proxy server |
Relationships:
Name | Description |
success | All files are transferred to the success relationship |
Reads Attributes:
None specified.
Writes Attributes:
Name | Description |
filename | The filename is set to the name of the file on the remote server |
mime.type | The MIME Type of the FlowFile, as reported by the HTTP Content-Type header |