Set Up a Pepperdata Proxy (Parcel)

If your cluster hosts must be “air gapped” from the internet or otherwise isolated, you can use a proxy server on your network to enable Pepperdata functionality. Pepperdata is fully integrated with the standard https_proxy environment variable, which you can configure in the Pepperdata configuration file, /etc/pepperdata/pepperdata-config.sh.

Procedure

  1. Configure the environment variable for the proxy host and port, https_proxy.

    In Cloudera Manager, locate the Pepperdata > Configuration > PepCollector > PepCollector Environment Advanced Configuration Snippet (Safety Valve) template.

    Add the environment variable for the proxy host and port, in the following format. Be sure to replace the my_proxy_url and my_proxy_port placeholders with your actual proxy server name and port number.

    https_proxy=my_proxy_url:my_proxy_port
    
  2. Restart the Pepperdata services.

    In Cloudera Manager, select the Restart action for the PepCollector service.

  3. (Optional) Verify that your hosts can connect to the Pepperdata dashboard through the proxy server that you configured.

    1. Log in to a cluster node.

    2. Try to connect to the Pepperdata dashboard.

      Be sure to replace the <my_proxy_url> and <my_proxy_port> placeholders with the same proxy server name and port number that you configured in step 1.

      curl --proxy <my_proxy_url>:<my_proxy_port> --tlsv1.2 -v https://upload-main.pepperdata.com