Disable/Enable Pepperdata Data Collection for a Host (RPM/DEB)

Occasionally you might want Pepperdata to not collect data from a cluster host on which Pepperdata is installed. Or, you might want to re-enable data collection for a host where you previously disabled data collection. In such cases, you can disable or enable the host from Pepperdata data collection by configuring the host’s PD_COLLECT_AND_UPLOAD environment variable.

Disable Pepperdata Data Collection for a Host

A typical reason for disabling Pepperdata data collection for a host is when you want to install Pepperdata in a test environment that mimics your Pepperdata-installed production environment as closely as possible, but you do not want Pepperdata to collect data from the test hosts. Or, you might want to install Pepperdata to manage edge hosts that are not managed by YARN, and you want to omit those hosts from Pepperdata capacity planning calculations and consideration.

When you disable data collection for a host, the Collector (the pepcollectd agent) stops collecting the data from the other Pepperdata agents, and stops sending data to the Pepperdata dashboard. Charts, tables, and reports include all the data that was collected and sent to the dashboard before data collection was disabled, but contain no data for times during which data collection is disabled. The remaining Pepperdata agents, such as PepAgent, continue to run, collecting metrics and performing calculations for dynamic allocation.

Procedure

  1. On the host you want to disable from Pepperdata data collection, open the /etc/pepperdata/pepperdata-config.sh file for editing.

  2. Find the PD_COLLECT_AND_UPLOAD environment variable, and change its value from 1 to 0.

  3. Save your changes and close the file.

  4. Restart the Pepperdata Collector.

    You can use either the service (if provided by your OS) or systemctl command:

    • sudo service pepcollectd restart
    • sudo systemctl restart pepcollectd

    Data collection immediately stops, and no more data (including data that was already collected but that hasn’t yet been uploaded due to the two to five minute upload interval) appears in the Pepperdata dashboard. Data previously uploaded to the dashboard remains available for charts, tables, and reports.

Enable Pepperdata Data Collection for a Host

If data collection is disabled for a host, during installation or sometime later, you can manually re-enable it whenever you want. When you re-enable data collection for a host, data uploading resumes immediately. Depending on the configured disk space and data retention settings (see Configure Pepperdata Logs Retention and Disk Usage), up to seven days of previous data is also uploaded.

Procedure

  1. On the host for which you want to enable Pepperdata data collection, open the /etc/pepperdata/pepperdata-config.sh file for editing.

  2. Find the PD_COLLECT_AND_UPLOAD environment variable, and change its value from 0 to 1.

  3. Save your changes and close the file.

  4. Restart the Pepperdata Collector.

    You can use either the service (if provided by your OS) or systemctl command:

    • sudo service pepcollectd restart
    • sudo systemctl restart pepcollectd