Disable/Enable Pepperdata Data Collection for a Host (Parcel)

Occasionally you might want Pepperdata to not collect data from a cluster host on which Pepperdata is installed. Or, you might want to re-enable data collection for a host where you previously disabled data collection. In such cases, you can disable or enable the host from Pepperdata data collection by configuring the host’s PD_COLLECT_AND_UPLOAD environment variable.

Disable Pepperdata Data Collection for a Host

A typical reason for disabling Pepperdata data collection for a host is when you want to install Pepperdata in a test environment that mimics your Pepperdata-installed production environment as closely as possible, but you do not want Pepperdata to collect data from the test hosts. Or, you might want to install Pepperdata to manage edge hosts that are not managed by YARN, and you want to omit those hosts from Pepperdata capacity planning calculations and consideration.

When you disable data collection for a host, the Collector (the pepcollectd agent) stops collecting the data from the other Pepperdata agents, and stops sending data to the Pepperdata dashboard. Charts, tables, and reports include all the data that was collected and sent to the dashboard before data collection was disabled, but contain no data for times during which data collection is disabled. The remaining Pepperdata agents, such as PepAgent, continue to run, collecting metrics and performing calculations for dynamic allocation.

Procedure

  1. In Cloudera Manager, create a PepCollector Disabled Group role group in the Pepperdata configuration:

    • Group Name: PepCollector Disabled Group
    • Role Type: PepCollector
    • Copy From: PepCollector Default Group
  2. Move the hosts that you want to omit from data collection from the PepCollector Default Group to the PepCollector Disabled Group.

  3. Disable data collection for the host(s) in the PepCollector Disabled Group.

    1. Locate the Collect and upload metrics parameter.

    2. Click Edit Individual Values

    3. For the PepCollector Disabled Group, deselect the Collect and upload metrics parameter.

  4. Select the Restart action for the PepCollector service.

    Data collection immediately stops for the hosts in the PepCollector Disabled Group, and no more data (including data that was already collected but that hasn’t yet been uploaded due to the two to five minute upload interval) appears for them in the Pepperdata dashboard. Data previously uploaded to the dashboard remains available for charts, tables, and reports.

Enable Pepperdata Data Collection for a Host

If data collection is disabled for a host, during installation or sometime later, you can manually re-enable it whenever you want. When you re-enable data collection for a host, data uploading resumes immediately. Depending on the configured disk space and data retention settings (see Configure Pepperdata Logs Retention and Disk Usage), up to seven days of previous data is also uploaded.

Assumptions

This procedure assumes that you want to re-enable host(s) that are in a previously-created PepCollector Disabled Group role group. To enable hosts in a more granular fashion, create separate role groups and configure their Collect and upload metrics parameters as applicable.

Procedure

  1. In Cloudera Manager, move the hosts for which you again want to collect data from the PepCollector Disabled Group to the PepCollector Default Group

  2. Select the Restart action for the PepCollector service.