Creating a User-Defined Subclient to Back Up Hadoop Data

Subclients contain information about what data is backed up. You can create a user-defined subclient to manage and back up specific data.

Before You Begin

You can use wildcards to define the subclient content. For more information, see Wildcards for the UNIX File System Agent.


  1. From the CommCell Browser, expand Client Computers > pseudo-client > Big Data Apps.
  2. Right-click the instance that you want to create subclient for, point to All Tasks, and then click New Subclient.

    The Subclient Properties dialog box appears.

  3. Specify the basic settings for the subclient:
    1. In the Subclient Name box, type a name.
    2. On the Data Access Nodes tab, select the data access nodes that you want to add to the subclient, and then click Add.
    3. On the Content tab, click Browse to select the directory or file that you want to back up, and then click Add.

      Repeat this step to include all the files and directories that you want to back up.

      Note: The default subclient does not back up the content that you specify in the user-subclients that are within the same instance.

  4. Click Advanced.

    The Advanced Subclient Properties dialog box appears.

  5. To configure multiple streams for backups, on the Performance tab, specify the number of data streams:
    1. In the Number of Data Readers box, enter the number of data streams.


      • For optimal sharing of the backup load, the number of data readers must be greater than the number of data access nodes.
      • The number of streams configured in the storage policy must be equal to or greater than the value entered in the Number of Data Readers box.
    2. Select the Allow multiple data readers within a drive or mount point check box
    3. Click OK.
  6. On the Storage Device tab, select a storage policy from the Storage Policy list.
  7. To create a new storage policy, click Create Storage Policy, and then follow the instructions in the storage policy creation wizard.
  8. To perform LAN-free backups and restores, select a grid storage policy.

    For more information, see GridStor® (Alternate Data Paths) - Overview.

  9. Optional: Select the subclient options.

    Configuring Backups for Recently Modified or Changed Data

    Retaining Additional Versions of a File During Synthetic Full Jobs

    Setting Up Pre-processes and Post-processes

    Setting Up Network Bandwidth Throttling for a Subclient

    Modifying Software Compression on a Subclient

    Viewing Data Paths

    Configuring Activity Control

    Configuring Data Encryption

  10. Click OK.

A subclient with the content that you want to back up is created under the instance that you selected.

Last modified: 5/18/2018 2:46:02 PM