5.6 TSM Client-side deduplication Overview

With this method, the deduplication of files occurs during client backup to a deduplication-enabled storage pool.
  • Less data travels over LAN because deduplication occurs during backup.
  • Client nodes share the work with the server, identifying duplicate data.
  • Requires processing to remove duplicate data that is no longer needed.
  • Does not backup all data to the server.
  • Became available in version6.2.0.

How to configure client-side deduplication ?

  • Primary storage pool must be a sequential-access disk (FILE) storage pool that is enabled for data deduplication.
  • Client-side data deduplication must be enabled on the server by using the DEDUPLICATION parameter on the REGISTER NODE or UPDATE NODE server command.
  • TheDEDUPLICATION option on the client must be set to yes in the client options file.
  • Filesthatareintendedfor deduplication cannot be excluded.
  • Filewithencryption enabled cannot be deduplicated.
Also Read: Difference between Server-side and Client-side Deduplication
Check the below video to learn to configure TSM Client-side deduplication 

Client-side deduplication process

Client-side deduplication uses the following process:
  • The client creates extents. Extents are parts of files that are compared with other file extents to identify duplicates.
  • The client and server work together to identify the extents that the client must send to the server for storage. Only those extents that are identified by the client that do not match extents that exist on the server are sent.
Also Read: What is Directory Container Storagepool ?
  • The client sends the extents and hash values to a deduplication- enabled storage pool that the server manages. A hash value is a cryptographic irreversible data value that the hashing algorithm generates. The hash value enables comparing two values without the values being disclosed. Tivoli Storage Manager uses hash values to verify data integrity.

Benefits of client-side deduplication

  • Deduplicates file and application data
  • Reduces network traffic by deduplicating data before transfer
  • Reduces storage pool space requirements
  • Faster backups
  • Optimized to reduce network communications when identifying duplicate data
  • New backup-archive client statistics report on deduplication and data reduction savings
  • Exclude specific files on a client from data deduplication with the exclude.dedup client option
  • Data deduplication cache can reduce network traffic

0 Comment to "5.6 TSM Client-side deduplication Overview"

Post a Comment