Is Data Deduplication Important? Why?

One of the greatest challenges facing the data storage community is how to store information effectively without saving the exact same data over and over again in different locations on the same servers, hard drives, magnetic tape libraries and so forth. There have been numerous efforts to handle these redundancies, some more successful than others. As the price of many storage options fell substantially, an attitude took hold in the storage community that saving storage space was an exercise whose time had passed. Then, as the regulatory environment became tighter, the mass of saved information once again began to explode, and more options started to be considered to address storage concerns.

The latest solution proposed by the storage camp is a technology called data deduplication, also known as “single-instance storage” and “intelligent compression.” This storage process takes a piece of data and stores it one time. Every later occurrence of that data is then replaced by a pointer (or pointers) that refers back to the single master copy. This is particularly efficient when multiple instances of the same data are being archived, because only one instance of the information needs to be stored. This trims storage requirements and back-up times considerably.
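The store-once, point-many idea can be sketched in a few lines of Python. This is only an illustrative toy, not how any particular product works: the class name `SingleInstanceStore`, its methods, and the choice of SHA-1 as the lookup key are all assumptions made for the example.

```python
import hashlib


class SingleInstanceStore:
    """Toy single-instance store (hypothetical): each distinct payload is kept once."""

    def __init__(self):
        self.masters = {}   # fingerprint -> the one stored master copy
        self.pointers = {}  # item name -> fingerprint of its master copy

    def put(self, name, data):
        # Assumption: a SHA-1 digest identifies the content; collisions are ignored.
        fingerprint = hashlib.sha1(data).hexdigest()
        # Store the payload only if this content has not been seen before.
        self.masters.setdefault(fingerprint, data)
        # The item itself is recorded as nothing more than a pointer.
        self.pointers[name] = fingerprint

    def get(self, name):
        # Follow the pointer back to the master copy.
        return self.masters[self.pointers[name]]


store = SingleInstanceStore()
report = b"quarterly report contents"
store.put("alice/report.doc", report)
store.put("bob/report.doc", report)  # same data, so only a pointer is added
```

After both calls, `store.pointers` holds two entries while `store.masters` holds a single copy of the report, and `store.get` returns the full data for either name.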

Suppose a department-wide e-mail attachment two megabytes in size is circulated to fifty different e-mail accounts, and each one must be archived. Instead of saving the attachment fifty times, it’s saved one time, for a savings of ninety-eight megabytes of storage space for this one attachment. Multiply this across numerous departments and thousands of e-mails over the course of a year and the savings can be significant. Recovery time objectives (RTO) also improve significantly with data deduplication, which cuts down the need for back-up magnetic tape libraries. It also lowers overall storage space requirements, yielding substantial savings in hardware storage procurement.
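The arithmetic behind the attachment example is simple enough to check directly. The function name `dedup_savings_mb` is made up for this sketch, and it assumes the pointers themselves take up a negligible amount of space.

```python
def dedup_savings_mb(size_mb, copies):
    """Space saved by keeping one copy instead of `copies` copies.

    Assumption: pointer overhead is small enough to ignore.
    """
    without_dedup = size_mb * copies  # every recipient's copy archived in full
    with_dedup = size_mb              # a single master copy
    return without_dedup - with_dedup


savings = dedup_savings_mb(size_mb=2, copies=50)
print(savings)  # 98
```

Fifty copies at two megabytes each come to one hundred megabytes, and keeping only one copy leaves ninety-eight megabytes saved.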

Operating at the block (sometimes byte) level allows smaller pieces of data to be saved, because only the unique versions of each block that has been modified are identified and stored. Rather than saving an entire file each time a bit of information in that file changes, just the modified data is saved. Hash algorithms such as SHA-1 or MD5 are used to generate a number for each block of data that is, for practical purposes, unique, so that changed blocks can be told apart from ones already stored. Data deduplication is most efficient when applied in conjunction with additional data reduction methods; delta differencing and traditional compression are two such methods. This combination can greatly reduce the errors that non-redundant systems might incur.
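Block-level deduplication with hash fingerprints might look roughly like the sketch below. It is a simplified illustration under several assumptions: fixed 4096-byte blocks, SHA-1 fingerprints with collisions ignored, and an in-memory dictionary standing in for the block store. The names `store_file` and `restore_file` are hypothetical.

```python
import hashlib

BLOCK_SIZE = 4096  # assumed fixed block size for this sketch


def store_file(data, block_store, block_size=BLOCK_SIZE):
    """Split data into blocks and save only blocks not already in block_store.

    Returns the file's recipe (ordered list of fingerprints) and the
    number of blocks that actually had to be written.
    """
    recipe = []
    new_blocks = 0
    for offset in range(0, len(data), block_size):
        block = data[offset:offset + block_size]
        fingerprint = hashlib.sha1(block).hexdigest()
        if fingerprint not in block_store:
            block_store[fingerprint] = block  # first time this block is seen
            new_blocks += 1
        recipe.append(fingerprint)
    return recipe, new_blocks


def restore_file(recipe, block_store):
    """Rebuild a file by following its fingerprints back to the stored blocks."""
    return b"".join(block_store[fp] for fp in recipe)


block_store = {}
version1 = b"A" * 4096 + b"B" * 4096 + b"C" * 4096
version2 = b"A" * 4096 + b"X" * 4096 + b"C" * 4096  # only the middle block changed

recipe1, new1 = store_file(version1, block_store)
recipe2, new2 = store_file(version2, block_store)
```

Saving the first version writes three blocks. Saving the second version writes only the one block that changed, yet both versions can still be rebuilt in full with `restore_file`.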

 

 

 

 


One Response to “Is Data Deduplication Important? Why?”

  1. Ashley says:

    More people are turning to deduplication software, which plays a major part in storage and ultimately reduces storage space as well as energy consumption.
