Skip to main content

Turn on Data Deduplication

Over many years as a Network Administrator, I constantly struggled with the data storage needs of my users. Not only did we need to allocate funds for greater amounts of storage, but also greater amounts of funds for backup and recovery operations that would meet the needs of the organization.

One of the big problems was duplicate information being stored by multiple users. Duplicate data adds to your cost in several ways.

· Increase in storage cost due to capacity depletion from duplicated data.

· Increased number of backup media, and the cost associated with storage, transportation, and replacement of the media.

· Increased recovery times.

· Purchasing and deployment of new disaster recovery hardware so backup and recovery operations can stay within established time frames.

Data Deduplication can help reduce the cost of the above bulleted points. Data deduplication will remove duplicated blocks of data and place references to a single copy stored on the volume. It works well for data stores that are not frequently changed and will not work on boot partitions or partitions containing the operating system. You can achieve reduced storage capacity for file shares, software deployment shares, and virtual hard disk libraries. Data deduplication is only support on the NTFS file system and not on the new Resilient File System (ReFS).

Here is how you turn it on.

First we need some duplicate files. (OK, not really, but I wanted to have some files on the drive.)

image

Here you can see that we have a couple of files that are store in different locations, but are duplicates of each other. They also reside on an NTFS formatted volume that does not contain the boot or OS partition.

Open Server Manager and click Manager –> Add Roles and Features.

Click Next three times.

Expand File and Storage Services –> File and iSCSI Services.

Check Data Deduplication and click Next.

clip_image002

Click Next and then Install.

Click Close.

You can monitor the installation in Server Manager.

clip_image003

No restart is necessary.

On the Server Manager click File and Storage Services.

clip_image004

Click Volumes

clip_image005

Right click the E: drive and select Configure Data Deduplication. (Note: it may take a few minutes before you can select Configure Data Deduplication.)

clip_image006

Check Enable data deduplication.

Click Set Deduplication Schedule

Check Enable throughput optimization. This will set the time when data deduplication will run with normal priority. This allows time for more processor capacity to be dedicated to the deduplication process.

Click OK twice.

Data Deduplication is now set up.

Comments

Popular posts from this blog

How to list all the AD LDS instances on a server

AD LDS allows you to provide directory services to applications that are free of the confines of Active Directory.  To list all the AD LDS instances on a server, follow this procedure: Log into the server in question Open a command prompt. Type dsdbutil and press Enter Type List Instances and press Enter . You will receive a list of the instance name, both the LDAP and SSL port numbers, the location of the database, and its status.

How to run GPResult on a remote client with PowerShell

In the past, to run the GPResult command, you would need to either physically visit this client, have the user do it, or use and RDP connection.  In all cases, this will disrupt the user.  First, you need PowerShell remoting enabled on the target machine.  You can do this via Group Policy . Open PowerShell and type this command. Invoke-Command –ScriptBlock {GPResult /r} –ComputerName <ComputerName> Replace <ComputerName> with the name of the target.  Remember, the target needs to be online and accessible to you.

Error icon when creating a GPO Preference drive map

You may not have an error at all.  Take a look at the drive mapping below. The red triangle is what threw us off.  It is not an error.  It is simply a color representation of the Replace option of the Action field in the properties of the drive mappings. Create action This give you a green triangle. The Create action creates a new mapped drive for users. Replace Action The Replace action gives you a red triangle.  This action will delete and recreate mapped drives for users. The net result of the Replace action is to overwrite all existing settings associated with the mapped drive. If the drive mapping does not exist, then the Replace action creates a new drive mapping. Update Action The Update action will have a yellow triangle. Update will modify settings of an existing mapped drive for users. This action differs from Replace in that it only updates settings defined within the preference item. All other settings remain as configured on the mapped drive. If the