Over the last few years, it started sounding like deduplication was becoming commoditized and turning into just a feature. FalconStor, Sepaton and Quantum announced massive clustered dedupe projects in 2005-7 bolted onto existing VTL products. They lined up OEMs or acquirers, including HP, Sun and EMC. The specs were unbelievable.
If it is too good to be true, sometimes there is a reason. A year or more later, Data Domain is scaling as promised, but the bolt-ons are struggling to meet expectations in robustness and economic impact. Dedupe is an effect, not a method, and the methods are clearly not a commodity, as can be seen in examples like this.
Bolt-on specs are sometimes a wish list. For example, Quantum's DXi 7500 maximum speed specs need a dual-controller system. But that configuration is apparently not available, more than a year since it was announced. What else is lurking that could throw claimed specs into a new light?
For example, in that same system (the basis of the EMC DL3D 3000)
- Backups are written to a non-dedupe disk cache to get higher speed, except in the narrow case of the "adaptive" or "dedupe on" modes, in which writes that are a third or less of their benchmark rates will dedupe inlines; and
- All cached data is kept after deduping as long as possible, so benchmark reads are generally from the pre-dedupe cache to get higher speed (ask about their truncation model).
So their benchmarks could all involve the cache, not the dedupe store. What's the real dedupe rate? Can you know when dedupe on the non-dedupe cache will complete? Can you trust the specs? Time will tell.
Here is the problem. If dedupe matters, you have to be able to finish backup and dedupe in a day. A post-process system will fill up much faster than expected and/or get controller-bound and slow if it gets too far behind..If you replicate, the DR site will always be behind. It would really be a bummer if you didn't expect this.
So here's an experiment you can try at home as a reality check. (These products still don't have much of a reference base that will look like your situation, and without that, a reasonable buyer should do a proof of concept lab test.) If you do a bolt-on test, do it with a pair of large dedupe products, replicating across a LAN to avoid latency issues.
- Backup 30 or 40 TBs as fast as you can (or whatever they say they can dedupe in a day.) Bolt-on system designs can be very curious under stress, so don't go small.
- Time the duration from start of backup to finish or dedupe replication. The end-to-end rate of this test will be a fair proxy for the internal dedupe rate as long as it replicates only deduped data. (Good luck estimating dedupe completion time with their GUIs - it is often left as a daily calculus project.)
- Now try restoring from the replica to see how fast restores are; the replica only has deduped data, so it won't fake you out by restoring from the non-dedupe cache.
Data Domain's systems are easy: with inline dedupe, the only throughput is dedupe throughput.