On the surface, the logic behind data lakes is compelling: Put the full volume and variety of your enterprise’s data in one place and you’ll be able to manage it better, protect it better and, ultimately, derive valuable analytic insights. But while data lakes may indeed be able to help enterprises manage the Data Variety problem, without proper attention to the curation of this data, many data lakes are bound to turn into expensive, unproductive Data Swamps. Also, in many (probably most) cases, a single data lake will not be a practical answer. So data curation across multiple lakes, ponds or other storage locations may well be the most important investment. We call this Data Unification.
Jerry Held, Tamr.com - October 2015















