Data engineering is the process of building analytic data infrastructure, or internal data products, that supports the collection, cleansing, storage, and processing (in batch or real time) of data used to answer business questions. Those questions are usually answered by a data scientist, a statistician, or someone in a similar role, although in some cases the engineering and analysis functions overlap.
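
To make the four stages concrete, here is a minimal batch sketch in Python using only the standard library. It is an illustration under stated assumptions, not a reference design: the sample data, the `orders` table, and every function name (`collect`, `cleanse`, `store`, `revenue_by_region`, `run_pipeline`) are hypothetical, and a real pipeline would typically read from external sources and write to a durable store rather than an in-memory database.

```python
import csv
import io
import sqlite3

# Hypothetical raw input; in practice this might come from logs, an API, or files.
RAW_CSV = """order_id,region,amount
1, north ,10.50
2,south,
3,north,4.50
4,,7.00
"""


def collect(raw_text):
    """Collection: parse raw CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(raw_text)))


def cleanse(rows):
    """Cleansing: trim whitespace and drop rows missing a region or an amount."""
    cleaned = []
    for row in rows:
        region = (row["region"] or "").strip()
        amount = (row["amount"] or "").strip()
        if region and amount:
            cleaned.append((int(row["order_id"]), region, float(amount)))
    return cleaned


def store(conn, records):
    """Storage: persist the cleaned records in a SQLite table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders "
        "(order_id INTEGER PRIMARY KEY, region TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", records)
    conn.commit()


def revenue_by_region(conn):
    """Processing: aggregate stored data to answer a business question."""
    cursor = conn.execute(
        "SELECT region, SUM(amount) FROM orders GROUP BY region ORDER BY region"
    )
    return dict(cursor.fetchall())


def run_pipeline(raw_text):
    """Run all four stages as a single batch job against an in-memory database."""
    conn = sqlite3.connect(":memory:")
    try:
        store(conn, cleanse(collect(raw_text)))
        return revenue_by_region(conn)
    finally:
        conn.close()


if __name__ == "__main__":
    print(run_pipeline(RAW_CSV))
```

Keeping each stage as a separate function mirrors the division of labor described above: the first three functions are the infrastructure, while `revenue_by_region` stands in for the kind of question an analyst would ask of the stored data.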

