Difference between dataset datalake and datawarehouse

Difference between Database, Datalake and Datawarehouse.

Database support an application. Any interactive application needs database to store streaming data. Thus database is good at real-time and streaming data ingest.

Datalake stores raw, untouched data from multiple applications. Data lake can support structured, semi-structured and unstructured data. Data lake is not designed for handling real-time data ingest. The frequency of data lake update depends on the frequency of data ETL (Extract, Transform, Load). Data lake provides data for data mining and machine learning.

Datawarehouse is designed for business intelligence. It stores predefined, processed data and only supports structured data and semi-structured data. Datawarehouse provides data for business report and business dashboard.