Dl/Best Practices: Difference between revisions

From stonehomewiki
<div class="mw-collapsible-preview">Keep good structure of your data</div>
<div class="mw-collapsible-content">
raw, sometimes unstructured data:
- Stage the raw data to be ingested here. This data can be unstructured.
raw, ingested data:
- This data is structured, e.g. in Parquet format. It captures all the information you are interested in from the raw data. It may not be well organized; the purpose is to capture all raw information with minimal processing.
logical data layer:
- Well modeled, perhaps around a subject model (a fact table with a set of dimension tables).
See also [https://lingarogroup.com/blog/data-lake-architecture Data Lake Architecture: How to Create a Well Designed Data Lake]
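The three layers above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the zone names, the `/datalake` root, the CSV columns, and the helpers `zone_path`, `ingest`, and `model` are all hypothetical, and in-memory lists stand in for Parquet files.

```python
import csv
import io

# Hypothetical zone names for the three layers described above.
ZONES = ("raw", "ingested", "logical")


def zone_path(zone: str, dataset: str, root: str = "/datalake") -> str:
    """Build the storage prefix for a dataset in a given zone."""
    if zone not in ZONES:
        raise ValueError(f"unknown zone: {zone!r}")
    return f"{root}/{zone}/{dataset}"


def ingest(raw_text: str) -> list[dict]:
    """Raw -> ingested: parse staged CSV text into structured rows,
    keeping every column and applying no business logic."""
    return list(csv.DictReader(io.StringIO(raw_text)))


def model(rows: list[dict]) -> tuple[list[dict], dict[str, int]]:
    """Ingested -> logical: split rows into a fact table and one
    customer dimension keyed by a surrogate id."""
    customer_dim: dict[str, int] = {}
    facts = []
    for row in rows:
        # Assign a surrogate key the first time a customer is seen.
        key = customer_dim.setdefault(row["customer"], len(customer_dim) + 1)
        facts.append({"customer_id": key, "amount": float(row["amount"])})
    return facts, customer_dim


# Made-up sample data standing in for a staged raw file.
raw = "customer,amount\nalice,10.5\nbob,3\nalice,2\n"
rows = ingest(raw)
facts, customer_dim = model(rows)
```

The ingest step deliberately keeps every column as a string so nothing from the raw data is lost; typing and modeling decisions are deferred to the logical layer.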
</div>
</div>
</div>

Revision as of 09:20, 9 September 2024

Data Lake Knowledge Center

Data Ingestion

Data Governance