Dl/Best Practices: Difference between revisions
From stonehomewiki
Jump to navigationJump to search
Stonezhong (talk | contribs) |
Stonezhong (talk | contribs) |
||
| Line 17: | Line 17: | ||
<div class="mw-collapsible-preview">Keep good structure of your data</div> | <div class="mw-collapsible-preview">Keep good structure of your data</div> | ||
<div class="mw-collapsible-content"> | <div class="mw-collapsible-content"> | ||
raw, sometime unstructured data: | * raw, sometime unstructured data: | ||
- you stage the raw data (to be ingested) here, sometimes, these data can be unstructured. | - you stage the raw data (to be ingested) here, sometimes, these data can be unstructured. | ||
raw, ingested data | * raw, ingested data | ||
- they are structured, e.g. in parquet format. They captured all the information you interested from raw data. They may orgnized well -- the purpose is to capture all raw information with minimum processing. | - they are structured, e.g. in parquet format. They captured all the information you interested from raw data. They may orgnized well -- the purpose is to capture all raw information with minimum processing. | ||
logical data layer | * logical data layer | ||
- well modeled, maybe around a subject model. (a fact table with bunch of dimension tables) | - well modeled, maybe around a subject model. (a fact table with bunch of dimension tables) | ||
See also [https://lingarogroup.com/blog/data-lake-architecture Data Lake Architecture: How to Create a Well Designed Data Lake] | See also | ||
* [https://lingarogroup.com/blog/data-lake-architecture Data Lake Architecture: How to Create a Well Designed Data Lake] | |||
</div> | </div> | ||
</div> | </div> | ||
<p></p> | <p></p> | ||