Dl/Best Practices: Difference between revisions
From stonehomewiki
Jump to navigationJump to search
Stonezhong (talk | contribs) |
Stonezhong (talk | contribs) |
||
| Line 8: | Line 8: | ||
* Your ingestion pipeline may have bugs, saving raw data allows you to fix bugs and re-populate the data | * Your ingestion pipeline may have bugs, saving raw data allows you to fix bugs and re-populate the data | ||
* Raw data may not meed the data quality and you may ignore it, in case you ignore it, keep the raw data allows you to check what kind of data quality problem they are, and sometimes you can inform the data producer to have it fixed. | * Raw data may not meed the data quality and you may ignore it, in case you ignore it, keep the raw data allows you to check what kind of data quality problem they are, and sometimes you can inform the data producer to have it fixed. | ||
* Raw data is owned by data source team and they have their own retention policy -- raw data is not always accessible afterwards. | |||
</div> | </div> | ||
</div> | </div> | ||
<p></p> | <p></p> | ||