Dl/glossary: Difference between revisions

From stonehomewiki
Jump to navigationJump to search
No edit summary
No edit summary
 
(2 intermediate revisions by the same user not shown)
Line 11: Line 11:
</div>
</div>
</div>
</div>
<p></p>
<p></p>
<p></p>


Line 58: Line 57:
* <code>dataframe://stock.msft/1.4/1/2023-01-01</code>: represent a dataframe, it belongs to dataset, dataset name is stock.msft, major_version is 1.4, minor version is 1, and dataframe key is 2023-01-01.
* <code>dataframe://stock.msft/1.4/1/2023-01-01</code>: represent a dataframe, it belongs to dataset, dataset name is stock.msft, major_version is 1.4, minor version is 1, and dataframe key is 2023-01-01.


</div>
</div>
<p></p>
<div class="toccolours mw-collapsible mw-collapsed expandable">
<div class="mw-collapsible-preview">Data Application</div>
<div class="mw-collapsible-content">
* Data Application consumes 0 or more dataset as input, and generate dataframes and update dataset with new dataframes
** A data ingestion application is a data application that does not consume any dataset
** A data transformation application takes one or more dataset as input.
</div>
</div>
<p></p>
<div class="toccolours mw-collapsible mw-collapsed expandable">
<div class="mw-collapsible-preview">Data Pipeline</div>
<div class="mw-collapsible-content">
* A bunch of data application with dependency defined
* Runs on a regular basis (has schedule information)
</div>
</div>
</div>
</div>
<p></p>
<p></p>

Latest revision as of 00:52, 7 March 2023

Data Lake Knowledge Center