|
To build data products, you require varity of capabilities. Your goal is to take raw data and convert into meaningful higher level signals. To do this you may be sourcing raw data, integrating with outher sources, may use web scrparing, apply ML and statistical model for preiction etc. You can also use generative AI to generate new data. Here are list of capabilities need to build ne dataset.
Core capability
Data ingestion
Data Transformation
Data integration
Statistics to understand data
Data science and ML
Web scraping
Geneerative AI
Once you build new dataset, there are additional capabilities require for
Lineage
Governance
Quality
Meta data
On top of these you also want feedback on data/ recommendation of data
Endorsement
Certificate
When an eneterprise build data product, it also pay attention to
Cost spend on producing data
Benefit from data
|