Data Discovery

Data Discovery (also known as Business Intelligence, or BI) is the process of identifying data needed for building Machine Learning models.

Processes

Generalized Data Pipeline processes can be expressed as follows:

Key Factors

Key factors for successful data discovery include:

Data Sources

Data sources often considered include:

Objectives

Key objectives can include:

  • models: business goals, model accuracies

  • prediction objectives: prediction accuracy goals over time, individual prediction confidence level goals

References