Data is the main fuel of the modern world, enabling artificial intelligence and driving technological growth. The demand for data has grown substantially, and data products have become valuable assets to purchase and sell since it is extremely valuable for sectors to acquire high-quality data to discover knowledge. As a valuable resource, it is important to establish a principled method to quantify the worth of the data and its value for the data seekers. This is addressed via data valuation, which is the essential component for the realization of a fair data trading platform for owners and seekers.
Problem Statement
Consider the case when a Pharma Company would like to purchase data from a Hospital. The challenge is how the Pharma Company can value the worth of the data available at the Hospital without having access to it. In other words, the challenge is valuing invisible decentralized data unavailable locally. Furthermore, we consider data valuation without focusing on a specific task; that is, the Pharma Company would like to know the worth of the data available only at the Hospital without disclosing the task that they may want to purchase the data. This is called an intrinsic data valuation or a data-driven data valuation approach.