Our Data Deduplication solution (D³) differs from standard deduplication tools. Where those tools rely on a strict definition of identical data, D³ virtually maps all information and creates connections between related data points, building an entire network of data elements. Think of the map of a metro network: all stops are interlinked and reachable via multiple routes. You don’t have to look through every stop to find your shortest route; you just follow the most convenient path. As a result, D³ processes large datasets with high performance, whereas traditional tools have to scan record by record.
Because of this network approach to data, the size of the dataset does not affect performance, which sets D³ apart from classic SQL tools.
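To make the network idea more concrete, here is a minimal Python sketch of linking records through shared values rather than comparing every record against every other. It is an illustration only and not the D³ implementation: the function names (`normalize`, `find`, `link_records`), the sample records, and the choice of `email` and `phone` as matching fields are all assumptions made for this example. The technique used is a plain union-find over an index of shared values.

```python
from collections import defaultdict


def normalize(value):
    """Lower-case the value and collapse whitespace so near-identical values match."""
    return " ".join(str(value).lower().split())


def find(parent, x):
    """Return the representative of x's cluster (union-find with path halving)."""
    while parent[x] != x:
        parent[x] = parent[parent[x]]
        x = parent[x]
    return x


def link_records(records, match_fields):
    """Group record ids that are connected through at least one shared field value.

    Hypothetical helper: each record is visited once and looked up in an index,
    so no record-by-record pairwise comparison is needed.
    """
    parent = {rid: rid for rid in records}
    first_seen = {}  # (field, normalized value) -> first record id with that value

    for rid, record in records.items():
        for field in match_fields:
            value = record.get(field)
            if not value:
                continue  # empty values create no link
            key = (field, normalize(value))
            if key in first_seen:
                # Shared value: merge the two clusters.
                parent[find(parent, rid)] = find(parent, first_seen[key])
            else:
                first_seen[key] = rid

    clusters = defaultdict(set)
    for rid in records:
        clusters[find(parent, rid)].add(rid)
    return sorted((sorted(c) for c in clusters.values()), key=lambda c: c[0])


# Made-up sample data for illustration.
records = {
    1: {"name": "Ann Lee", "email": "ann@example.com", "phone": "555-0100"},
    2: {"name": "A. Lee", "email": "ANN@example.com ", "phone": ""},
    3: {"name": "Ann L.", "email": "a.lee@example.org", "phone": "555-0100"},
    4: {"name": "Bob Ray", "email": "bob@example.com", "phone": "555-0199"},
}

print(link_records(records, ["email", "phone"]))  # [[1, 2, 3], [4]]
```

In this sketch, records 2 and 3 have nothing in common with each other, yet both end up in the same group because each is linked to record 1 by a different route, much like two metro stops connected through a shared interchange.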
D³ is not an AI model, so there is no need to train or retrain specific models, nor to provide sufficient training data. Configuration takes place during network setup, based on the initial parameters. It can be modified when additional datasets become available or when the structure of a dataset changes.