proem

Decision Sciences

5 years ago

Faster, More Efficient Data Matching Transforms Data Quality Management

International Journal of Computational Science and Engineering

Incremental processing for string similarity join

Yanglan Gan, Guangwei Xu, Cairong Yan, Bin Zhu

Paper Summary

Related papers

Theme background image

Computer Science

Faster database joins could revolutionize data-driven industries and transform business operations.

Data Stream Management

Text Compression

Trajectory Analysis

Economics, Econometrics and Finance

3D Urban Planning Breakthrough: Buildings Cluster in Unexpected Ways, Transforming City Landscapes

Spatial Econometrics

Ecosystem Services

Theme background image

Computer Science

Faster, more efficient data queries to revolutionize how we access information online.

Content-Centric Networking

Data Stream Management

The article introduces a new way to compare strings called incremental processing for string similarity join. This method helps manage data quality and find valuable information in data. By using this approach, the researchers were able to reduce the time and space needed for comparing similar strings, especially in data streams. They tested two algorithms, Inc-Join and Inp-Join, and found that Inp-Join performed better when dealing with large amounts of data by taking advantage of parallel processing. This means that the new framework can speed up the process of finding similar strings without sacrificing accuracy.