Spark's Recursive Join Breakthrough Slashes Big Data Processing Time
The goal of the research was to make recursive joins over large datasets faster and more efficient in Spark than in MapReduce. The researchers found that Spark's in-memory computing and caching could address what makes recursive joins so complex and expensive in MapReduce, where every iteration must reread its input from disk and write its output back out. By eliminating redundant intermediate data and keeping frequently reused datasets in memory across iterations, they achieved significant performance gains on recursive joins over large datasets. Their approach makes these iterative operations faster and cheaper to run, and large-scale data analysis correspondingly smoother and more cost-effective.
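To make the idea concrete, here is a minimal Scala sketch, not the authors' implementation, of a recursive join in Spark: computing the transitive closure of an edge list by repeatedly joining newly found paths against a cached edge relation until no new pairs appear. The dataset, names, and stopping condition are illustrative assumptions.

```scala
import org.apache.spark.sql.SparkSession

// Sketch of a recursive join in Spark: transitive closure over an edge list.
// Caching keeps the reused relations in memory between iterations, which is
// the key advantage over chaining MapReduce jobs that spill to disk each round.
object RecursiveJoinSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("RecursiveJoinSketch")
      .master("local[*]")
      .getOrCreate()
    val sc = spark.sparkContext

    // Hypothetical edge list: (src, dst) pairs.
    val edges = sc.parallelize(Seq((1, 2), (2, 3), (3, 4), (4, 5))).cache()

    var paths = edges          // current set of reachable (src, dst) pairs
    var prevCount = 0L
    var count = paths.count()

    // Repeat the join until no new pairs are produced (a fixpoint).
    while (count != prevCount) {
      prevCount = count
      // Extend each path (a, b) by an edge (b, c) to produce (a, c).
      val newPaths = paths
        .map { case (a, b) => (b, a) }       // key paths by their destination
        .join(edges)                         // match destination b to edge (b, c)
        .map { case (_, (a, c)) => (a, c) }  // yield the extended pair (a, c)
      paths = paths.union(newPaths).distinct().cache()
      count = paths.count()
    }

    println(s"Transitive closure contains $count pairs")
    spark.stop()
  }
}
```

In an equivalent MapReduce workflow, each iteration of that loop would be a separate job rereading and rewriting the full dataset; here the cached RDDs stay in memory, which is the effect the paper exploits.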