Massively Parallel Tree Embeddings for High Dimensional Spaces

dc.contributor.authorAhanchi, Amirmohsen
dc.contributor.authorAndoni, Alexandr
dc.contributor.authorHajiaghayi, MohammadTaghi
dc.contributor.authorKnittel, Marina
dc.contributor.authorZhong, Peilin
dc.date.accessioned2023-09-14T17:01:34Z
dc.date.available2023-09-14T17:01:34Z
dc.date.issued2023-06-17
dc.description.abstractEfficient computation on massive high-dimensional data greatly benefits from efficient embedding techniques into simpler metrics. Perhaps the most celebrated technique is the dimension reduction à-la Johnson and Lindenstrauss [46]. Another important method embeds the data into a tree metric space, first efficiently achieved by Bartal [14]. Both of these algorithmic tools are among the most general theorems with numerous applications. In this paper, we study these two embedding methods in the Massively Parallel Computation (MPC) model. We develop a new hybrid partitioning algorithm which generalizes both random shifted grid and ball partitioning methods for generating tree embeddings. This leads to an 𝑂(1)-round randomized MPC algorithm for embedding high-dimensional data into a tree while approximating the distance between any two points within a factor of 𝑂e(log1.5 𝑛) (and thus distortion 𝑂e(log1.5 𝑛)) in expectation as long as the aspect ratio is 𝑂(poly(𝑛)). This Euclidean result beats the lower bound of Ω(log𝑛) MPC rounds for tree embeddings of general metric spaces and can extend to a number of problems, including densest ball, minimum spanning tree, and Earth-Mover distance. Along the way, we implement and use Ailon and Chazelle’s Fast Johnson Lindenstrauss Transform [2] with sublinear memory and 𝑂(1) MPC rounds, which is of its own interest.
dc.description.urihttps://doi.org/10.1145/3558481.3591096
dc.identifierhttps://doi.org/10.13016/dspace/p4mc-0xuu
dc.identifier.citationAmirmohsen Ahanchi, Alexandr Andoni, MohammadTaghi Hajiaghayi, Marina Knittel, and Peilin Zhong. 2023. Massively Parallel Tree Embeddings for High Dimensional Spaces. In Proceedings of the 35th ACM Symposium on Parallelism in Algorithms and Architectures (SPAA ’23), June 17–19, 2023, Orlando, FL, USA. ACM, New York, NY, USA, 12 pages.
dc.identifier.urihttp://hdl.handle.net/1903/30479
dc.language.isoen_US
dc.publisherAssociation for Computer Machinery (ACM)
dc.relation.isAvailableAtCollege of Computer, Mathematical & Natural Sciencesen_us
dc.relation.isAvailableAtComputer Scienceen_us
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_us
dc.relation.isAvailableAtUniversity of Maryland (College Park, MD)en_us
dc.subjectMassively Parallel Computation
dc.subjectembeddings
dc.titleMassively Parallel Tree Embeddings for High Dimensional Spaces
dc.typeArticle
local.equitableAccessSubmissionNo

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Ahanchi, A et al.pdf
Size:
1.21 MB
Format:
Adobe Portable Document Format