Spatio-textual search on Spark
Abstract
The amount of spatially aware data is growing at a rapid rate, and challenges both processing and organizing such data is in focus in the scientific world and the industry. But spatial data seldom exists alone, usually accompanied by some form of textual property. The challenges increase as we attempt to process the spatio-textual documents that are created, and the usage of Big Data platforms become a necessity. This paper provides an insight into different approaches on how to meet the spatial challenges on Big Data platforms, and provides a way to extend a solution to a spatio-textual index on top of Apache Spark. The approach is evaluated to show good results on very large datasets.