Found another possibility for APACHE. Ties in to Strava heatmap.
https://medium.com/strava-engineering/the-global-heatmap-now-6x-hotter-23fc01d301de
Excerpt:
The heatmap generation code has been fully rewritten using Apache Spark and Scala. The new code leverages new infrastructure enabling bulk activity stream access and is parallelized at every step from input to output. With these changes, we have fully conquered all scaling challenges. The full global heatmap was built across several hundred machines in just a few hours, with a total compute cost of only a few hundred dollars.