Large Scale Estimation in Cyberphysical Systems using Streaming Data: a Case Study with Smartphone Traces

Abstract: 

Controlling and analyzing cyberphysical and robotics systems is increasingly becoming a Big Data challenge. Pushing this data to, and processing in the cloud is more efficient than on-board processing. However, current cloud-based solutions are not suitable for the latency requirements of these applications. We present a new concept, Discretized Streams or D-Streams, that enables massively scalable computations on streaming data with latencies as short as a second. We experiment with an implementation of D-Streams on top of the Spark computing framework. We demonstrate the usefulness of this concept with a novel algorithm to estimate vehicular traffic in urban networks. Our online EM algorithm can estimate traffic on a very large city network (the San Francisco Bay Area) by processing tens of thousands of observations per second, with a latency of a few seconds.

Author: 
Hunter, Timothy
Das, Tathagata
Zaharia, Matei
Abbeel, Pieter
Bayen, Alexandre M.
Publication date: 
December 14, 2012
Publication type: 
Preprint
Citation: 
Hunter, T., Das, T., Zaharia, M., Abbeel, P., & Bayen, A. M. (2012). Large Scale Estimation in Cyberphysical Systems using Streaming Data: A Case Study with Smartphone Traces (arXiv:1212.3393). arXiv. https://doi.org/10.48550/arXiv.1212.3393