Home
DRIVE AI Consortium
- A pre-competitive applied research consortium uniting industry, government, and academia to advance connected, electrified, and autonomous systems
  
  Overview
  
  DRIVE AI Events
  
  Request a Campus Tour and Faculty Intro
  
  DRIVE AI Brochure
About Us
- ITS hosts a number of faculty members from nine UC Berkeley academic departments and schools and approximately 150 researchers and students are associated with ITS through our various research and educational activities.
  
  Overview
  
  Welcome
  
  ITS Leadership
  
  Directory
  
  Faculty and Lead Researchers
  
  Academic Partners
  
  ITS Senior Fellows
  
  History
  
  ITS Brand Toolkit
  
  ITS Publications
  
  Jobs
Research
- ITS Berkeley hosts many research opportunities and research centers
  
  Overview
  
  Research Centers
  
  Transportation Data and Statistics
  
  Berkeley Request for Proposals
  
  ITS Center Pages
  
  ITS Library
  
  Student-Researcher Repository
News & Events
- Learn more about the research and people at ITS Berkeley through our news and events.
  
  Overview
  
  Our Stories
  
  Signature and special Events
  
  ITS in the News
  
  Events & Seminars
  
  Past Seminars
  
  ITS Berkeley Newsletter
Publications
- Our faculty, staff, and students are well published in a variety of journals, publications and books.
  
  Overview
  
  Publications
  
  Books by ITS Faculty
Students
- Our students are an integral part of the Institute through our research and activities.
  
  Overview
  
  Student-Researcher Repository
  
  Academic Affilations
  
  Connect
  
  TRANSOC
  
  Degree Programs
  
  Commencement
Alumni
- Our alumni are a valued resource at ITS Berkeley, and we like to stay connected with them as they continue their career.
  
  Overview
  
  Alumni in Academia
  
  Alumni in the Private Sector
  
  Alumni in the Public Sector
  
  Donate
  
  Connect
ITS Library
- Overview
  
  Research Resources

Secondary navigation

sparkmobility: A Spark-based Python Library for Processing, Modeling, and Analyzing Large Mobility Datasets

Abstract:

Location-Based Service (LBS) data, collected from personal mobile devices, have enabled significant advances in understanding human mobility patterns over the past decade. Extracting insights from these datasets typically involves using complex data-mining algorithms to detect, filter, and cluster stay locations. However, LBS datasets are often massive—ranging from tens to hundreds of gigabytes per day—posing serious computational challenges for traditional data processing tools. Libraries such as Pandas operate in a single-machine environment and require the entire dataset to fit into memory, making them unsuitable for processing LBS data at scale [3]. sparkmobility allows students and researchers to process large LBS dataset with improved memory management.

Author:

Cao, Shangqing

Publication date:

December 12, 2025

Publication type:

Conference Paper

Citation:

Cao, S., & Gonzalez, M. C. (2025). sparkmobility: A Spark-based Python Library for Processing, Modeling, and Analyzing Large Mobility Datasets. Proceedings of the 33rd ACM International Conference on Advances in Geographic Information Systems, 1296–1297. https://doi.org/10.1145/3748636.3766538

Document

https://dl.acm.org/doi/10.1145/3748636.3766538

Topics

ITS Berkeley topic page, Data topic page, Travel Behavior topic page