論文誌 (国際) Development of people mass movement simulation framework based on reinforcement learning
Yanbo Pang (The University of Tokyo), Takehiro Kashiyama (The University of Tokyo), Takahiro Yabe (Purdue University), Kota Tsubouchi, Yoshihide Sekimoto (The University of Tokyo)
Transportation Research Part C (TRC journal)
Understanding individual and crowd dynamics in urban environments is critical for numerous ap- plications, such as urban planning, traffic forecasting, and location-based services. However, re- searchers have developed travel demand models to accomplish this task with survey data that are expensive and acquired at low frequencies. In contrast, emerging data collection methods have ena- bled researchers to leverage machine learning techniques with a tremendous amount of mobility data for analyzing and forecasting people’s behaviors. In this study, we developed a reinforcement learning-based approach for modeling and simulation of people mass movement using the global positioning system (GPS) data. Unlike traditional travel demand modeling approaches, our method focuses on the problem of inferring the spatio-temporal preferences of individuals from the ob- served trajectories, and is based on inverse reinforcement learning (IRL) techniques. We applied the model to the data collected from a smartphone application and attempted to replicate a large amount of the population’s daily movement by incorporating with agent-based multi-modal traffic simula- tion technologies. The simulation results indicate that agents can successfully learn and generate human-like travel activities. Furthermore, the proposed model performance significantly outper- forms the existing methods in synthetic urban dynamics.