A traffic accident dataset for Chattanooga, Tennessee
Andy Berres,
Pablo Moriano,
Haowen Xu,
Sarah Tennille,
Lee Smith,
Jonathan Storey,
Jibonananda Sanyal
Affiliations
Andy Berres
Corresponding author at: Energy Conversion and Storage Systems Center, National Renewable Energy Laboratory, 15301 Denver West Parkway, Mail Stop RSF 042, Golden, CO 80401, United States.; Energy Conversion and Storage Systems Center, National Renewable Energy Laboratory, 15301 Denver West Parkway, Mail Stop RSF 042, Golden, CO 80401, United States; Computational Sciences and Engineering Division, Oak Ridge National Laboratory, PO Box 2008, MS-6085, Oak Ridge, TN 37830, United States
Pablo Moriano
Computer Science and Mathematics Division, Oak Ridge National Laboratory, PO Box 2008, MS-6013, Oak Ridge, TN 37830, United States
Haowen Xu
Computational Sciences and Engineering Division, Oak Ridge National Laboratory, PO Box 2008, MS-6085, Oak Ridge, TN 37830, United States
Sarah Tennille
Computational Sciences and Engineering Division, Oak Ridge National Laboratory, PO Box 2008, MS-6085, Oak Ridge, TN 37830, United States
Lee Smith
Tennessee Department of Transportation, James K. Polk Bldg., Suite 700, 505 Deaderick Street, Nashville, TN 37243, United States
Jonathan Storey
Tennessee Department of Transportation, James K. Polk Bldg., Suite 700, 505 Deaderick Street, Nashville, TN 37243, United States
Jibonananda Sanyal
Energy Conversion and Storage Systems Center, National Renewable Energy Laboratory, 15301 Denver West Parkway, Mail Stop RSF 042, Golden, CO 80401, United States
This publication presents an annotated accident dataset which fuses traffic data from radar detection sensors, weather condition data, and light condition data with traffic accident data (as illustrated in Fig. 1) in a format that is easy to process using machine learning tools, databases, or data workflows. The purpose of this data is to analyze, predict, and detect traffic patterns when accidents occur. Each file contains a timeseries of traffic speeds, flows, and occupancies at the sensor nearest to the accident, as well as 5 neighboring sensors upstream and downstream. It also contains information about the accident type, date, and time. In addition to the accident data, we provide baseline data for typical traffic patterns during a given time of day. Overall, the dataset contains 6 months of annotated traffic data from November 2020 to April 2021. During this timeframe, and 361 accidents occurred in the monitored area around Chattanooga, Tennessee. This dataset served as the basis for a study on topology-aware automated accident detection for a companion publication [1].