Promet (Zagreb) (Apr 2024)

Analysis of Beijing Traffic Violations Based on the BERT-CRF Model

  • Jie Li,
  • Yuntao Shi,
  • Shuqin Li

DOI
https://doi.org/10.7307/ptt.v36i2.366
Journal volume & issue
Vol. 36, no. 2
pp. 279 – 293

Abstract

Traffic violations are a major cause of traffic accidents, yet existing research does not analyse them comprehensively, and conventional named entity recognition methods fail to extract the names of traffic violation events from records, so they offer little guidance for managing urban traffic violations. After the People’s Daily dataset is expanded from 71,456 words to 95,291 words, the BERT-CRF (Bidirectional Encoder Representations from Transformers-Conditional Random Field) model achieves a precision of 88.53%, a recall of 92.90% and an F1 score of 90.66%, successfully identifying event, time and location named entities in traffic violation records. The traffic violation data are then enriched through forward geocoding and the Bayesian formula, and the violations are analysed by time, space, administrative region, gender and weather, to support the dynamic allocation of on-scene traffic law enforcement and the precise management of traffic violations.
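
The three reported scores are tied together by the standard definition of F1 as the harmonic mean of precision and recall. As a quick consistency check, the sketch below recomputes F1 from the 88.53% and 92.90% figures, treating the first as precision. The function name `f1_score` is illustrative and does not come from the paper.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall, both given as fractions in [0, 1]."""
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both scores are zero
    return 2 * precision * recall / (precision + recall)


# Figures reported in the abstract, converted from percentages to fractions.
reported_f1 = f1_score(0.8853, 0.9290)
print(f"{reported_f1:.2%}")  # prints 90.66%
```

The result agrees with the reported F1 score of 90.66%, which suggests that the 88.53% figure is a precision value in the usual entity-level sense.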

Keywords