Cybergeo (Jan 2024)

Intercity relationships between 293 Chinese cities quantified based on toponym co-occurrence

  • Wang Tongjing,
  • Zhao Yin,
  • Ziyu Bao,
  • Evert Meijers

DOI
https://doi.org/10.4000/cybergeo.40721

Abstract

Read online

This dataset presents relationships between 293 Chinese cities, derived using a toponym co-occurrence method. By employing this toponym co-occurrence analysis method, the strength of an intercity relationship is determined by the frequency at which both city names appear on the same webpage. The data was sourced from the Common Crawl web archive's 2019 April Corpus, which contains approximately 2.5 billion web pages. The primary aim of this dataset is to provide a fresh perspective on intercity relationships, thereby facilitating studies on city network analysis. The dataset not only encourages further research into comparing this innovative city relationship with other established networks but is also a showcase that presents a straightforward methodology that can be applied to other archives within Common Crawl. As such, it paves the way for longitudinal studies that probe the evolution of city networks.

Keywords