Journal of Information Systems and Informatics (Sep 2024)
Leveraging NLP to Analyze Regulatory Document Interconnections: A Systematic Review
Abstract
A sustainable digital village requires an effective policy management mechanism to deliver relevant regulatory information to the community. Management information systems for regulations play a crucial role in achieving this. However, communities still face challenges in understanding and navigating the relationships between various regulations. To address this issue, this study conducts a systematic review of the components found in regulatory documents and the methods used to analyze them. The review identifies eight key components in regulatory documents: topic, structure, category, initiator, level, considerations, related regulations, and content. Natural Language Processing (NLP) techniques can be employed for data preprocessing, including tokenization, lowercasing, stop-word removal, stemming, filtering, part-of-speech tagging, lemmatization, and chunking. For feature extraction, methods such as TF-IDF, bag-of-words, word counts, N-grams, and word embeddings can be applied. To measure the interconnection between regulations, techniques such as cosine similarity and K-Means clustering can be utilized. Experimental results demonstrate that the combination of methods used significantly influences the accuracy of identifying regulatory interconnections. The choice of methods, whether simple or complex, depends on the context, and confirmation through manual analysis is often required to ensure accuracy.
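To make the pipeline concrete, the sketch below shows one common way such steps can be combined with scikit-learn: TF-IDF feature extraction with built-in lowercasing and stop-word removal, cosine similarity as the interconnection measure, and K-Means clustering. It is an illustrative sketch, not the pipeline of any reviewed study; the sample regulation excerpts, the English stop-word list, and the cluster count are assumptions made here for demonstration.

# Minimal sketch of a regulation-interconnection pipeline:
# TF-IDF features, pairwise cosine similarity, and K-Means clustering.
# Sample documents and parameter values are illustrative assumptions only.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity
from sklearn.cluster import KMeans

# Hypothetical excerpts standing in for regulatory documents.
regulations = [
    "Village regulation on the management of village-owned enterprises.",
    "Regulation on financial reporting for village-owned enterprises.",
    "Regulation on land use and spatial planning in rural areas.",
    "Guidelines for spatial planning and land permits at the district level.",
]

# Feature extraction: TF-IDF, with lowercasing and stop-word removal
# handled directly by the vectorizer.
vectorizer = TfidfVectorizer(lowercase=True, stop_words="english")
tfidf = vectorizer.fit_transform(regulations)

# Pairwise cosine similarity as a measure of interconnection between regulations.
similarity = cosine_similarity(tfidf)
print("Cosine similarity matrix:")
print(similarity.round(2))

# K-Means clustering to group related regulations (k is chosen arbitrarily here).
kmeans = KMeans(n_clusters=2, n_init=10, random_state=0)
labels = kmeans.fit_predict(tfidf)
print("Cluster labels:", labels)

In practice, the similarity matrix or cluster labels would still be checked against a manual reading of the documents, in line with the review's observation that manual confirmation is often required to ensure accuracy.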
Keywords