Inge-Cuc (Sep 2022)

Bot crawler to retrieve data from Facebook based on the selection of posts and the extraction of user profiles

  • Ariel Guillermo Sánchez Paipilla,
  • Mónica Katherine Durán Vaca,
  • Javier Antonio Ballesteros Ricaurte,
  • Angela María González Amarillo,
  • Pedro Nel López

DOI
https://doi.org/10.17981/ingecuc.18.2.2022.08
Journal volume & issue
Vol. 18, no. 2
pp. 101 – 113

Abstract

Read online

Introduction: Data can currently be found within organizations and outside of them, they are growing exponentially. Today, the information available on the Internet and social networks has become a generator of value, through the effective analysis of a specific situation, using techniques and methodologies with which content-based solutions can be proposed, and thus achieve, execute timely, intelligent and assertive decision-making processes. Objective: The main objective of this work is to development of a Bot Crawler, which allows extracting information from Facebook without access restrictions, or request for credentials, based on web crawling and scraping techniques, through the selection of HTML tags, to track and be able to define patterns. Method: The development of this project consisted of four main stages: A) Teamwork with SCRUM, B) Comparison of web data extraction techniques, C) Extraction and validation of permissions to access the data in Facebook, D) Development of the bor crawler. Results: Briefly, mention the main results of the research Conclusions: As a result of this process, a graphical interface is created that allows checking the process of obtaining data derived from user profiles of this social network.

Keywords