Integration of the Peruvian Citizen’s Public Information by Applying Web Scraping Under SCRUM Methodology

Hugo Vega-Huerta, Ronald Cardeña-Ccahuata, Percy De La Cruz Velez de Villa, Ernesto Cancho-Rodriguez, Gisella Luisa Elena Maquen-Niño

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-reviewed

Abstract

The Digital Platform of the Peruvian State is mainly composed of seven websites. To obtain complete information about a citizen, information must be extracted from each website and integrated manually, which can take more than 3 min. The objective of this work is to centralize the public information from the seven websites in a single web platform by applying web scraping. To implement the web scraping technique, the Selenium tool was used to simulate the query process of a user entering an ID number, and the web platform was developed following the Scrum methodology, divided into three Sprints. As a result, users can view, with a simple query, the public information of a citizen that is stored and available on different websites, and the average time to search for information about a citizen was reduced from 136 to 24 s. In conclusion, web scraping can extract the information of a citizen from different governmental websites with a simple query, quickly and completely.
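
The approach summarized in the abstract (one ID number queried against several websites, with the results merged into a single record) might be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the function names `make_selenium_fetcher` and `integrate_citizen_info`, the URL, and the CSS selectors are all hypothetical, and the record does not specify how the real platform handles errors or page structure.

```python
from typing import Callable, Dict

# A fetcher takes an ID number and returns the fields found on one website.
Fetcher = Callable[[str], Dict[str, str]]


def make_selenium_fetcher(url: str, input_selector: str, result_selector: str) -> Fetcher:
    """Build a fetcher that types the ID into a query form and reads the result.

    The URL and both CSS selectors are hypothetical placeholders; each real
    website would need its own values.
    """
    def fetch(id_number: str) -> Dict[str, str]:
        # Imported lazily so the merging logic below works without Selenium installed.
        from selenium import webdriver
        from selenium.webdriver.common.by import By
        from selenium.webdriver.common.keys import Keys
        from selenium.webdriver.support import expected_conditions as EC
        from selenium.webdriver.support.ui import WebDriverWait

        driver = webdriver.Chrome()
        try:
            driver.get(url)
            # Simulate a user entering the ID number and submitting the form.
            box = driver.find_element(By.CSS_SELECTOR, input_selector)
            box.send_keys(id_number, Keys.ENTER)
            # Wait up to 10 seconds for the result element to appear.
            result = WebDriverWait(driver, 10).until(
                EC.presence_of_element_located((By.CSS_SELECTOR, result_selector))
            )
            return {"text": result.text}
        finally:
            driver.quit()

    return fetch


def integrate_citizen_info(id_number: str, sources: Dict[str, Fetcher]) -> dict:
    """Query every source with the same ID and merge the results into one record.

    A failing source is recorded under "errors" instead of aborting the query
    (an assumption made for this sketch).
    """
    record = {"id_number": id_number, "sources": {}, "errors": {}}
    for name, fetch in sources.items():
        try:
            record["sources"][name] = fetch(id_number)
        except Exception as exc:
            record["errors"][name] = str(exc)
    return record
```

Because each source is just a callable, the merging step can be exercised with stub fetchers before any browser automation is wired in.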

Original language: English
Title of host publication: Proceedings of the 9th Brazilian Technology Symposium (BTSym’23) - Emerging Trends and Challenges in Technology
Editors: Yuzo Iano, Rangel Arthur, Osamu Saotome, Guillermo Leopoldo Kemper Vásquez, Maria Thereza de Moraes Gomes Rosa, Gabriel Gomes de Oliveira
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 558-567
Number of pages: 10
ISBN (Print): 9783031669606
DOI
Publication status: Published - 2024
Event: 9th Brazilian Technology Symposium on Emerging Trends and Challenges in Technology, BTSym 2023 - Campinas, Brazil
Duration: 24 Oct 2023 to 26 Oct 2023

Publication series

Name: Smart Innovation, Systems and Technologies
Volume: 402 SIST
ISSN (Print): 2190-3018
ISSN (Electronic): 2190-3026

Conference

Conference: 9th Brazilian Technology Symposium on Emerging Trends and Challenges in Technology, BTSym 2023
Country/Territory: Brazil
City: Campinas
Period: 24/10/23 to 26/10/23

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.
