Ingeniería y competitividad
Print version ISSN 0123-3033
On-line version ISSN 2027-8284
Abstract
ACEVEDO-CASTIBLANCO, Jorge-Alexander; SUAREZ-BARON, Marco-Javier and GONZALEZ-SANABRIA, Juan-Sebastian. Categorization and Integration of Opinion Columns Content in Web Pages Applying Natural Language Processing Techniques. Ing. compet. [online]. 2023, vol.25, n.3, e-22313220. Epub Dec 30, 2023. ISSN 0123-3033. https://doi.org/10.25100/iyc.v25i3.13220.
This article presents the application of Natural Language Processing techniques to text analysis, describing the process from data extraction to the automatic identification and detection of opinions. The texts analyzed were opinion columns that reflect people's views on current issues. The aim is to provide an agile way to identify topics of interest in the community and to give interested readers a summary of what is said about those topics. To this end, one algorithm was implemented to extract information accurately and cleanly from web pages, and a second algorithm then automatically categorizes the extracted information, generating an accurate summary of the main topics in each text.
Keywords: Text Classification; Opinion Columns; Natural Language Processing; Web Scraping.
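
The two-stage process described in the abstract (clean extraction from a web page, then automatic categorization) can be illustrated with a minimal sketch. This is not the authors' implementation, which the abstract does not detail. It assumes that the column text sits inside `<p>` tags and that categories are assigned by simple keyword counts. The names `ParagraphExtractor`, `extract_text`, `top_terms`, `categorize`, `STOPWORDS`, and `CATEGORY_KEYWORDS` are hypothetical, and only the Python standard library is used.

```python
import re
from collections import Counter
from html.parser import HTMLParser

# Hypothetical stopword list and category lexicon, for illustration only.
STOPWORDS = {"the", "a", "an", "of", "and", "to", "in", "on", "is", "that"}
CATEGORY_KEYWORDS = {
    "economy": {"tax", "reform", "economy", "inflation"},
    "health": {"hospital", "vaccine", "health"},
    "education": {"school", "teacher", "university"},
}


class ParagraphExtractor(HTMLParser):
    """Collects the text found inside <p> elements and ignores the rest."""

    def __init__(self):
        super().__init__()
        self._depth = 0        # > 0 while inside a <p> element
        self._buffer = []      # text pieces of the current paragraph
        self.paragraphs = []   # cleaned paragraphs, in document order

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            self._depth += 1

    def handle_endtag(self, tag):
        if tag == "p" and self._depth > 0:
            self._depth -= 1
            if self._depth == 0:
                # Collapse runs of whitespace into single spaces.
                text = " ".join("".join(self._buffer).split())
                if text:
                    self.paragraphs.append(text)
                self._buffer = []

    def handle_data(self, data):
        if self._depth > 0:
            self._buffer.append(data)


def extract_text(html):
    """Stage 1: return the paragraph text of an HTML page as one string."""
    parser = ParagraphExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.paragraphs)


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens that are not stopwords."""
    words = re.findall(r"[a-záéíóúñü]+", text.lower())
    return [w for w in words if w not in STOPWORDS]


def top_terms(text, n=5):
    """Return the n most frequent non-stopword terms as a rough topic summary."""
    return [term for term, _ in Counter(tokenize(text)).most_common(n)]


def categorize(text):
    """Stage 2: pick the category whose keywords occur most often in the text."""
    counts = Counter(tokenize(text))
    scores = {
        category: sum(counts[word] for word in keywords)
        for category, keywords in CATEGORY_KEYWORDS.items()
    }
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else "uncategorized"
```

A short usage example under the same assumptions: calling `extract_text` on a page whose body contains navigation links, a script, and two paragraphs returns only the paragraph text, which can then be passed to `categorize` and `top_terms`. A production pipeline would likely replace the keyword lexicon with a trained classifier or topic model; the keyword count is used here only because it keeps the sketch self-contained.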