SciELO - Scientific Electronic Library Online

 
Forma y Función

Print version ISSN 0120-338X

Abstract

PEMBERTY TAMAYO, José Luis; MOLINA MEJIA, Jorge Mauricio and VALLEJO ZAPATA, Víctor Julián. UnderRL Tagger: A Grammar Tagger for Technologically Under-Supported and Minority Languages. Forma. func. [online]. 2023, vol.36, n.2, e1984. Epub 08-Jun-2023. ISSN 0120-338X. https://doi.org/10.15446/fyf.v36n2.101984.

This paper presents UnderRL Tagger, a freely available software program for morphosyntactic (part-of-speech, POS) tagging of languages that lack automatic taggers. The program aims to facilitate work with corpora in these technologically under-supported and minority languages, and thus to contribute to revitalization processes grounded in descriptive research and computational tools. UnderRL Tagger lets manual tagging gradually become automatic through a system that remembers and reuses previously assigned tags, handles large amounts of text, and generates output files in XML format with tags based on the standardized EAGLES system. The article describes how the system was modeled and developed, its functionalities, and the prospects for further work.
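
The idea of remembering and reusing tags can be illustrated with a minimal Python sketch. It is not taken from UnderRL Tagger itself: the class `TagMemory`, the functions `tag_tokens` and `to_xml`, the XML element names `text` and `w`, and the short labels `AQ` and `NC` (loosely modeled on EAGLES-style category codes) are all assumptions made for illustration. The sketch only shows how a word form tagged once by hand could be tagged automatically the next time it appears.

```python
import xml.etree.ElementTree as ET


class TagMemory:
    """Hypothetical store of the tag an annotator gave each word form."""

    def __init__(self):
        self._tags = {}

    def tag(self, token, ask):
        # Look the form up case-insensitively; only call the annotator
        # (the `ask` callback) when the form has not been seen before.
        key = token.lower()
        if key not in self._tags:
            self._tags[key] = ask(token)
        return self._tags[key]


def tag_tokens(tokens, memory, ask):
    """Tag a token list, reusing remembered tags where possible."""
    return [(token, memory.tag(token, ask)) for token in tokens]


def to_xml(tagged):
    """Serialize (form, tag) pairs to a simple XML string.

    The element and attribute names are illustrative assumptions,
    not the output schema of UnderRL Tagger.
    """
    root = ET.Element("text")
    for form, tag in tagged:
        word = ET.SubElement(root, "w", {"tag": tag})
        word.text = form
    return ET.tostring(root, encoding="unicode")
```

In use, `ask` would prompt a human annotator. With a memory that already holds most forms of a corpus, few prompts remain, which is the sense in which manual tagging could gradually become automatic.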

Keywords: morphosyntactic tagging; technologically under-supported languages; minority languages; text corpora; natural language processing.
