SciELO - Scientific Electronic Library Online

 
vol.24 número52Support Vector Machines for Biomarkers Detection in in vitro and in vivo Experiments of Organochlorines ExposureImpact of Clean Architecture and ISO/IEC 25010 on the Maintainability of Android Applications índice de autoresíndice de assuntospesquisa de artigos
Home Pagelista alfabética de periódicos  

Serviços Personalizados

Journal

Artigo

Indicadores

Links relacionados

  • Em processo de indexaçãoCitado por Google
  • Não possue artigos similaresSimilares em SciELO
  • Em processo de indexaçãoSimilares em Google

Compartilhar


TecnoLógicas

versão impressa ISSN 0123-7799versão On-line ISSN 2256-5337

Resumo

ESCOBAR-GRISALES, Daniel; VASQUEZ-CORREA, Juan Camilo  e  OROZCO-ARROYAVE, Juan Rafael. Author Profiling in Informal and Formal Language Scenarios Via Transfer Learning. TecnoL. [online]. 2021, vol.24, n.52, pp.212-225.  Epub 15-Fev-2022. ISSN 0123-7799.  https://doi.org/10.22430/22565337.2166.

The interest in author profiling tasks has increased in the research community because computer applications have shown success in different sectors such as security, marketing, healthcare, and others. Recognition and identification of traits such as gender, age or location based on text data can help to improve different marketing strategies. This type of technology has been widely discussed regarding documents taken from social media. However, its methods have been poorly studied using data with a more formal structure, where there is no access to emoticons, mentions, and other linguistic phenomena that are only present in social media. This paper proposes the use of recurrent and convolutional neural networks and a transfer learning strategy to recognize two demographic traits, i.e., gender and language variety, in documents written in informal and formal language. The models were tested in two different databases consisting of tweets (informal) and call-center conversations (formal). Accuracies of up to 75 % and 68 % were achieved in the recognition of gender in documents with informal and formal language, respectively. Moreover, regarding language variety recognition, accuracies of 92 % and 72 % were obtained in informal and formal text scenarios, respectively. The results indicate that, in relation to the traits considered in this paper, it is possible to transfer the knowledge from a system trained on a specific type of expressions to another one where the structure is completely different and data are scarcer.

Palavras-chave : Author profiling; Gender Recognition; Language variety recognition; Transfer learning; Natural language processing.

        · resumo em Espanhol     · texto em Inglês     · Inglês ( pdf )