SciELO - Scientific Electronic Library Online

 
vol.42 issue2Assessment of the Compressive Strength of Lime Mortars with Admixtures Subjected to Two Curing EnvironmentsTypifying Students' Help-Seeking Behavior in an Intelligent Tutoring System for Mathematics author indexsubject indexarticles search
Home Pagealphabetic serial listing  

Services on Demand

Journal

Article

Indicators

Related links

  • On index processCited by Google
  • Have no similar articlesSimilars in SciELO
  • On index processSimilars in Google

Share


Ingeniería e Investigación

Print version ISSN 0120-5609

Abstract

LOPEZ-PABON, Felipe O.  and  OROZCO-ARROYAVE, Juan R.. Automatic Personality Evaluation from Transliterations of YouTube Vlogs Using Classical and State-of-the-Art Word Embeddings. Ing. Investig. [online]. 2022, vol.42, n.2, e209.  Epub June 13, 2022. ISSN 0120-5609.  https://doi.org/10.15446/ing.investig.93803.

The study of automatic personality recognition has gained attention in the last decade thanks to a variety of applications deriving from this field. The Big Five model (also known as OCEAN) constitutes a well-known method to label different personality traits. This work considered transliterations of video recordings collected from YouTube (originally provided by the Idiap research institute) and automatically generated scores for the Big Five personality traits, which were also in the database. The transliterations were modeled with three different word embedding approaches (Word2Vec, GloVe, and BERT) and three different levels of analysis, namely a regression to predict the score of each personality trait, a binary classification between the strong vs. weak presence of each trait, and a tri-class classification according to three different levels of manifestations in each trait (low, medium, and high). According to our findings, the proposed approach provides similar results to others reported in the specialized literature. We believe that further research is required to find better results. Our results, as well as others reported in the literature, suggest that there is a big gap in the study of personality traits based on linguistic patterns, which highlights the need to work on collecting and labeling data considering the knowledge of expert psychologists and psycholinguists.

Keywords : personality; word embeddings; YouTube; regression; classification.

        · abstract in Spanish     · text in English     · English ( pdf )