REVEALING NON-ALPHABETICAL GUISES OF SPAM-TRIGGER VOCABLES

ROJAS-GALEANO, SERGIO A.

Services on Demand

Journal

Article

Indicators

Cited by SciELO
Access statistics

DYNA

Print version ISSN 0012-7353

Abstract

ROJAS-GALEANO, SERGIO A.. REVEALING NON-ALPHABETICAL GUISES OF SPAM-TRIGGER VOCABLES. Dyna rev.fac.nac.minas [online]. 2013, vol.80, n.182, pp.15-24. ISSN 0012-7353.

Unsolicited bulk email (spam) nowadays accounts for nearly 75% of daily email traffic, a figure that speaks strongly for the need of finding better protection mechanisms against its dissemination. A clever trick recently exploited by email spammers in order to circumvent textual-based filters, involves obfuscation of black-listed words with visually equivalent text substitutions from non-alphabetic symbols, in such a way it still conveys the semantics of the original word to the human eye (e.g. masking viagra as v1@gr@ or as v-i-a-g-r-a). In this paper we discuss how a simple-yet-effective adaptation of a classical algorithm for string matching may meet this stylish challenge to effectively reveal the similarity between genuine spam-trigger terms with their disguised alpha-numeric variants.

Keywords : Uncovering of spam vocables; approximate string matching algorithm.

· abstract in Spanish · text in Spanish · Spanish (

pdf )