Paper published: Complexity measurement of natural and artificial languages

April 02, 2014

We compared entropy for texts written in natural languages (English, Spanish) and artificial languages (computer software) based on a simple expression for the entropy as a function of message length and specific word diversity. Code text written in artificial languages showed higher entropy than text of similar length expressed in natural languages. Spanish texts exhibit more symbolic diversity than English ones. Results showed that algorithms based on complexity measures differentiate artificial from natural languages, and that text analysis based on complexity measures allows the unveiling of important aspects of their nature. We propose specific expressions to examine entropy related aspects of tests and estimate the values of entropy, emergence, self-organization, and complexity based on specific diversity and message length.

Complexity measurement of natural and artificial languages
Gerardo Febres, Klaus Jaffé and Carlos Gershenson
Complexity, Early View
http://dx.doi.org/10.1002/cplx.21529

Search This Blog

Complexes

Paper published: Complexity measurement of natural and artificial languages

Comments

Popular posts from this blog

Postdoctoral fellowships at UNAM

Call for Applications: Cátedra Germinal Cocho en Ciencias de la Complejidad (Senior posdoc)

Complex Systems Society Seminars