PLoS ONE (Jan 2014)

When a text is translated does the complexity of its vocabulary change? Translations and target readerships.

  • Hênio Henrique Aragão Rêgo,
  • Lidia A Braunstein,
  • Gregorio D'Agostino,
  • H Eugene Stanley,
  • Sasuke Miyazima

DOI
https://doi.org/10.1371/journal.pone.0110213
Journal volume & issue
Vol. 9, no. 10
p. e110213

Abstract

In linguistic studies, the academic level of the vocabulary in a text can be described in terms of statistical physics by using a "temperature" concept related to the text's word-frequency distribution. We propose a "comparative thermo-linguistic" technique to analyze the vocabulary of a text to determine its academic level and its target readership in any given language. We apply this technique to a large number of books by several authors and examine how the vocabulary of a text changes when it is translated from one language to another. Unlike the uniform results produced by the Zipf law, our "word energy" distribution technique reveals variations in the power-law behavior. We also examine some common features that span languages and identify some intriguing questions concerning how to determine when a text is suitable for its intended readership.
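
For readers unfamiliar with the Zipf law baseline mentioned in the abstract, the following is a minimal sketch of a rank-frequency analysis. It is not the authors' "word energy" or "temperature" method, whose definitions are not given in the abstract. The function names `rank_frequency` and `zipf_exponent`, the simple letter-based tokenizer, and the unweighted least-squares fit in log-log space are all assumptions made for illustration.

```python
import math
import re
from collections import Counter


def rank_frequency(text):
    """Return word counts sorted from most to least frequent.

    Assumption: a word is any run of letters; apostrophes and hyphens
    split words. A real study would likely tokenize more carefully.
    """
    words = re.findall(r"[^\W\d_]+", text.lower())
    return sorted(Counter(words).values(), reverse=True)


def zipf_exponent(frequencies):
    """Estimate s in f(r) ~ r**(-s) by least squares on log-log values.

    `frequencies` must be ordered by rank (rank 1 first) and contain at
    least two positive values.
    """
    n = len(frequencies)
    if n < 2:
        raise ValueError("need at least two ranks to fit a slope")
    xs = [math.log(rank) for rank in range(1, n + 1)]
    ys = [math.log(freq) for freq in frequencies]
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # The fitted slope is negative for a decaying distribution, so negate it.
    return -sxy / sxx
```

Applied to the full text of a book and of its translation, `zipf_exponent(rank_frequency(text))` gives one exponent per text. According to the abstract, this kind of Zipf analysis yields uniform results across texts, which is why the authors turn to a different distribution.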