Issue #48 – It’s all French Belgian Fries to me… or The Art of Multilingual e-Disclosure (Part II)

01 Aug 19

Author: Jérôme Torres Lozano, Director of Professional Services, Inventus

This is the second of a two-part guest post from Jérôme Torres Lozano, Director of Professional Services at Inventus, who shares his perspective on The Art of Multilingual e-Disclosure. In Part I, we learned about the challenges that languages pose in e-disclosure. In this post, he discusses the language identification and translation options available in e-discovery.

It’s all Greek to me!

Early options for language detection were rudimentary. They relied mostly on alphabets or writing systems, so they could differentiate Arabic, Cyrillic, Chinese and Latin scripts, for example, but could not provide any more granularity. Is it Farsi or Arabic, Russian or Ukrainian, Cantonese or Mandarin, Spanish or Italian… or is it all Greek?
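To make that limitation concrete, here is a minimal sketch of script-based detection in Python. It is not taken from any e-disclosure product: the function name detect_script and the SCRIPT_PREFIXES table are hypothetical, and the approach simply assumes that the prefix of each character's Unicode name is a good enough stand-in for its writing system.

```python
import unicodedata
from collections import Counter

# Hypothetical mapping from Unicode character-name prefixes to script labels.
# This is an illustrative assumption, not how any specific tool works.
SCRIPT_PREFIXES = {
    "ARABIC": "Arabic",
    "CYRILLIC": "Cyrillic",
    "CJK": "Chinese",
    "GREEK": "Greek",
    "LATIN": "Latin",
}


def detect_script(text):
    """Return the most common script among the letters in text, or None."""
    counts = Counter()
    for ch in text:
        if not ch.isalpha():
            continue  # skip digits, punctuation and whitespace
        name = unicodedata.name(ch, "")
        for prefix, script in SCRIPT_PREFIXES.items():
            if name.startswith(prefix):
                counts[script] += 1
                break
    return counts.most_common(1)[0][0] if counts else None


# Russian and Ukrainian greetings both come back as "Cyrillic",
# and Spanish and Italian both come back as "Latin".
print(detect_script("Привет"), detect_script("Привіт"))
print(detect_script("hola"), detect_script("ciao"))
```

A detector like this can tell the script apart, but it returns the same answer for Russian and Ukrainian, or for Spanish and Italian, which is exactly the lack of granularity described above.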

Then it all got a bit cleverer with the use of dictionaries but, as always with languages, nothing is that straightforward. A dictionary is of little use when you need to differentiate the Spanish word fresco from the Portuguese, Italian, Dutch or English fresco (meaning fresh, insolent, wall painting…). Languages within the same family tree share many
