This is from last March, but I found it today, and the title alone is worth passing along this set of slides. "Natural language isn’t just English, and NLP work should stop pretending that it is. If you’re a consumer of NLP tech (e.g. for text as data research), demand better." See also: Wenyan, "an esoteric programming language that closely follows the grammar and tone of classical Chinese literature. Moreover, the alphabet of wenyan contains only traditional Chinese characters and 「」 quotes, so it is guaranteed to be readable by ancient Chinese people."