Free online, PhD dissertation defended 10 December 2012, 321 pages.
The new H2020 European project “OpenMinTeD” will be the first to build an infrastructure that makes it much easier to mine, by making existing text mining tools discoverable, interoperable and usable. The infrastructure brings together text and data registries/databases and will develop a standard layer that makes it possible to easily find text and data from all over the world, and mine it. . . . It brings together the different stakeholders, content providers and scientific communities, text mining and infrastructure builders, legal experts, data and computing centres, industrial players and SMEs.
Sahil Kini: The India Internet – in 780 Languages, 86 Scripts, 29 Major Languages, 22 of Them Official
India today speaks 780 languages represented in 86 different scripts. 29 of them are spoken by at least a million people, and 22 are recognized by the Constitution as official languages. One of them, of course, is English. From being the language of colonial oppression, it has now risen to the status of the language of aspiration. Whenever I bring up the topic of the need for the promotion of Indian languages, I’m waved away by most who say that the rise of English speakers is inevitable. And perhaps it is.