Invisible East Documents included in latest OpenITI Release

We are delighted to announce that documentary materials from the Invisible East corpus are included in the latest version (2025.1.9) of the OpenITI corpus, available at Zenodo

This release includes documents written in Arabic, New Persian, Middle Persian, Bactrian and Judeo-Persian (Persian written in Hebrew script) amongst 9,106 unique books authored by 3,618 authors. The OpenITI corpus now consists of 14,107 text files. The complete collection—comprising books and manuscripts in multiple languages—contains a total of 2,348,893,857 words. For more information please see the blog post on the KITAB site.

screenshot 2026 02 12 at 15 40 39

 

Screenshot of document in OpenITI corpus.