About pamiri.online
Pamiri.online is a project dedicated to the study of Pamir languages. We develop digital resources for these languages and use them in our research on grammar and phonetics. Currently, we offer online dictionaries for several Pamir languages, including Shughni, Khufi, Rushani, Bartangi, Wakhi, and Sarikoli. Our platform also provides an online morphological analyzer and a corpus for the Shughni language. Please note that these resources are based on sources written in Russian, so using them requires some knowledge of Russian or the help of an online translator. We also host a research seminar on Iranian languages, where professional linguists and linguistics students present their research on Iranian languages and related fields.
Citing our project
Please cite this publication when referring to the project or any tool developed by us:
Acknowledgements
The first versions of the online dictionary of Shughni were created as part of the projects Computational and Linguistic Resources for the Shughni Language (2020–2021) and Computational and Corpus Instruments for Iranian Studies (2021–2022), which were supported by the Faculty of Humanities, HSE University. In 2023, we continued to digitize dictionaries of Pamir languages with the support of the Linguistic Convergence Laboratory.
We thank Umed Kalandarov for helping to cover the costs of website hosting in 2022–2023.
We are grateful to the University of Central Asia for supporting our fieldwork in Khorugh, Tajikistan.
Tools we host
Online dictionary
The online dictionary lets users search across a collection of dictionaries of Pamir languages, currently Shughni, Rushani, and Khufi (more languages will be added soon). The collection includes the digitized dictionaries mentioned above and entries created by our project.
Morphological analyzer
The morphological analyzer splits words into morphemes and glosses them, that is, assigns labels for their grammatical and lexical meaning. This allows one to perform automatic grammatical analysis of large text corpora.
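To illustrate what segmentation and glossing mean, here is a minimal sketch in Python. It is not the analyzer hosted on this site: the lexicon, the function name analyze, and the English examples are assumptions chosen purely for illustration, and a real analyzer for Shughni would rely on a full grammar and dictionary.

```python
# Toy lexicon: hypothetical English entries for illustration only,
# not data from the Pamiri.online analyzer.
STEMS = {"cat": "cat", "book": "book"}
SUFFIXES = {"s": "PL"}  # PL = plural


def analyze(word):
    """Return every (segmentation, gloss) pair the toy lexicon licenses."""
    analyses = []
    # A bare stem is a valid analysis on its own.
    if word in STEMS:
        analyses.append((word, STEMS[word]))
    # Try every split point and keep the stem+suffix pairs found in the lexicon.
    for i in range(1, len(word)):
        stem, suffix = word[:i], word[i:]
        if stem in STEMS and suffix in SUFFIXES:
            segmentation = f"{stem}-{suffix}"
            gloss = f"{STEMS[stem]}-{SUFFIXES[suffix]}"
            analyses.append((segmentation, gloss))
    return analyses


print(analyze("cats"))  # [('cat-s', 'cat-PL')]
```

A word with no match in the toy lexicon returns an empty list, which corresponds to a word left unanalyzed.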
Corpus
A corpus is a collection of annotated texts that users can search by grammatical features, morphemes, words and their combinations, translations, and more. In addition to written texts, our corpus includes transcribed oral stories aligned with audio files. The corpus runs on the Tsakorpus platform developed by Timofey Arkhangelsky.
Our team




