About pamiri.online

Pamiri.online is a project dedicated to the study of Pamir languages. We develop digital resources for these languages and use them in our own research on grammar and phonetics. We currently offer online dictionaries for several Pamir languages, including Shughni, Khufi, Rushani, Bartangi, Wakhi, and Sarikoli, as well as an online morphological analyzer and a corpus for the Shughni language. Please note that these resources are based on sources written in Russian, so using them requires some knowledge of Russian or the help of an online translator. We also host a research seminar on Iranian languages, where professional linguists and linguistics students present their research on Iranian languages and related fields.

Citing our project

Please cite this publication when referring to the project or any tool developed by us:

Yury Makarov, Maksim Melenchenko, and Dmitry Novokshanov. (2022). Digital Resources for the Shughni Language. Proceedings of The Workshop on Resources and Technologies for Indigenous, Endangered and Lesser-Resourced Languages in Eurasia within the 13th Language Resources and Evaluation Conference, 61–64. https://aclanthology.org/2022.eurali-1.9

Acknowledgements

The first versions of the online dictionary of Shughni were created as part of the projects Computational and Linguistic Resources for the Shughni Language (2020–2021) and Computational and Corpus Instruments for Iranian Studies (2021–2022), both supported by the Faculty of Humanities, HSE University. In 2023, we continued to digitize dictionaries of Pamir languages with the support of the Linguistic Convergence Laboratory.

We thank Umed Kalandarov for helping to cover the website hosting costs in 2022–2023.

We are grateful to the University of Central Asia for supporting our fieldwork in Khorugh, Tajikistan.

Tools we host

Online dictionary

The online dictionary lets users search across a collection of dictionaries of Pamir languages, currently Shughni, Rushani, and Khufi, with more languages to be added soon. The collection includes the digitized dictionaries mentioned above as well as entries created by our project.

Morphological analyzer

The morphological analyzer splits words into morphemes and glosses them, that is, assigns each morpheme a label for its grammatical or lexical meaning. This makes automatic grammatical analysis of large text corpora possible.
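
To illustrate what splitting and glossing mean in practice, here is a minimal sketch in Python. It is not the project's analyzer and contains no Shughni data: the tiny English lexicon, the gloss labels, and the function name `analyze` are all assumptions made up for this example, and a real analyzer handles far more than a single stem plus one suffix.

```python
# Toy illustration only: English stems and suffixes stand in for a real lexicon.
# Nothing here reflects the actual pamiri.online analyzer or its data.
STEMS = {"cat": "cat", "dog": "dog", "walk": "walk"}   # stem -> lexical gloss
SUFFIXES = {"s": "PL", "ed": "PST", "ing": "PROG"}     # suffix -> grammatical gloss


def analyze(word):
    """Return every (segmentation, gloss) pair the toy lexicon licenses."""
    analyses = []
    # A bare stem is a valid analysis on its own.
    if word in STEMS:
        analyses.append((word, STEMS[word]))
    # Try every split point and keep the stem + suffix pairs found in the lexicon.
    for i in range(1, len(word)):
        stem, suffix = word[:i], word[i:]
        if stem in STEMS and suffix in SUFFIXES:
            segmentation = f"{stem}-{suffix}"
            gloss = f"{STEMS[stem]}-{SUFFIXES[suffix]}"
            analyses.append((segmentation, gloss))
    return analyses


if __name__ == "__main__":
    for w in ["cats", "walked", "dog", "xyz"]:
        print(w, analyze(w))
```

In this sketch, a word the lexicon cannot account for gets an empty list, which is one simple way to mark tokens that still need manual annotation.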

Corpus

The corpus is a collection of annotated texts that users can search by grammatical features, morphemes, words and their combinations, translations, and more. In addition to written texts, it contains transcribed oral stories aligned with audio files. The corpus runs on the Tsakorpus platform developed by Timofey Arkhangelsky.

Our team

Artyom Badeev

Pamir languages, Old Iranian languages, typology, field linguistics, deixis, demonstratives, gender, indefinite pronouns, lexicography, language contacts

Yury Makarov

phonetics, phonology, digital lexicography

Maks Melenchenko

evidentiality, tense, aspect, modality, narratives, complex verbs, coordination, verbal morphology, syntax, writing system

Ekaterina Rakhilina

semantics, lexicology, corpus linguistics, cognitive linguistics, construction grammar, lexical typology, history of the Russian language

Alexander Sergienko

negation, Pamir languages, formal morphology, ergativity