2 views
Nacionalniy korpus krimskotatarskoyi movi
National Corpus of the Crimean Tatar Language (NKKM) span> is a one-of-a-kindelectronic array of Crimean Tatar texts covering different genres and historical eras. NKKMshould be considered as atool for comprehensive linguistic research, as well as the creation and development of a wide variety of automated systems (recognition language, machine translation, information search). The creation and development of the linguistic corpus is a particularly important factor in the development of the Crimean Tatar language, which is currently classified by UNESCO as a language under serious threat of extinction.
The purpose of this task is >develop a web service that will be a separate page on the siteCorpus, which will transliterate from Crimean Tatar Cyrillic to Latin using the current transliteration code at< span>https://github.com/qirimca/Transliterator.
. The web service will also provide the possibility ofreverse transliteration from Latin to Cyrillic. The web service will be compatible with the latest versions of browsers and will support Windows, Mac and Linux operating systems.Functional requirements
The web service will bea separate page on the website > ://ctcorpus.org /index.php/crh/ for transliteration from Crimean Tatar Cyrillic to Latin.
The web service will use the existing code to transliterate to < span>https://github.com/qirimca/Transliterator span> as the main logic for converting between Cyrillic and Latin.
The web service must allow users to download a transliterated document in format.txt or .docx, or even better if, in addition, conversion to .pptx, .xlsx is possible.
The web service will provide a user interface that will allow users to select text written in Crimean Tatar Cyrillic and click a button to translate it into Latin (and vice versa). The converted text will display the new text in a separate field. The web service will also provide a button for reverse transliteration from Latin to Cyrillic if the user wants to undo the conversion.
The web servicewill preserve the formatting of the original text.
If possible, the web servicewill determine the language of the selected text< span> and will transliterate only if the text is written in the Crimean Tatar language. If the text is written in another language, the web service will display a message that the tool cannot transliterate the text.
Non-functional requirements
A web service should be designed with a flexible architecture to accommodate future improvements and updates.
The web service must comply with all relevant data privacy and security regulations.
Additional requirements
The web service will supportmass transliteration span> documents.
The web service will allow users tocheck requirements and foreign words in the text.
The web service will be open source and published in our GitHub repository.
A web service should include a feedback mechanism for users to report about problems or suggest improvements.