UralicNLP
UralicNLP | |
---|---|
Field: | Science |
Type: | API |
github.com/mikahama/uralicNLP | |
Status: | stable |
Programming language: | Python |
Category:License: | Apache-2.0 License |
Maintainer: | Mika Hämäläinen |
UralicNLP is a natural language processing library targeted mainly for Uralic languages.
UralicNLP can produce morphological analyses, generate morphological forms, lemmatize words and give lexical information about words in Uralic and other languages. The languages we support include the following languages: Finnish, Russian, German, English, Norwegian, Swedish, Arabic, Ingrian, Meadow & Eastern Mari, Votic, Olonets-Karelian, Erzya, Moksha, Hill Mari, Udmurt, Tundra Nenets, Komi-Permyak, North Sami, South Sami and Skolt Sami. The information originates mainly in FST tools and dictionaries developed in the GiellaLT infrastructure. Currently, UralicNLP uses nightly builds for most of the supported languages.