UralicNLP

From Openresearch
Revision as of 16:57, 6 November 2020 by Mikahama (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search
UralicNLP
Field: Science
Type: API
github.com/mikahama/uralicNLP
Status: stable
Programming language: Python
Category:License: Apache-2.0 License
Maintainer: Mika Hämäläinen

UralicNLP is a natural language processing library targeted mainly for Uralic languages.

UralicNLP can produce morphological analyses, generate morphological forms, lemmatize words and give lexical information about words in Uralic and other languages. The languages we support include the following languages: Finnish, Russian, German, English, Norwegian, Swedish, Arabic, Ingrian, Meadow & Eastern Mari, Votic, Olonets-Karelian, Erzya, Moksha, Hill Mari, Udmurt, Tundra Nenets, Komi-Permyak, North Sami, South Sami and Skolt Sami. The information originates mainly in FST tools and dictionaries developed in the GiellaLT infrastructure. Currently, UralicNLP uses nightly builds for most of the supported languages.

Developers