Language Technology at UiT

The Divvun and Giellatekno teams build language technology aimed at minority and indigenous languages

View GiellaLT on GitHub divvungiellatekno/giellalt.uit.no

Se also seperate pages on [compound CompoundTags.html], [syntax docu-sme-syntaxtags.html] and dependency tags.

Guidelines for semantic tagging of proper nouns

The tags used are the following:

If +Sem/Plc, then +Sem/Sur is added in the affixfile, except for Saami placenames

Other combination tags (objplc, femsur, malfem, objplc, orgsur, malplc, surmal, femplc) are lexical.

Semantic tagging of other lemmas

Format

Note the tag format when the same entry has more than one tag:

Sem/Tagname1
Sem/Tagname1_Tagname2

All combinations shall be listed in the file root.lexc. In the cg3-files there will be sets:

Sem/Tagname1 = Sem/Tagname1 Sem/Tagname1_Tagname2
Sem/Tagname2 = Sem/Tagname1_Tagname2

Documentation on the tags forthcoming.