Language Technology at UiT The Arctic University of Norway

The Divvun and Giellatekno teams build language technology aimed at minority and indigenous languages

Speller meeting

What is needed for a new release:

Installer

There’s a separate document containing the test results for different combinations of MS Office and Windows versions.

DONE:

TODO:

POSTPONED:

PLX conversion

Nothing new since the last meeting on 14.12. Tomi has made a new speller, but its test results were identical to those of the previous version.

Remaining PLX bugs:

Second PRI

| REGRESSIONS | | | |
| --- | --- | --- | --- |
| doesn’t follow cmp-tags OR vowel-shortening | searvipresideanta > searvepresideanta | 489 | |
| doesn’t follow cmp-tags OR vowel-shortening | sámediggepresideanta Sámediggeáirrasin | 489 | FIXED |
| alphabet as non-first compound part & suggested CV-s | | 913 | |

| More regressions this week: | | | |
| --- | --- | --- | --- |
| noun+Pro Num+Noun/Prop wo hyph | máliSoussiid, guovttiolbmo | 397,461,642,721,804,805 | FIXED |
| noun+Pro Num+Noun/Prop wo hyph | uvdnaLot, muorraNRK | 595,649,805 | |
| doesn’t follow cmp-tags | ránubiellu > rátnobiellu beavddeguorra > beavdeguorra | 489,535,539,604 | FIXED |
| prop+noun not rec | Finnmárku-duoddara | 611,633 | |
| prop+noun not rec | Koskivuori-plánenreaiddut | 633 | FIXED |
| non-ex word accepted | loahpet, duvnnii, njealjat | 909,962 | FIXED |
| non-ex word accepted | adnii | 1143 | |
| compound not recognized | maŋŋegeašgálvu, lámpočuvggodeapmi | 408,419,451,489,522,535,536,541 | |
| double hyph-sugg | SF–ákkasteapmifierbmi | 536 | |
| Px-forms make comp | muorrastávrátgeavaheapmi, muorrastávrádegeavaheapmi | 786 | |

BUT: Tomi has only worked on nouns so far, not adjectives. If all the regressions involve adjectives, they are expected, since the noun changes still need to be applied to the adjectives as well. If the regressions also involve nouns, the problem is more serious.
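
The decision rule above can be sketched in a few lines of Python. This is only an illustration: the function name and the part-of-speech labels (`"A"` for adjective, `"N"` for noun) are assumptions, and the sample inputs are made up to exercise both branches; they are not taken from the test results.

```python
from typing import Iterable


def assess_regressions(pos_labels: Iterable[str]) -> str:
    """Classify a set of regressions by the part of speech each one involves.

    pos_labels: one hypothetical label per regression, "A" for adjective
    or "N" for noun. These labels are assumptions for illustration only.
    """
    labels = set(pos_labels)
    if not labels:
        return "no regressions"
    if labels == {"A"}:
        # Only adjectives are affected: the noun changes have simply
        # not been applied to the adjectives yet.
        return "expected"
    if "N" in labels:
        # Nouns are affected even though they were already worked on.
        return "serious"
    return "unclassified"  # labels outside the two cases discussed above


# Made-up inputs that exercise both branches of the rule
print(assess_regressions(["A", "A", "A"]))
print(assess_regressions(["A", "N"]))
```

In practice the labels would have to come from checking each regression word by hand or against the lexicon.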

TODO - ALT.1:

TODO - ALT.2:

Second Last PRI

Bugs from here on can be left out of the next release if we are short on time.

| Compounds | | |
| --- | --- | --- |
| num cmp:s on 0- | 051-nummarat | 631 |
| non-ex. word accepted | saame | 658 |

Last PRI

| Capitals | | doesn’t understand caps | 1700-LOHKU | 647 |


| Compound regressions | | | |
| --- | --- | --- | --- |
| “imposs” cmps along w num. | 0-geažideapmigárvu (geažideapmigárvu is impossible) | 536,1145 | NO SUGGESTIONS - GOOD - BUT: |
| “imposs” cmps | sákkasteapmifierbmi > Fs-ákkasteapmifierbmi etc | 536 | |

DONE:

TODO:

Release plan

The December 1 goal has passed without the targets being met. On the plus side, the number of open PLX bugs has been greatly reduced, and Tomi is squashing PLX bugs all the time. It just takes more time than anticipated.

The WiX alternative did not easily solve the installer problem: it turned out that the Greenlandic proofing tools installer has the same problems as ours.

Re-scheduling the release plan:


| Latest speller: | | |
| --- | --- | --- |
| non-words accepted | váigas, saame | 581,658,912 |
| prop-noun cmps don’t work | Oslo-biila, Pieski-lávvu | 397,426,593,609,611,633,649,802,930 |
| prop-prop cmps don’t work | Børde-Rene | 575,634 |
| prop-acr cmps don’t work | Seskarö-cd | 805 |
| noun-hyphnoun cmps don’t work | juleva-gielas | 629 |
| doesn’t follow cmp-tags OR vowel-shortening | sámediggepresideanta Sámediggeáirrasin | 489,535,604,639 |
| Px-forms make comp | muorrastávrátgeavaheapmi, muorrastávrádegeavaheapmi | 786 |
| alphabet as non-first compound part & suggested CV-s | | 913 |
| num cmp:s on 0- | 051-nummarat | 631 |
| “imposs” cmps | sákkasteapmifierbmi > ásaákkasteapmifierbmi | 536 |