Language Technology at UiT

The Divvun and Giellatekno teams build language technology aimed at minority and indigenous languages


Grammar checker meeting 6.12.2017

Present: Duommá, Kevin, Linda, Sjur

Agenda:

Since last time

Duommá

Problem:

sme $ echo “vuostái barge..“ | hfst-tokenise -g 'tools/grammarcheckers/tokeniser-gramcheck-gt-desc.pmhfst'  | vislcg3 --trace -g 'tools/grammarcheckers/../../src/syntax/valency.cg3' | vislcg3 --trace -g 'tools/grammarcheckers/../tokenisers/mwe-dis.cg3'  | cg-mwesplit |  divvun-blanktag 'tools/grammarcheckers/blanktagger.hfst'  | vislcg3 --trace -g 'tools/grammarcheckers/disambiguator.cg3' | vislcg3 --trace -g 'tools/grammarcheckers/grammarchecker.cg3'
"<“>"
"“" PUNCT LEFT <W:0.0000000000> &punct-aistton ADD:12802:punct-aistton ADD:12809:punct-aistton
"”" PUNCT <W:0.0000000000> RIGHT &SUGGEST ADD:12802:punct-aistton COPY:12804:punct-aistton
"<vuostái>"
"vuosti" N Sem/Wthr Sg Ill <W:0.0000000000> @ADVL> MAP:16947
"vuostá" N Sem/Food Sg Ill <W:0.0000000000> @ADVL> MAP:16947
"vuostái" Adv <W:0.0000000000> @ADVL> MAP:16947
; "vuostái" Pr <W:0.0000000000> REMOVE:6141:r1123
; "vuostái" Po <W:0.0000000000> REMOVE:6230:r1144
:
"<barge>"

Kevin

Problem:

LibreOffice plugin - different ways of integrating it:

Linda

Sjur

Multiple suggestions for the same word, overlap, deletion

Implemented in divvun-suggest; the web demo probably(?) needs to be updated to handle this as well. To use it, the suggestion reading (&SUGGEST) must carry the relevant error tag.

$ cat test/suggest/input.delete-and-suggest.cg
"<dego>"
        "dego" CS <W:0> @CNP ID:11
:
"<lávvomuorran>"
        "lávvomuorra" N Sem/Dummytag Ess <W:0> @COMP-CS< &syn-not-dego R:DELETE1:11
        "lávvomuorra" N Sem/Dummytag Sg Nom <W:0> @COMP-CS< &syn-dego-nom &SUGGEST
$ cat test/suggest/expected.delete-and-suggest.json |jq .
{
  "errs": [
    [
      "dego lávvomuorran",
      0,
      17,
      "syn-not-dego",
      "not dego (dette er ei leseleg forklaring)",
      [
        "lávvomuorran"
      ]
    ],
    [
      "dego lávvomuorran",
      0,
      17,
      "syn-dego-nom",
      "dego nom (dette er ei leseleg forklaring)",
      [
        "dego lávvomuorra"
      ]
    ]
  ],
  "text": "dego lávvomuorran"
}

Very good! Exactly what we want!
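For clients such as the web demo, a minimal sketch of how the JSON above could be consumed is given here. It assumes each entry in "errs" is a six-element array (flagged form, start offset, end offset, error tag, explanation, list of suggestions) and that the offsets are character offsets into "text"; both assumptions match the example output but are not confirmed as the divvun-suggest specification. The names `Err`, `parse_errs` and `apply_suggestion` are hypothetical and chosen for illustration only.

```python
import json
from typing import NamedTuple


class Err(NamedTuple):
    # Field names are assumptions; the JSON itself only has positional arrays.
    form: str          # the flagged stretch of text
    beg: int           # start offset into "text" (assumed: characters)
    end: int           # end offset, exclusive (assumed: characters)
    tag: str           # error tag, e.g. "syn-not-dego"
    msg: str           # human-readable explanation
    suggestions: list  # replacement strings for text[beg:end]


def parse_errs(raw: str):
    """Return the checked text and its errors from one JSON document."""
    doc = json.loads(raw)
    return doc["text"], [Err(*e) for e in doc["errs"]]


def apply_suggestion(text: str, err: Err, index: int = 0) -> str:
    """Replace the flagged span with one of the suggestions for that error."""
    return text[:err.beg] + err.suggestions[index] + text[err.end:]


# Copied from test/suggest/expected.delete-and-suggest.json as shown above.
SAMPLE = """
{
  "errs": [
    ["dego lávvomuorran", 0, 17, "syn-not-dego",
     "not dego (dette er ei leseleg forklaring)", ["lávvomuorran"]],
    ["dego lávvomuorran", 0, 17, "syn-dego-nom",
     "dego nom (dette er ei leseleg forklaring)", ["dego lávvomuorra"]]
  ],
  "text": "dego lávvomuorran"
}
"""

text, errs = parse_errs(SAMPLE)
for err in errs:
    # Two errors cover the same span; each is applied to the original text.
    print(err.tag, (err.beg, err.end), "->", apply_suggestion(text, err))
```

In this sketch, both errors cover the same span (0 to 17), so a client would have to offer them as alternatives rather than apply both. Deletion needs no special case here: the suggestion for syn-not-dego is simply the span with "dego" left out.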

Collaboration with Hanna O?

Postponed until after Linda's thesis defence.

Upcoming work

Duommá

Kevin

Linda

Sjur