Language Technology at UiT

The Divvun and Giellatekno teams build language technology aimed at minority and indigenous languages

View GiellaLT on GitHub divvungiellatekno/giellalt.uit.no

Meeting setup

Agenda

  1. Opening, agenda review
  2. Infrastructure:

    documentation system status - still issues open?

    still UTF-8 issues?

    physical infrastructure - network for Thomas & Maaren?

    (still big problems in the Samediggi network, although things have improved)

  3. Corpus gathering
  4. Corpus infrastructure
  5. Linguistics
  6. Term db:

    grammar(s)

    db editing

  7. Other issues
  8. Summary, task lists
  9. Closing

1. Opening, agenda review

Started 10.05

2. Infrastructure:

3. Corpus gathering

More addresses gathered. Asked Anne Britt to send the letter now to the ones we have the address of. No response so far.

Finnish addresses still missing.

Thomas: called Anders Kintel, positive to share his work, but has legal questions. Thomas has sent Børre’s letter, will call him again.

4. Corpus infrastructure

Tomi has been reading Saara’s notes, and the CSC site, getting into the DTD now.

Will construct a tree of the document structure.

Tiger-XML: [http://www.ims.uni-stuttgart.de/projekte/TIGER/]

TEI: [http://www.tei-c.org/]

Schema types:

5. Linguistics

Working with Trond’s list and adjectives.

Normativity is still an open question in many cases. Orthographic status can (and should) be tagged in the disambiguator output. To be discussed.

6. Term db

7. Other issues

8. Summary, task lists

TODO:

9. Closing

Next meeting: Monday 14.2., 10 AM.

Closed at 11.30.