Language Technology at UiT

The Divvun and Giellatekno teams build language technology aimed at minority and indigenous languages

View GiellaLT on GitHub divvungiellatekno/giellalt.uit.no

Page Content

OCR service

The OCR service consists of a backend API, which is a third-party written Rust api, which enables sending images up to it, and choosing a Tesseract model to read the text out of the image.

We run Nasjonalbiblioteket’s models, as well as our own. The ones from NB generally produces more accurate readings.

Accompanying the backend API, there is also a simple frontend application, written in SvelteKit.

Sitemap

Howtos
Language technology in Tromsø
Oversyn over arbeidsoppgåver i Divvun og Giellatekno
Personvern og Github
Reworkingthedivvunweb
admin/ - Admin:
apps/ - Apps:
- Gïelese documentation
- gielese/ - Gielese:
borin/ - Samarbeid med Lars Borin:
events/ - List of earlier events:
infra/ - Infrastructure:
lang/ - Lang:
presentations/ - Presentations by GiellaLT people at UiT:
proof/ - General documentation: