The Divvun and Giellatekno teams build language technology aimed at minority and indigenous languages
View GiellaLT on GitHub divvungiellatekno/giellalt.uit.no
These notes are not complete at the moment, but contains a number of things observed when helping a Cygwin user.
See the Cygwin home page, and follow the instructions there.
See the Java home page, and follow the instructions there.
The Cygwin environment is pretty bare-bones. This means that you need to install almost every tool and library using the Cygwin Setup.exe tool. Things you need to install:
To setup these tools, you use the Setup.exe tool that you used to install Cygwin. It will also install all dependencies required by each tool.
To install py-yaml, go to [http://pyyaml.org/wiki/PyYAML], download the tar.gz package, and follow the instructions on that page.
Download the latest saxon9he.jar
file from the Saxon home page, and store it in the directory lib/
in your Cygwin home directory (create the lib/
dir if it isn’t there already).
Lexc and Twolc are working with some quirks, but the build environment is now updated to work around those quirks.
Xfst is NOT working ATM, but the issues will be reported and hopefully fixed. The best alternative is to use HFST instead of Xerox (see below).
The latest HFST source code compiles on Cygwin after one local modification:
/usr/lib/gcc/i686-pc-cygwin/4.5.3/include/c++/tr1/functional_hash.h
in
an editor_TR1_hashtable_define_trivial_hash(unsigned long long);
//
at the beginning of the lineWhen done, follow these instructions:
wget http://sourceforge.net/projects/hfst/files/hfst/hfst-3.4.2.tar.gz
-
please check whether there is any newer version available by visiting
http://hfst.sf.net/
tar -xzf hfst-3.4.2.tar.gz
cd hfst-3.4.2
./configure --with-foma --enable-lexc --enable-windows
make
make install
Now you should have a working HFST installation!
As of version 3.4.2, all Hfst tools are working, except hfst-xfst
. This is a pretty serious limitation, and makes Hfst just as useful as the Xerox tools. But the bug has been reported, and will hopefully be fixed soon.
Assuming that the Xerox tools remain useless and that Hfst is soon fixed, you need to configure your languages to use Hfst and not Xerox. You do this as follows:
./configure --without-xfst --with-hfst
The reason is that the Xerox tools remain default for the time being, and hfst is only made available if explicitly asked for.