Don’t Panic
This page is for people with access to the Tromsø servers!
Get an overview on the status monitor for Giellatekno and Divvun services running on our servers The overview is refreshed every 5 minutes.
For the divvun.no, gtdict.uit.no, gtoahpa.uit.no, gtsvn.uit.no, gtweb.uit.no, satni.org servers at uit.no:
- Relevant persons are Børre Gaup, Anders Lorentsen, Sjur Moshagen, Trond Trosterud.
- Ultimately, IT, Seksjon for applikasjoner og tjenester (see
$GTPRIV/admin/emergency.txt)
The servers are down
If you cannot ssh to a server, contact ITA immediately.
The general route is to ssh into the server, and restart the service that is
not working.
Restart procedures for each server
- Server: api-giellalt.uit.no
- Domain on server: api-giellalt.uit.no
- TODO: Write documentation on this server
- Server: divvun.uit.no
- Domains on server: divvun.uit.no, divvun.no, divvun.org, indigenous-langtech.uit.no, giellalt.uit.no, giellatekno.uit.no, dicts.uit.no
- Restart instructions for divvun.no (the speller server)
- Server: gtdict.uit.no
- Domains on server: *.oahpa.no (all NDS versions)
- Restart instructions for gtdict.uit.no (NDS)
- Server: gtoahpa.uit.no
- Domain on server: oahpa.no, kursa.oahpa.no, kuvsje.oahpa.no
- Restart instructions for gtoahpa.uit.no (oahpa.no, Konteaksta, kursa and kuvsje)
- Server: gtsvn.uit.no
- Domain on server: pahkat.uit.no
- Restart instructions for gtsvn.uit.no (svn)
- Server: gtweb.uit.no
- Domain on server: jorgal.uit.no, gtweb.uit.no
- Services on server: MT, korp, cgi-bin
- Restart instructions for gtweb-01 (interactive web programs, MT, webdicts)
- Server: gtweb-02.uit.no
- Server: satni.uit.no
- Domains on server: satni.org, sátni.org, bahko.org, báhko.org, baakoe.org
- Restart instructions for satni.org
Specific processes
nginx is the web server on most servers. It is very robust, but in the event
that it is down, run sudo systemctl restart nginx to restart it.
- instructions for Gïelese (TODO: Update documentation)
- Look for commands with restart…
Test that everything is ok
Go back to the interactive monitor, refresh, and check whether the report has turned greener. (Remember: The monitor tests every 5 minutes).
A server is very slow
Log in to the machine via ssh and find out which processes are using most CPU resource by giving the command:
htop
The top part of the screen shows CPU usage per core, as well as memory usage. Sort by CPU or Memory in the process list below, by typing shift-p or shift-m, respectively. Processes that seem to use a lot of either CPU or Memory should be restarted.
The output looks something like
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
15221 neahtta 20 0 512m 303m 2848 R 19.2 3.9 2:24.15 python
9405 neahtta 20 0 512m 303m 2848 S 13.9 3.9 1:28.60 python
6239 mysql 20 0 896m 185m 5500 S 0.3 2.4 19:10.63 mysqld
.......
Observe the column %CPU. If some of the processes remains in the top of the table with a high value in this column then this is obviously the cause of the problem. To get more information about the problematic process give the command
ps -fp *PID*
where you replace PID by the concrete number, e.g. 15221 in the given example.
If it is an Oahpa (python2.7) process then you can see from this output which Oahpa it is. First, you can try to restart the particular Oahpa instance. After doing that check with command top if the problem has disappeared. If not then you can stop the process using the command
kill *PID*
(requires sudo rights)
Analysers do not work
The analysers are installed via the package manager (apt for our Ubuntu servers), from
apertium nightly, and is located in /usr/share/giella/XXX (where XXX is replaced by
the 3-letter language code. Any additional FSTs not stemming from apertium nightly is
also placed in these directories.
If an FST is missing, you may have to install the FSTs for that language, if it hasn’t
been installed yet. Try apt-get install giella-XXX, or install them manually if they
are not available in apertium nightly (which they should be, unless it’s a brand new
language).
If FSTs are not being updated, there is a build error in that language’s source code.
Check the build logs for that language, at https://apertium.projectjj.com/apt/logs/,
for example, for the sme logs: https://apertium.projectjj.com/apt/logs/giella-sme/build.log. Consult the relevant people for that language to resolve the issue. Notice that it
may take some time before the build system will build the language, and the server pulls
the new version.
The svn application is down, yet the gtsvn server is up-n-running (test it by logging into the server)
- error messages
sme$ svnup Updating '/Users/xxxxxx/gtsvn': svn: E000060: Unable to connect to a repository at URL 'https://gtsvn.uit.no/langtech/trunk' svn: E000060: Error running context: Operation timed out -
log into the gtsvn server
- find the processes running
[root@gtsvn ~]# lsof -i | grep http httpd 26024 apache 4u IPv6 15925 0t0 TCP *:http (LISTEN) httpd 26024 apache 6u IPv6 15929 0t0 TCP *:8099 (LISTEN) httpd 26024 apache 8u IPv6 15935 0t0 TCP *:https (LISTEN) - kill them
[root@gtsvn ~]# kill -KILL 26024 - start the service anew
ot@gtsvn ~]# service httpd start Starting httpd: [ OK ]