Filter and SQL service are stopping after some time

t.zilz

New Member
Jan 12, 2007
6
0
1
Hello,

i installed Proxmox 1.4 some months before and everything went fine. Yesterday i decided to upgrade to version 1.6. The upgrade process from 1.4 to 1.5 and from 1.5 to 1.6 was successful without any errors. But some hours later i recognized that a lot of mails was hanging in the mail queue. A look to the services shows me that the sql and filter service have been stopped. After restarting the services and flushing the mail queue the server worked well for some hours, but then again the two services was stopped. After that i downloaded the 1.6 cd image and installed and configured the server from the scratch (i configured the server manually, so that a corrupt backup configuration could not be the reason). This morning the two services has stopped again. I activated razor network but i'm not sure if the port is open on the firewall - could that be the reason for that?

Greetings, Thomas
 
Hello,

i installed Proxmox 1.4 some months before and everything went fine. Yesterday i decided to upgrade to version 1.6. The upgrade process from 1.4 to 1.5 and from 1.5 to 1.6 was successful without any errors. But some hours later i recognized that a lot of mails was hanging in the mail queue. A look to the services shows me that the sql and filter service have been stopped. After restarting the services and flushing the mail queue the server worked well for some hours, but then again the two services was stopped. After that i downloaded the 1.6 cd image and installed and configured the server from the scratch (i configured the server manually, so that a corrupt backup configuration could not be the reason). This morning the two services has stopped again. I activated razor network but i'm not sure if the port is open on the firewall - could that be the reason for that?

Greetings, Thomas

Hi Thomas,
maybe a hardware (disk) issue. the ports cannot be the reason. if you like, we can check your system via remote login (SSH).
For details send an email to support@proxmox.com.
 
Hi Thomas,
maybe a hardware (disk) issue. the ports cannot be the reason. if you like, we can check your system via remote login (SSH).
For details send an email to support@proxmox.com.

I installed the new proxmox 1.6 on a different system with hardware raid 5 and spare - array logs seems to be just fine.

There are some entries i found in the postgres.log that maybe could help:

This one have been generated at the beginning of the log (i guess directly after proxmox installation):

2007-01-11 17:18:51 [2977] LOG: database system was shut down at 2007-01-11 17:07:15 CET
2007-01-11 17:18:51 [2977] LOG: checkpoint record is at 0/9B0B8C
2007-01-11 17:18:51 [2977] LOG: redo record is at 0/9B0B8C; undo record is at 0/0; shutdown TRUE
2007-01-11 17:18:51 [2977] LOG: next transaction ID: 536; next OID: 17142
2007-01-11 17:18:51 [2977] LOG: database system is ready
2007-01-11 17:18:56 [2984] LOG: connection received: host=[local] port=
2007-01-11 17:18:56 [2984] LOG: connection authorized: user=postgres database=template1
2007-01-11 17:18:57 [2998] LOG: connection received: host=[local] port=
2007-01-11 17:18:57 [2998] LOG: connection authorized: user=postgres database=template1
2007-01-11 17:18:59 [3005] LOG: connection received: host=[local] port=
2007-01-11 17:18:59 [3005] LOG: connection authorized: user=postgres database=Proxmox_ruledb
2007-01-11 17:18:59 [3005] FATAL: database "Proxmox_ruledb" does not exist
2007-01-11 17:18:59 [3006] LOG: connection received: host=[local] port=
2007-01-11 17:18:59 [3006] LOG: connection authorized: user=postgres database=Proxmox_ruledb
2007-01-11 17:18:59 [3006] FATAL: database "Proxmox_ruledb" does not exist
2007-01-11 17:18:59 [3014] LOG: connection received: host=[local] port=
2007-01-11 17:18:59 [3014] LOG: connection authorized: user=postgres database=template1
2007-01-11 17:19:01 [3016] LOG: connection received: host=[local] port=
2007-01-11 17:19:01 [3016] LOG: connection authorized: user=postgres database=Proxmox_ruledb
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "attribut_pkey" for table "attribut"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE will create implicit sequence "object_id_seq" for "serial" column "object.id"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "object_pkey" for table "object"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE will create implicit sequence "objectgroup_id_seq" for "serial" column "objectgroup.id"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "objectgroup_pkey" for table "objectgroup"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE will create implicit sequence "rule_id_seq" for "serial" column "rule.id"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "rule_pkey" for table "rule"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "rulegroup_pkey" for table "rulegroup"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "cgreylist_pkey" for table "cgreylist"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "clusterinfo_pkey" for table "clusterinfo"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "dailystat_pkey" for table "dailystat"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "domainstat_pkey" for table "domainstat"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "statinfo_pkey" for table "statinfo"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE will create implicit sequence "cmailstore_id_seq" for "serial" column "cmailstore.id"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "cmailstore_pkey" for table "cmailstore"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / UNIQUE will create implicit index "cmailstore_id_key" for table "cmailstore"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE will create implicit sequence "cstatistic_id_seq" for "serial" column "cstatistic.id"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "cstatistic_pkey" for table "cstatistic"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / UNIQUE will create implicit index "cstatistic_id_key" for table "cstatistic"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "userprefs_pkey" for table "userprefs"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "virusinfo_pkey" for table "virusinfo"

a second one created after i had to cold restart the system because changing the IP-Adress freezes the Server:


2007-01-11 18:36:34 [3006] LOG: database system was interrupted at 2007-01-11 18:14:15 CET
2007-01-11 18:36:34 [3006] LOG: checkpoint record is at 0/A8DB6C
2007-01-11 18:36:34 [3006] LOG: redo record is at 0/A8DB6C; undo record is at 0/0; shutdown FALSE
2007-01-11 18:36:34 [3006] LOG: next transaction ID: 4521; next OID: 25334
2007-01-11 18:36:34 [3006] LOG: database system was not properly shut down; automatic recovery in progress
2007-01-11 18:36:34 [3006] LOG: record with zero length at 0/A8DBAC
2007-01-11 18:36:34 [3006] LOG: redo is not required
2007-01-11 18:36:37 [3006] LOG: database system is ready
2007-01-11 18:36:39 [3013] LOG: connection received: host=[local] port=
2007-01-11 18:36:39 [3013] LOG: connection authorized: user=postgres database=template1
2007-01-11 18:36:40 [3027] LOG: connection received: host=[local] port=
2007-01-11 18:36:40 [3027] LOG: connection authorized: user=postgres database=Proxmox_ruledb
2007-01-11 18:36:40 [3028] LOG: connection received: host=[local] port=
2007-01-11 18:36:40 [3028] LOG: connection authorized: user=postgres database=template1
2007-01-11 18:36:43 [3035] LOG: connection received: host=[local] port=
2007-01-11 18:36:43 [3035] LOG: connection authorized: user=postgres database=Proxmox_ruledb
2007-01-11 18:36:43 [3035] ERROR: language "plpgsql" already exists
2007-01-11 18:36:43 [3035] ERROR: column "mtime" of relation "userprefs" already exists
2007-01-11 18:36:43 [3035] ERROR: column "mtime" of relation "dailystat" already exists
2007-01-11 18:36:43 [3035] ERROR: column "mtime" of relation "domainstat" already exists

and a last one generated when the service crashed i think:


2007-01-12 00:30:17 [6538] LOG: connection authorized: user=postgres database=Proxmox_ruledb
2007-01-12 00:31:53 [3000] LOG: statistics collector process (PID 5741) was terminated by signal 9
2007-01-12 00:31:53 [3000] LOG: server process (PID 6492) was terminated by signal 9
2007-01-12 00:31:53 [3000] LOG: terminating any other active server processes
2007-01-12 00:31:53 [6496] WARNING: terminating connection because of crash of another server process
DETAIL: The postmaster has commanded this server process to roll back the current transaction and exit, because another server process exited abnormally and possibly corrupted shared memory.
HINT: In a moment you should be able to reconnect to the database and repeat your command.
2007-01-12 00:31:53 [6495] WARNING: terminating connection because of crash of another server process
DETAIL: The postmaster has commanded this server process to roll back the current transaction and exit, because another server process exited abnormally and possibly corrupted shared memory.
HINT: In a moment you should be able to reconnect to the database and repeat your command.
2007-01-12 00:31:54 [6494] WARNING: terminating connection because of crash of another server process
DETAIL: The postmaster has commanded this server process to roll back the current transaction and exit, because another server process exited abnormally and possibly corrupted shared memory.
HINT: In a moment you should be able to reconnect to the database and repeat your command.
2007-01-12 00:31:56 [3000] LOG: all server processes terminated; reinitializing
2007-01-12 00:31:56 [6542] LOG: database system was interrupted at 2007-01-11 20:54:57 CET
2007-01-12 00:31:56 [6542] LOG: checkpoint record is at 0/FFFBB4
2007-01-12 00:31:56 [6542] LOG: redo record is at 0/FFFBB4; undo record is at 0/0; shutdown TRUE
2007-01-12 00:31:56 [6542] LOG: next transaction ID: 8781; next OID: 33526
2007-01-12 00:31:56 [6542] LOG: database system was not properly shut down; automatic recovery in progress
2007-01-12 00:31:56 [6542] LOG: record with zero length at 0/FFFBF4
2007-01-12 00:31:56 [6542] LOG: redo is not required
2007-01-12 00:31:59 [6542] LOG: database system is ready
2007-01-12 00:32:18 [6543] LOG: connection received: host=[local] port=

Could the database be corrupted?

 
I installed the new proxmox 1.6 on a different system with hardware raid 5 and spare - array logs seems to be just fine.

There are some entries i found in the postgres.log that maybe could help:

This one have been generated at the beginning of the log (i guess directly after proxmox installation):

2007-01-11 17:18:51 [2977] LOG: database system was shut down at 2007-01-11 17:07:15 CET
2007-01-11 17:18:51 [2977] LOG: checkpoint record is at 0/9B0B8C
2007-01-11 17:18:51 [2977] LOG: redo record is at 0/9B0B8C; undo record is at 0/0; shutdown TRUE
2007-01-11 17:18:51 [2977] LOG: next transaction ID: 536; next OID: 17142
2007-01-11 17:18:51 [2977] LOG: database system is ready
2007-01-11 17:18:56 [2984] LOG: connection received: host=[local] port=
2007-01-11 17:18:56 [2984] LOG: connection authorized: user=postgres database=template1
2007-01-11 17:18:57 [2998] LOG: connection received: host=[local] port=
2007-01-11 17:18:57 [2998] LOG: connection authorized: user=postgres database=template1
2007-01-11 17:18:59 [3005] LOG: connection received: host=[local] port=
2007-01-11 17:18:59 [3005] LOG: connection authorized: user=postgres database=Proxmox_ruledb
2007-01-11 17:18:59 [3005] FATAL: database "Proxmox_ruledb" does not exist
2007-01-11 17:18:59 [3006] LOG: connection received: host=[local] port=
2007-01-11 17:18:59 [3006] LOG: connection authorized: user=postgres database=Proxmox_ruledb
2007-01-11 17:18:59 [3006] FATAL: database "Proxmox_ruledb" does not exist
2007-01-11 17:18:59 [3014] LOG: connection received: host=[local] port=
2007-01-11 17:18:59 [3014] LOG: connection authorized: user=postgres database=template1
2007-01-11 17:19:01 [3016] LOG: connection received: host=[local] port=
2007-01-11 17:19:01 [3016] LOG: connection authorized: user=postgres database=Proxmox_ruledb
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "attribut_pkey" for table "attribut"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE will create implicit sequence "object_id_seq" for "serial" column "object.id"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "object_pkey" for table "object"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE will create implicit sequence "objectgroup_id_seq" for "serial" column "objectgroup.id"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "objectgroup_pkey" for table "objectgroup"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE will create implicit sequence "rule_id_seq" for "serial" column "rule.id"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "rule_pkey" for table "rule"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "rulegroup_pkey" for table "rulegroup"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "cgreylist_pkey" for table "cgreylist"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "clusterinfo_pkey" for table "clusterinfo"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "dailystat_pkey" for table "dailystat"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "domainstat_pkey" for table "domainstat"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "statinfo_pkey" for table "statinfo"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE will create implicit sequence "cmailstore_id_seq" for "serial" column "cmailstore.id"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "cmailstore_pkey" for table "cmailstore"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / UNIQUE will create implicit index "cmailstore_id_key" for table "cmailstore"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE will create implicit sequence "cstatistic_id_seq" for "serial" column "cstatistic.id"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "cstatistic_pkey" for table "cstatistic"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / UNIQUE will create implicit index "cstatistic_id_key" for table "cstatistic"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "userprefs_pkey" for table "userprefs"
2007-01-11 17:19:01 [3016] NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "virusinfo_pkey" for table "virusinfo"

a second one created after i had to cold restart the system because changing the IP-Adress freezes the Server:

2007-01-11 18:36:34 [3006] LOG: database system was interrupted at 2007-01-11 18:14:15 CET
2007-01-11 18:36:34 [3006] LOG: checkpoint record is at 0/A8DB6C
2007-01-11 18:36:34 [3006] LOG: redo record is at 0/A8DB6C; undo record is at 0/0; shutdown FALSE
2007-01-11 18:36:34 [3006] LOG: next transaction ID: 4521; next OID: 25334
2007-01-11 18:36:34 [3006] LOG: database system was not properly shut down; automatic recovery in progress
2007-01-11 18:36:34 [3006] LOG: record with zero length at 0/A8DBAC
2007-01-11 18:36:34 [3006] LOG: redo is not required
2007-01-11 18:36:37 [3006] LOG: database system is ready
2007-01-11 18:36:39 [3013] LOG: connection received: host=[local] port=
2007-01-11 18:36:39 [3013] LOG: connection authorized: user=postgres database=template1
2007-01-11 18:36:40 [3027] LOG: connection received: host=[local] port=
2007-01-11 18:36:40 [3027] LOG: connection authorized: user=postgres database=Proxmox_ruledb
2007-01-11 18:36:40 [3028] LOG: connection received: host=[local] port=
2007-01-11 18:36:40 [3028] LOG: connection authorized: user=postgres database=template1
2007-01-11 18:36:43 [3035] LOG: connection received: host=[local] port=
2007-01-11 18:36:43 [3035] LOG: connection authorized: user=postgres database=Proxmox_ruledb
2007-01-11 18:36:43 [3035] ERROR: language "plpgsql" already exists
2007-01-11 18:36:43 [3035] ERROR: column "mtime" of relation "userprefs" already exists
2007-01-11 18:36:43 [3035] ERROR: column "mtime" of relation "dailystat" already exists
2007-01-11 18:36:43 [3035] ERROR: column "mtime" of relation "domainstat" already exists

and a last one generated when the service crashed i think:

2007-01-12 00:30:17 [6538] LOG: connection authorized: user=postgres database=Proxmox_ruledb
2007-01-12 00:31:53 [3000] LOG: statistics collector process (PID 5741) was terminated by signal 9
2007-01-12 00:31:53 [3000] LOG: server process (PID 6492) was terminated by signal 9
2007-01-12 00:31:53 [3000] LOG: terminating any other active server processes
2007-01-12 00:31:53 [6496] WARNING: terminating connection because of crash of another server process
DETAIL: The postmaster has commanded this server process to roll back the current transaction and exit, because another server process exited abnormally and possibly corrupted shared memory.
HINT: In a moment you should be able to reconnect to the database and repeat your command.
2007-01-12 00:31:53 [6495] WARNING: terminating connection because of crash of another server process
DETAIL: The postmaster has commanded this server process to roll back the current transaction and exit, because another server process exited abnormally and possibly corrupted shared memory.
HINT: In a moment you should be able to reconnect to the database and repeat your command.
2007-01-12 00:31:54 [6494] WARNING: terminating connection because of crash of another server process
DETAIL: The postmaster has commanded this server process to roll back the current transaction and exit, because another server process exited abnormally and possibly corrupted shared memory.
HINT: In a moment you should be able to reconnect to the database and repeat your command.
2007-01-12 00:31:56 [3000] LOG: all server processes terminated; reinitializing
2007-01-12 00:31:56 [6542] LOG: database system was interrupted at 2007-01-11 20:54:57 CET
2007-01-12 00:31:56 [6542] LOG: checkpoint record is at 0/FFFBB4
2007-01-12 00:31:56 [6542] LOG: redo record is at 0/FFFBB4; undo record is at 0/0; shutdown TRUE
2007-01-12 00:31:56 [6542] LOG: next transaction ID: 8781; next OID: 33526
2007-01-12 00:31:56 [6542] LOG: database system was not properly shut down; automatic recovery in progress
2007-01-12 00:31:56 [6542] LOG: record with zero length at 0/FFFBF4
2007-01-12 00:31:56 [6542] LOG: redo is not required
2007-01-12 00:31:59 [6542] LOG: database system is ready
2007-01-12 00:32:18 [6543] LOG: connection received: host=[local] port=
 
as I see there was an unclean restart of the system. to debug and repair the current system we need to login via SSH - alternative you can do another reinstall. version 1.6 is running quite stable under normal circumstances. changing the IP should not freeze the system - can you reproduce this?
 
After deactivating Razor spam detection on friday the server seems to run through the weekend without any errors. Did razor write any logs?
 
Thanks for the answers! I checked the razor logs and it shows me that the port 2703 is blocked by the firewall. Now the ports for razor are open, but after activating razor over the webinterface i got the following message in the system log:

Jan 16 13:21:48 proxdaemon: registering razor failed: perl: warning: Setting locale failed. perl:
warning: Please check that your locale settings: ^ILANGUAGE =
"", ^ILC_ALL = "de", ^ILANG = (unset) are
supported and installed on your system. perl: warning: Falling back
to the standard locale ("C").

SQL and Filter services still running, so i hope they won't stop again. Do you know something about this error?
 
you can safely ignore these message "registering razor failed"
in the current version, proxmox does not report to razor network (just doing queries) and therefore we do not register there.
 
Now it's two days ago that i activated razor and everything runs well. I guess the closed razor port was the source for the errors.

Anyway, thanks for the great support!