New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[dev.icinga.com #12985] Downtimes disappearing shortly after icinga reload #4747
Comments
Updated by elippmann on 2016-10-27 11:13:42 +00:00
Hi, Could you please post the output of: ls -lah /var/lib/icinga2/api/packages/_api/ and the contents of the files from the directory above:
Are you running a single instance setup? Best regards, |
Updated by rdesanno on 2016-10-27 14:15:38 +00:00 Here you go. Also, we are running a single instance for what it's worth.
|
Updated by rdesanno on 2016-10-27 14:20:53 +00:00 I also noticed that the keys for the icinga_scheduleddowntime table dont seem to match the schema that I found online. I'm not sure how this could have happened over the course of the few upgrades that happened over the year and going to do a fresh install on another box to compare. Not sure if its related but thought I would point it out because this doesn't look right either.
|
Updated by rdesanno on 2016-10-27 15:54:33 +00:00 For example, this is what I see upon reload. Note that I had 1028 rows of downtime set and after reload, it was an empty set. Immediately after reload, there were no downtimes in effect under /icingaweb2/monitoring/list/downtimes in the GUI however when I looked at one of the hosts that previously had a downtime set before the reload, it still had a "plug" icon next to it, at least for a few seconds before disappearing.
Here is an example of how newly entered downtime records are being recorded if it helps. I'm curious what you think about the duration = 0.
|
Updated by nlm on 2016-11-10 10:39:58 +00:00 The duration column must be the duration attribute of flexible downtimes. From the documentation :
|
Updated by elippmann on 2016-11-10 11:12:32 +00:00 Thanks for the SQL output. But could you please resend this w/ select *? Are you using Web 2 for transmitting external commands? |
Updated by rdesanno on 2016-11-10 18:59:26 +00:00 Thanks for the input but after dealing with this bug for so long, I have decided to do a baremetal wipe / install and can no longer recreate this bug. I'm convinced that something in the upgrade path had broken this for us and my testing revealed that a new install was stable. FYI to anyone else having this issue. |
Updated by rdesanno on 2016-11-10 19:00:03 +00:00 Feel free to close this ticket. |
Updated by mfriedrich on 2016-12-07 21:27:30 +00:00
I have the feeling that this is related to the fact that sometimes /include.conf gets missing, see #13251. Still I'll close here as requested ;) |
This issue has been migrated from Redmine: https://dev.icinga.com/issues/12985
Created by rdesanno on 2016-10-25 20:42:56 +00:00
Assignee: rdesanno
Status: Closed (closed on 2016-12-07 21:27:30 +00:00)
Target Version: (none)
Last Update: 2016-12-07 21:27:30 +00:00 (in Redmine)
The root of the problem that we are seeing in our shop is whenever a host or service is set with a downtime or acknowledgement, and icinga is reloaded, those downtimes and/or comments get erased from the database and removed from the webui. It's pretty consistent in our environment and can replicate it very easily by doing the following:
set a downtime, acknowledgement, persistent acknowledgement with expiration
reload icinga2
wait
I have gone so far as to erase all comments and downtimes under our _api directory and recreated the database, but this did not help. I can literally watch the lines disappear from tables icinga_comments or icinga_scheduleddowntime and haven't a clue on how to fix it.
The only thing I can tell is that comments et al, might persist if entered on a different day than the reload takes place, but im not sure that this is 100% true.
Below are the versions we are running if it helps any:
CentOS7 3.10.0-327.36.1.el7.x86_64
The text was updated successfully, but these errors were encountered: