New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[dev.icinga.com #11686] Icinga Crash with the workflow Create_Host-> Downtime for the Host -> Delete Downtime -> Remove Host #4170
Comments
Updated by saurabh_hirani on 2016-04-28 10:19:43 +00:00 I verified the same and this is the case. Create host, downtime host, delete downtime, remove host - leads to the exact same error: Apr 28 07:39:12 vagrant kernel: [11121.769790] traps: icinga2[22623] general protection ip:7f199c3c722f sp:7f1995c80950 error:0 in libstdc**.so.6.0.16[7f199c313000+e2000] This is a pretty normal activity as hosts will transition through this cycle. Causing an entire icinga2 instance to fail is highly problematic in production systems. |
Updated by Christian_vlc on 2016-04-28 12:35:31 +00:00 I tried with: 1- Only the downtime on the Host (without the Services) But it doesnt help |
Updated by mfriedrich on 2016-05-02 07:11:15 +00:00
|
Updated by mfriedrich on 2016-05-02 13:45:55 +00:00
|
Updated by mfriedrich on 2016-05-02 14:27:12 +00:00
|
Updated by mfriedrich on 2016-05-02 14:27:58 +00:00
Paw test file attached. |
Updated by mfriedrich on 2016-05-02 14:51:29 +00:00
Looks like a race condition to me. At this stage the host object is not fully available when enforcing a dynamic_pointer_cast. Turns out when debugging and stepping into the functions, it works as expected from inside the debugger.
|
Updated by mfriedrich on 2016-05-02 15:10:39 +00:00 For some reason DependencyGraph::GetParents() returns in incomplete parent object vector which causes trouble here. If you stop the debugger and let another thread finish updating the dependency graph, everything is fine.
|
Updated by saurabh_hirani on 2016-05-06 14:26:34 +00:00 Sorry to be rushing but is there a targeted fix for it? This is making icinga2 API unsuable and causing crashes multiple times a day. |
Updated by mfriedrich on 2016-05-06 14:34:04 +00:00 If you can help me find a fix, I'll happily schedule a release target. Apparently I know where the problem lies but investigating on a fix takes longer than expected. Kind regards, |
Updated by saurabh_hirani on 2016-05-07 04:04:37 +00:00 dnsmichi wrote:
Got that. Thanks for the update Michael. I am not good at C/C** - so I cannot decipher the coding part clearly enough to contribute back. But I can help out with scenario testing in any way if possible. -- |
Updated by gbeutner on 2016-05-09 12:29:43 +00:00
|
Updated by gbeutner on 2016-05-09 12:31:11 +00:00
|
Updated by gbeutner on 2016-05-09 12:32:38 +00:00
Applied in changeset b8e911b. |
Updated by gbeutner on 2016-05-10 09:54:53 +00:00
|
Updated by mfriedrich on 2016-05-10 11:21:26 +00:00
|
Updated by mfriedrich on 2016-05-10 11:36:48 +00:00
|
Updated by mfriedrich on 2016-05-11 07:38:56 +00:00
|
This issue has been migrated from Redmine: https://dev.icinga.com/issues/11686
Created by Christian_vlc on 2016-04-27 09:38:26 +00:00
Assignee: gbeutner
Status: Resolved (closed on 2016-05-09 12:32:38 +00:00)
Target Version: 2.4.8
Last Update: 2016-05-11 07:38:56 +00:00 (in Redmine)
Hi,
Icinga Crash if I remove one host after that I created/removed one downtime for this host:
In the dmesg ist one segfault error, but in Icinga logs are any Error (debug not too).
icinga2[23538]: segfault at 38 ip 00007fb9ed09343a sp 00007fb9dbc2eaf0 error 4 in libstdc**.so.6.0.17[7fb9ed031000+e8000]
After this error, I must restart Icinga and then I can remove this host.
I tried with flexibel and fixed Downtimes.. .but every time the same result...
Some Idea??
Attachments
Changesets
2016-05-09 12:30:12 +00:00 by gbeutner b8e911b
2016-05-12 09:08:21 +00:00 by gbeutner d82db2a
Relations:
The text was updated successfully, but these errors were encountered: