It also removes the extra `SendNextUpdate()` call from the
`NewCheckResultHandler` handler in Icinga DB: since that handler is
subscribed to the `NextCheckChanged` event anyway, and that event is
always emitted before the `NewCheckResult` event gets triggered, the
call became redundant.
This commit introduces a new, somewhat special `OnRescheduleCheck` signal
that is emitted whenever we want to tell the checker to reschedule the
checkable at a specific timestamp without actually changing the next
check time. Previously, we called `SetNextCheck` with a more or less
arbitrary timestamp just to force the checker to pick the checkable up
either immediately or at a specific time. Over time, subscribing to the
`OnNextCheckChanged` signal thus became effectively unusable for any
purpose other than informing the checker about a new next check time,
which led to the introduction of a separate signal solely responsible
for informing Icinga DB and the IDO about a new next check time in the
places where calling `SetNextCheck` did make sense. This commit does
quite the opposite: it replaces all calls to `SetNextCheck` that were
only used to inform the checker about a new next check time with
`OnRescheduleCheck` calls. Only the places where we actually want to
change the next check time still call `SetNextCheck` and thus inform
the checker and all other listeners about the new next check time. As
a bonus, we also got rid of holding the two object locks for child and
parent at the same time.
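Below is a minimal sketch of what such a signal could look like, assuming
the boost::signals2 pattern used for the other Checkable signals; the
actual signature and class layout in the code base may differ:

```cpp
#include <boost/signals2.hpp>

class Checkable; // stand-in for the real, intrusive-ptr based object type

// Hypothetical sketch: a dedicated signal that only asks the checker to pick
// the checkable up at the given timestamp, without touching the stored next
// check time and without notifying Icinga DB, IDO or the cluster.
struct RescheduleSignalSketch
{
	static boost::signals2::signal<void (const Checkable*, double)> OnRescheduleCheck;
};

boost::signals2::signal<void (const Checkable*, double)> RescheduleSignalSketch::OnRescheduleCheck;

// Emitting side: instead of SetNextCheck(now), which would also notify every
// OnNextCheckChanged subscriber, only the checker reacts to this signal.
void RescheduleSoon(const Checkable* checkable, double when)
{
	RescheduleSignalSketch::OnRescheduleCheck(checkable, when);
}
```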
Since the scheduler accounts for already running checks, we only need to
update the `next_check` timestamp in `Checkable::ExecuteCheck()` where it
actually makes sense to do so, and for local checks it doesn't make sense
at all. There are only two cases where we need to update the next check
beforehand:
1) The execute command event is sent to a connected remote endpoint, so
we need to set the next check to a time in the future until we actually
receive the check result back from the remote endpoint. However, it must
not be too far in the future, so that the check is still re-run within a
reasonable time in case the remote endpoint never responds.
2) The check is a remote check, but the endpoint is either currently syncing
replay logs or not connected at all, and we are within the magical 5min
cold startup window. In these cases the check is effectively skipped and
no check result will come in for it, so we manually update the next check
as if the check had been executed.
In all other cases, either the check is executed locally, which means the
`m_RunningCheck` flag already prevents the scheduler from re-running the check,
or this is a remote check whose endpoint is not connected while we are outside
the cold startup window, in which case we also don't do anything as we've
already called `Checkable::ProcessCheckResult()` with an appropriate error
state, which in turn calls `Checkable::UpdateNextCheck()`.
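A rough, hypothetical sketch of the resulting decision (the names and the
surrounding Checkable interface are simplified assumptions, not the actual
code):

```cpp
// Stand-in for the real Checkable interface, just enough to show the branching.
struct CheckableStub
{
	void SetNextCheck(double ts) { nextCheck = ts; }
	void UpdateNextCheck() { nextCheck = now + checkInterval; } // as after a real result

	double now = 0, checkInterval = 60, nextCheck = 0;
};

void MaybeUpdateNextCheck(CheckableStub& checkable, bool sentToConnectedEndpoint,
	bool remoteCheckSkipped, bool withinColdStartupWindow, double resultTimeout)
{
	if (sentToConnectedEndpoint) {
		// Case 1: bound how long we wait for the remote result before re-running.
		checkable.SetNextCheck(checkable.now + resultTimeout);
	} else if (remoteCheckSkipped && withinColdStartupWindow) {
		// Case 2: no result will ever arrive, so advance next_check as if the
		// check had been executed.
		checkable.UpdateNextCheck();
	}

	// All other cases: either m_RunningCheck already blocks rescheduling (local
	// checks), or ProcessCheckResult() with an error state updates next_check.
}
```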
This commit changes the ordering of CheckableScheduleInfo in the
multi-index container to ensure that checkables with running checks are
pushed to the end of the ordering. This prevents them from being
prioritized for scheduling ahead of others, which could lead to
unnecessary CPU load due to repeated scheduling attempts. By using a
very large value for the index of checkables with running checks, they
are effectively deprioritized until their current check is completed and
they can be reinserted with their actual next check time.
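A minimal sketch of such an ordering key (hypothetical names; the real code
uses the multi-index container's own key extraction):

```cpp
#include <limits>

struct ScheduleInfoSketch
{
	double NextCheck;
	bool CheckRunning;

	// Checkables with a running check sort behind everything else instead of
	// being picked up again and again at the front of the index.
	double GetOrderingKey() const
	{
		return CheckRunning ? std::numeric_limits<double>::max() : NextCheck;
	}
};
```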
It's annoying to have to wait 10 seconds for the `liveness_disconnect`
test to complete, so make the timeout configurable and set it to a much
lower value when testing the functionality.
So far, calling Checkable::IsReachable() traversed all possible paths to its
parents. In case a parent is reachable via multiple paths, all of its parents
were evaluated multiple times, resulting in worst-case exponential complexity.
With this commit, the implementation keeps track of which checkables were
already visited and uses the already-computed reachability instead of repeating
the computation, ensuring a worst-case linear runtime within the graph size.
Checkable::IsReachable() and DependencyGroup::GetState() call each other
recursively. Moving them to a common helper class allows adding caching to them
in a later commit without having to pass a cache between the functions (through
a public interface) or resorting to thread_local variables.
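A minimal sketch of the idea, with hypothetical names (the real helper wraps
the mutually recursive Checkable::IsReachable()/DependencyGroup::GetState()
lookups and the dependency graph traversal):

```cpp
#include <unordered_map>

class ReachabilityResolverSketch
{
public:
	bool IsReachable(const void* checkable)
	{
		// Reuse the result if this checkable was already visited during this
		// evaluation; this is what turns the exponential worst case into a
		// linear one for diamond-shaped dependency graphs.
		auto it = m_Cache.find(checkable);
		if (it != m_Cache.end())
			return it->second;

		bool reachable = ComputeReachability(checkable); // recurses via the group state
		m_Cache.emplace(checkable, reachable);
		return reachable;
	}

private:
	bool ComputeReachability(const void*) { return true; } // placeholder for the real traversal

	// Lives only as long as one evaluation, so no thread_local state and no
	// cache parameter in the public interface is needed.
	std::unordered_map<const void*, bool> m_Cache;
};
```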
This commit removes the existing m_IsNoOp bool and instead wraps the m_Buffer
std::ostringstream into std::optional. Functionally, this is pretty much the
same, with the exception that std::ostringstream is no longer constructed for
messages that will be discarded later.
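A minimal sketch of the change, using hypothetical names rather than the
actual Log class:

```cpp
#include <optional>
#include <sstream>
#include <string>

class LogSketch
{
public:
	explicit LogSketch(bool noOp)
	{
		// Replaces the separate m_IsNoOp flag: the ostringstream is only
		// constructed for messages that will actually be emitted.
		if (!noOp)
			m_Buffer.emplace();
	}

	bool IsNoOp() const { return !m_Buffer.has_value(); }

	std::string Str() const { return m_Buffer ? m_Buffer->str() : std::string(); }

private:
	std::optional<std::ostringstream> m_Buffer;
};
```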
There already is a template operator<< implemented, so far only for const
references though. Changing this to perfectly forward the argument to the
corresponding operator in the underlying std::ostringstream allows handling all
the cases there, removing the need for a separate overload for const char*.
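A sketch of the forwarding operator under these assumptions (simplified, not
the exact signature in the code base):

```cpp
#include <sstream>
#include <utility>

class LogStreamSketch
{
public:
	// One template handles const references, non-const references and string
	// literals alike, so no dedicated const char* overload is needed.
	template<typename T>
	LogStreamSketch& operator<<(T&& value)
	{
		m_Buffer << std::forward<T>(value);
		return *this;
	}

private:
	std::ostringstream m_Buffer;
};
```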
Once the new worker process has read the config, it also includes an
`include */include.conf` statement within the config package's root
directory, and from then on we must not allow deleting any stage
directory from the config package. Otherwise, when the worker actually
evaluates that include statement, it will fail to find the directory
where the include file is located, or the `active.conf` file, which is
included from each stage's `include.conf` file, thus causing the worker
to fail.
Co-Authored-By: Johannes Schmidt <johannes.schmidt@icinga.com>
Previously, we used a simple boolean to track the state of the package updates
and didn't reset it when the config validation was successful, because it was
assumed that if we successfully validated the config beforehand, the worker
would also successfully reload the config afterwards and the old worker would
be terminated. However, this assumption doesn't always hold, for a number of
reasons. The most obvious one is that after we successfully validated the
config, the config might have changed again before the worker was able to
reload it. If that happens, the new worker might fail to validate the config
due to the recent changes, in which case the old worker remains active and
this flag stays set to true, causing any subsequent request to fail with a
`423` until you manually restart the Icinga 2 service.
So, in order to prevent such a situation, we additionally track the last time
a reload failed and allow bypassing the `m_RunningPackageUpdates` flag only if
the last reload failure time has changed since the previous request.
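A hypothetical sketch of that bypass rule (member and parameter names are
assumptions, not the actual implementation):

```cpp
#include <atomic>

class PackageUpdateStateSketch
{
public:
	// lastSeenReloadFailed: the failure timestamp the client observed with its
	// previous request.
	bool AllowRequest(double lastSeenReloadFailed) const
	{
		if (!m_RunningPackageUpdates)
			return true;

		// The flag is still set, but a reload has failed in the meantime: the
		// old worker kept running, so don't keep answering with 423 forever.
		return m_LastReloadFailed > lastSeenReloadFailed;
	}

	std::atomic<bool> m_RunningPackageUpdates{false};
	std::atomic<double> m_LastReloadFailed{0};
};
```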
This commit introduces a small helper class that wraps any writer and
provides a flush operation that performs the corresponding action if the
writer is an AsyncJsonWriter and does nothing otherwise.
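A minimal sketch of that wrapper, with assumed type names (only the
AsyncJsonWriter name itself is taken from the commit):

```cpp
struct Writer { virtual ~Writer() = default; };

struct AsyncJsonWriterStub : Writer
{
	void Flush() { /* write out any buffered JSON */ }
};

class FlushHelperSketch
{
public:
	explicit FlushHelperSketch(Writer& writer) : m_Writer(writer) { }

	void Flush()
	{
		// Only asynchronous writers need (and offer) an explicit flush; for
		// every other writer type this is a no-op.
		if (auto* async = dynamic_cast<AsyncJsonWriterStub*>(&m_Writer))
			async->Flush();
	}

private:
	Writer& m_Writer;
};
```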
Replacing invalid UTF-8 characters beforehand by ourselves doesn't make
any sense; the serializer can perform the very same replacement, with the
exact same Unicode replacement character (U+FFFD), on its own. So, why not
just use it directly? Instead of wasting memory on a temporary `String`
object to UTF-8-validate each and every value, we just use the serializer
directly to dump the replaced characters (if any) into the output writer.
No memory waste, no fuss!
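For illustration, a standalone example assuming nlohmann/json as the
underlying serializer; its dump() error handler already performs exactly
this substitution:

```cpp
#include <iostream>
#include <string>
#include <nlohmann/json.hpp>

int main()
{
	// "\xff" is not valid UTF-8; with error_handler_t::replace the serializer
	// substitutes U+FFFD on its own, so no pre-validation pass is needed.
	nlohmann::json j = std::string("valid \xff invalid");
	std::cout << j.dump(-1, ' ', false, nlohmann::json::error_handler_t::replace) << '\n';
}
```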
The Array, Dictionary, and Namespace types provide a Freeze() method that makes
them read-only. So far, there was the possibility to call some methods with
`overrideFrozen=true` which would then bypass the corresponding check and allow
modification of the data structures nonetheless.
With 24b57f0d3a222835178e88489eabd595755ed883, this possibility was already
removed from the Namespace type. However, for interface compatibility, it kept
the parameter and simply ignores it, throwing an exception on any modification
of a frozen instance.
The only place using `overrideFrozen` was processing of the `-D`/`--define`
command line flag that allows setting additional variables in the DSL. At the
time it is evaluated, there are no user-created data structures yet that could
be frozen, so the only frozen objects that could be encountered are Namespaces
(Icinga doesn't freeze other types by itself) and for these, `overrideFrozen`
already has no effect.
Hence, there is no harm in removing `overrideFrozen` altogether. This
simplifies the code and also means that frozen objects are now indeed read-only
without exceptions, allowing further optimizations regarding locking in the
future.
The wait group gets passed to HttpServerConnection, then down to the
HttpHandlers. For those handlers that modify the program state, the
wait group is locked so ApiListener will wait on Stop() for the
request to complete. If the request iterates over config objects,
a further check on the state of the wait group is added to abort early
and not delay program shutdown. In that case, 503 responses will be
sent to the client.
Additionally, in HttpServerConnection, no requests other than the one
already started will be allowed once the wait group is joining.
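A hypothetical sketch of the handler-side pattern (the WaitGroup interface
shown here is an assumption, not the actual class):

```cpp
class WaitGroupSketch
{
public:
	bool TryLock() { return !m_Joining; }      // fails once Stop() starts joining
	void Unlock() { }
	bool IsLockable() const { return !m_Joining; }

	bool m_Joining = false;
};

// Returns an HTTP status code for brevity.
int HandleRequestSketch(WaitGroupSketch& wg)
{
	if (!wg.TryLock())
		return 503; // shutting down, refuse new state-changing requests

	for (int i = 0; i < 1000; ++i) { // stand-in for iterating over config objects
		if (!wg.IsLockable()) {
			wg.Unlock();
			return 503; // abort early so Stop() isn't delayed
		}
		// ... process one object ...
	}

	wg.Unlock();
	return 200;
}
```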