poseidon

Author	SHA1	Message	Date
Maximilian Paß	70c108aebf	Unify the representation of the three dots.	2023-11-09 13:11:39 +01:00
Maximilian Paß	d0dd5c08cb	Remove usage of context.DeadlineExceeded for internal decisions as this error is strongly used by other packages. By checking such wrapped errors the internal decision can be influenced accidentally. In this case the retry mechanism checked if the error is context.DeadlineExceeded and assumed it would be created by the internal context. This assumption was wrong.	2023-10-31 15:49:56 +01:00
Maximilian Paß	6b69a2d732	Refactor Nomad Recovery from an approach that loaded the runners only once at the startup to a method that will be repeated i.e. if the Nomad Event Stream connection interrupts.	2023-10-31 15:49:56 +01:00
Maximilian Paß	b2898f9183	Fix List of the Environments with fetch. Before the List function dropped all idleRunners of all environments when fetch was set. Additionally, the replaced environment was not destroyed properly so that a goroutine for it and for all its idle runners remained running.	2023-10-31 15:49:56 +01:00
Maximilian Paß	59da36303c	Fix Goroutine Leak of Environment Get that was caused by creating an intermediate environment `fetchedEnvironment` when fetching the environments but not removing it in case that we just copy its configuration to the existing environment.	2023-09-11 13:44:29 +02:00
Maximilian Paß	e3161637a9	Extract the WatchEventStream retry mechanism into the utils including all other retry mechanisms. With this change we fix that the WatchEventStream goroutine does not stop directly when the context is done (but previously only one second after).	2023-09-11 13:44:29 +02:00
Maximilian Paß	13a9da95e5	Introduce a context for RetryExponential as second criteria (next to the maximum number of attempts) for canceling the retrying. This is required as we started with the previous commit to retry the nomad environment recovery. This always fails for unit tests (as they are not connected to an Nomad cluster). Before, we ignored the one error but the retrying leads to unit test timeouts. Additionally, we now stop retrying to create a runner when the environment got deleted.	2023-08-18 09:28:23 +02:00
Maximilian Paß	73759f8a3c	Retry Environment Recovery	2023-08-18 09:28:23 +02:00
Maximilian Paß	2650efbb38	Sentry Tracing Identifier	2023-02-03 10:29:18 +00:00
Maximilian Paß	f2c205a8ed	Add additional performance spans	2023-02-03 10:29:18 +00:00
Maximilian Paß	0c6c48c3cf	#190 Add unit tests for runner recovery.	2022-11-26 13:33:44 +00:00
Maximilian Paß	7119f3e012	Fix not canceling monitoring events for removed environments and runners.	2022-10-24 13:15:14 +02:00
Maximilian Paß	1eef26cc83	Add environment id to periodical monitoring events.	2022-08-20 09:17:43 +02:00
Maximilian Paß	5590c50e14	#110 Add periodical monitoring events.	2022-08-19 20:48:46 +02:00
Sebastian Serth	021530d5a7	Apply GoFmt fixes	2022-08-10 19:34:05 +02:00
Maximilian Paß	18daa1152c	Save the environment id for runner monitoring.	2022-07-31 19:42:35 +02:00
Maximilian Paß	498e8f5ff5	#110 Refactor influxdb monitoring to use it as singleton. This enables the possibility to monitor processes that are independent of an incoming request.	2022-07-01 15:29:31 +02:00
Maximilian Paß	34040162c2	#89 Generalise the three Storage interfaces and structs into one generic storage manager.	2022-06-29 16:21:19 +02:00
Maximilian Paß	136f596dc2	Add aws environments to the statistics but only with the field usedRunners.	2022-04-09 16:35:53 +02:00
Maximilian Paß	2cf890ab91	Implement review comments	2022-02-28 14:54:40 +01:00
Maximilian Paß	0ef5a4e39f	Make Execution Environment interface Nomad independent	2022-02-28 14:54:40 +01:00
Maximilian Paß	ba43f667c2	Add architecture for multiple managers using the chain of responsibility pattern.	2022-02-28 14:54:40 +01:00

22 Commits