Commit graph

6 commits

Author SHA1 Message Date
Moritz 'e1mo' Fromm
d7d6b7e6ef
hosts/hamilton: init 2023-08-12 11:48:39 +02:00
Moritz 'e1mo' Fromm
d8d2997bb3
Use full key fingerprint for e1mo in sops 2023-07-23 16:02:25 +02:00
Moritz 'e1mo' Fromm
b5ca65d9a0
Add mommes GPG key to sops 2023-07-23 16:01:04 +02:00
Moritz 'e1mo' Fromm
5442317071
Remove n0emis from sops
Follow up to cd026b118c
2023-07-17 12:20:02 +02:00
Moritz 'e1mo' Fromm
d199834a61
Add adb and admin htpasswd user
Also updated instructions for editing the .htpasswd
2023-01-06 15:51:22 +01:00
Moritz 'e1mo' Fromm
3acc1865c0
services/monitoring: Setup
The goal is to create a monitoring setup where each server monitors
itself when it comes failing systemd services, disk or RAM filling up,
…. In addition each prometheus will monitor remote prometheus and
alertmanager instances for signs of failure (e.g. being unreachable,
errors in notification delivery, dropping alerts).

A lot of metrics (especially histograms from prometheus or alertmanager)
are being dropped before ingestion to disk save on space and memory.

Depending on how many servers we may or may not have in the future this
could probably use some kind of overhaul since we rightnow have n^2
monitoring peer relationships (not even speaking of possible duplicated
alerts).
2023-01-06 15:51:22 +01:00