grafana

Author	SHA1	Message	Date
Seunghun Shin	c784de6ef5	Alerting: Add compressed periodic save for alert instances (#111803 ) What is this feature? This PR implements compressed periodic save for alert state storage, providing a more efficient alternative to regular periodic saves by grouping alert instances by rule UID and storing them using protobuf and snappy compression. When enabled via the state_compressed_periodic_save_enabled configuration option, the system groups alert instances by their alert rule, compresses each group using protobuf serialization and snappy compression, and processes all rules within a single database transaction at specified intervals instead of syncing after every alert evaluation cycle. Why do we need this feature? During discussions in PR #111357, we identified the need for a compressed approach to periodic alert state storage that could further reduce database load beyond the jitter mechanism. While the jitter feature distributes database operations over time, this compressed periodic save approach reduces the frequency of database operations by batching alert state updates at explicitly declared intervals rather than syncing after every alert evaluation cycle. This approach provides several key benefits: - Reduced Database Frequency: Instead of frequent sync operations tied to alert evaluation cycles, updates occur only at configured intervals - Storage Efficiency: Rule-based grouping with protobuf and snappy compression significantly reduces storage requirements The compressed periodic save complements the existing jitter mechanism by providing an alternative strategy focused on reducing overall database interaction frequency while maintaining data integrity through compression and batching. Who is this feature for? - Platform/Infrastructure teams managing large-scale Grafana deployments with high alert cardinality - Organizations looking to optimize storage costs and database performance for alerting workloads - Production environments with 1000+ alert rules where database write frequency is a concern	2025-11-07 11:51:48 +01:00
Pepe Cano	7ed46fd321	docs(alerting): `alertingSaveStateCompressed` is enabled by default (#111897 )	2025-10-03 09:02:58 +02:00
Seunghun Shin	512c292e04	Alerting: Add jitter support for periodic alert state storage to reduce database load spikes (#111357 ) What is this feature? This PR implements a jitter mechanism for periodic alert state storage to distribute database load over time instead of processing all alert instances simultaneously. When enabled via the state_periodic_save_jitter_enabled configuration option, the system spreads batch write operations across 85% of the save interval window, preventing database load spikes in high-cardinality alerting environments. Why do we need this feature? In production environments with high alert cardinality, the current periodic batch storage can cause database performance issues by processing all alert instances simultaneously at fixed intervals. Even when using periodic batch storage to improve performance, concentrating all database operations at a single point in time can overwhelm database resources, especially in resource-constrained environments. Rather than performing all INSERT operations at once during the periodic save, distributing these operations across the time window until the next save cycle can maintain more stable service operation within limited database resources. This approach prevents resource saturation by spreading the database load over the available time interval, allowing the system to operate more gracefully within existing resource constraints. For example, with 200,000 alert instances using a 5-minute interval and 4,000 batch size, instead of executing 50 batch operations simultaneously, the jitter mechanism distributes these operations across approximately 4.25 minutes (85% of 5 minutes), with each batch executed roughly every 5.2 seconds. This PR provides system-level protection against such load spikes by distributing operations across time, reducing peak resource usage while maintaining the benefits of periodic batch storage. The jitter mechanism is particularly valuable in resource-constrained environments where maintaining consistent database performance is more critical than precise timing of state updates.	2025-09-29 11:22:36 +02:00
Alexander Akhmetov	c827ddf790	Alerting: Add meta-monitoring documentation for GRAFANA_ALERTS (#108785 )	2025-07-29 21:54:44 +02:00
renovate[bot]	c94f930950	Update dependency prettier to v3.6.2 (#108689 ) * Update dependency prettier to v3.6.2 * run prettier --------- Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com> Co-authored-by: Ashley Harrison <ashley.harrison@grafana.com>	2025-07-25 17:47:44 +01:00
Pepe Cano	61efc8b609	docs(alerting): clarify usage of different Alertmanagers and fix misleading details (#107498 ) * docs(alerting): clarify usage of different Alertmanagers and fix misleading details * address review changes	2025-07-02 21:46:29 +02:00
Pepe Cano	f5b79fca55	docs(alerting): performance considerations minor clarifications (#107333 )	2025-06-30 07:42:26 +00:00
Alexander Akhmetov	f4b0e793aa	Alerting: Document label sanitization in GRAFANA_ALERTS (#107285 ) * Alerting: Document label sanitization in GRAFANA_ALERTS	2025-06-27 23:33:42 +02:00
Pepe Cano	f14492baf8	docs(alerting): prom backend to write ALERTS metric (#107006 ) * docs(alerting): prom backend to write ALERTS metric * add enterprise label * Update docs/sources/alerting/set-up/configure-alert-state-history/index.md Co-authored-by: Alexander Akhmetov <me@alx.cx> * Update docs/sources/alerting/set-up/configure-alert-state-history/index.md Co-authored-by: Alexander Akhmetov <me@alx.cx> * Update docs/sources/alerting/set-up/configure-alert-state-history/index.md Co-authored-by: Alexander Akhmetov <me@alx.cx> * Update docs/sources/alerting/set-up/configure-alert-state-history/index.md Co-authored-by: Alexander Akhmetov <me@alx.cx> * Update docs/sources/alerting/set-up/configure-alert-state-history/index.md Co-authored-by: Alexander Akhmetov <me@alx.cx> * Update docs/sources/alerting/set-up/meta-monitoring.md Co-authored-by: Alexander Akhmetov <me@alx.cx> * Update docs/sources/alerting/set-up/meta-monitoring.md Co-authored-by: Alexander Akhmetov <me@alx.cx> * Update docs/sources/alerting/set-up/meta-monitoring.md Co-authored-by: Alexander Akhmetov <me@alx.cx> * Update docs/sources/alerting/set-up/meta-monitoring.md Co-authored-by: Alexander Akhmetov <me@alx.cx> * Update docs/sources/alerting/set-up/meta-monitoring.md Co-authored-by: Alexander Akhmetov <me@alx.cx> * Update docs/sources/alerting/set-up/configure-alert-state-history/index.md Co-authored-by: Alexander Akhmetov <me@alx.cx> * Update docs/sources/alerting/set-up/configure-alert-state-history/index.md Co-authored-by: Alexander Akhmetov <me@alx.cx> * Update docs/sources/alerting/set-up/meta-monitoring.md Co-authored-by: Alexander Akhmetov <me@alx.cx> * Update docs/sources/alerting/set-up/meta-monitoring.md Co-authored-by: Alexander Akhmetov <me@alx.cx> --------- Co-authored-by: Alexander Akhmetov <me@alx.cx>	2025-06-24 10:38:52 +02:00
Jack Baldry	244ffad99d	Fix all the old usage of admonition syntax (#106984 )	2025-06-19 17:31:13 +01:00
Vadim Stepanov	5137995830	Alerting: Add support for Redis Sentinel for Alerting HA (#106322 ) * Alerting: Add support for Redis Sentinel * docs * docs * Use minisentinel in test * Apply suggestions from code review Co-authored-by: Johnny Kartheiser <140559259+JohnnyK-Grafana@users.noreply.github.com> Co-authored-by: Fayzal Ghantiwala <114010985+fayzal-g@users.noreply.github.com> * "address(es)" -> "address or addresses" * make update-workspace * make lint-go-diff --------- Co-authored-by: Johnny Kartheiser <140559259+JohnnyK-Grafana@users.noreply.github.com> Co-authored-by: Fayzal Ghantiwala <114010985+fayzal-g@users.noreply.github.com>	2025-06-05 15:02:40 +01:00
Pepe Cano	b2d317cc23	docs(alerting): add notes for Redis HA setup (#106144 ) * docs(alerting): add note for Redis HA setup * run prettier	2025-05-29 11:21:32 -05:00
Pepe Cano	f023fcc68a	docs(alerting): New Alertmanager contact point docs (#103782 )	2025-04-21 12:08:41 +02:00
Yuri Tseretyan	1bafd5c807	Docs: Remove mention of alertingApiServer flag from alerting documentation (#104131 )	2025-04-18 11:51:38 -04:00
Moustafa Baiou	b604fdf6f8	Alerting: Update docs for RBAC (#104005 ) * Alerting: Update docs for RBAC This updates the documentation for RBAC to match the changes from `032299011a` * add note about RBAC permissions with datasource permissions * writers toolkit update	2025-04-15 15:34:19 -04:00
William Wernert	a8f60de620	Alerting: Remove feature toggles relating to Loki Alert State History (#103540 ) * Remove feature toggles relating to Loki Alert State History	2025-04-08 09:50:27 -04:00
Pepe Cano	d44a9953d3	docs(alerting): add missing port setting for the HA k8s example (#103017 )	2025-03-27 22:05:41 +01:00
Johnny Kartheiser	558773ed7f	docs: add note about alert migration (#102741 ) * docs: add note about alert migration added note about alert migration in 11.6.0. * docs: add note about alert migration adding documentation about the migrations to the performance limitation docs * title edit <-- vale = NO -->	2025-03-26 15:39:32 -05:00
Alexander Akhmetov	ef5cc12b33	Alerting: Add HMAC signature config to the webhook integration (#100960 ) Adds HMAC-SHA256 signature support to webhook notifications, providing a way to verify the authenticity and integrity of webhook requests. The implementation allows to specify the header in which the signature will be sent. The signature is calculated from the request body. An optional timestamp header name can be provided. If set, the HMAC signature will be generated by concatenating the timestamp, a ":" and the request body: {timestamp}:{body}. The timestamp will also be sent in the provided header name.	2025-03-14 07:22:41 +01:00
Pepe Cano	5bfe046da9	docs(alerting): clarify behaviour when provisioning the policy tree (#101937 )	2025-03-11 15:58:25 +01:00
Robby Milo	13cf67de53	Remove relref shortcodes (#101694 ) * manually replce all shared relrefs * relref replace - grafana next * Merge branch 'master' into robbymilo/relref-replace-grafana-next * manual fixes * remove ref shortcode * Merge branch 'master' into robbymilo/relref-replace-grafana-next * prettier * fix test * update readme	2025-03-06 13:59:08 +01:00
Alexander Akhmetov	a9ce930634	Alerting: Promote alertingSaveStateCompressed flag to public preview (#99935 )	2025-02-06 18:09:43 +01:00
Garret Wyman	cf177776bf	Alerting: Adding color option for slack receiver (#99615 )	2025-01-30 00:12:16 +02:00
Pepe Cano	6178320257	Alerting docs/internal: leaf bundle page becomes a branch bundle (#98391 )	2024-12-30 11:49:11 +01:00
Pepe Cano	6976908597	Alerting docs: update configuring and using additional Alertmanagers in Grafana Alerting (#98363 ) * Alerting docs: Alertmanager data source * Alertmanager data source: minor intro changes * Update `Intro > Notifications` * Complete Configure alertmanager update * Configure alertmanager * minor final edits	2024-12-27 10:16:31 +01:00
Pepe Cano	001cc853a5	Alerting docs: update `Monitor status` section (#98179 ) * Minor updates to `Declare incidents` docs * change URL * View alert state history + restructuring * Complete alert state * alert state * change heading * View alert rules * Monitor alerts	2024-12-18 13:23:15 +01:00
Alexander Akhmetov	1f8f9a45d7	Alerting: Add state_periodic_save_batch_size config option (#98019 ) * Alerting: Add state_periodic_save_batch_size config option --------- Co-authored-by: brendamuir <100768211+brendamuir@users.noreply.github.com>	2024-12-16 15:30:38 +01:00
brendamuir	52f6f69d4b	Alerting docs: adds new export alert rule definition (#97028 ) * Alerting docs: adds new export alert rule definition * update * Update button text for exporting new rule definition * Update docs/sources/alerting/set-up/provision-alerting-resources/export-alerting-resources/index.md Co-authored-by: Sonia Aguilar <33540275+soniaAguilarPeiron@users.noreply.github.com> * Update docs/sources/alerting/set-up/provision-alerting-resources/export-alerting-resources/index.md Co-authored-by: Sonia Aguilar <33540275+soniaAguilarPeiron@users.noreply.github.com> * definition * type * definition 2 * pepes feedback * removes will --------- Co-authored-by: Sonia Aguilar <soniaaguilarpeiron@gmail.com> Co-authored-by: Sonia Aguilar <33540275+soniaAguilarPeiron@users.noreply.github.com>	2024-11-26 19:20:55 +01:00
Pepe Cano	706300e9b7	Alerting: notification template group (#96447 ) Co-authored-by: brendamuir <100768211+brendamuir@users.noreply.github.com> Co-authored-by: Gilles De Mey <gilles.de.mey@gmail.com>	2024-11-22 14:40:20 +02:00
Pepe Cano	154a2e0d06	Alerting docs: Move `Meta monitoring` to `Additional configuration` section (#96150 )	2024-11-11 11:08:12 +01:00
Pepe Cano	b953190328	Alerting docs: reuse `Additional configuration` page for Cloud docs (#96101 )	2024-11-08 16:03:47 +01:00
brendamuir	86bc087257	Alerting docs: changes advanced to additional (#96083 )	2024-11-08 13:29:10 +01:00
brendamuir	b2af163dc5	Alerting docs: adds advanced config section (#96013 ) * Alerting docs: adds advanced config section * corrects ref * feedback from pepe * renames detect and respond	2024-11-07 15:59:00 +01:00
Alessandro Chitarrini	c490b29d34	Update Discord contact point documentation for use_discord_username type (#95011 )	2024-11-04 10:02:19 +00:00
Yuri Tseretyan	c5bad9f843	Alerting: Update documentation to include new permissions for routes (#95437 ) * update documentation * Update index.md	2024-10-30 10:09:32 +01:00
Tito Lins	71d04a326b	Alerting: Support tls config for webhook receiver (#93513 ) Adds the ability to configure tls settings on the webhook receiver (e.g. to skip server certificate validation)	2024-10-22 12:44:32 +02:00
Yuri Tseretyan	ced5497ba1	Docs: Update alerting notifications documentation (#93944 ) * add new permissions and fixed roles * Apply suggestions from code review Co-authored-by: brendamuir <100768211+brendamuir@users.noreply.github.com> * address comments * add actions to complete list * fmt --------- Co-authored-by: brendamuir <100768211+brendamuir@users.noreply.github.com>	2024-10-15 16:16:15 +02:00
Pepe Cano	75900139ae	Alerting docs: update `Configure Alertmanagers` (#93712 ) * Alerting docs: update `Configure Alertmanagers` - clarify alertmanager types - specify that each alertmanager manages its own alerting resources * Update docs/sources/alerting/set-up/configure-alertmanager/index.md Co-authored-by: brendamuir <100768211+brendamuir@users.noreply.github.com> * Update docs/sources/alerting/set-up/configure-alertmanager/index.md Co-authored-by: brendamuir <100768211+brendamuir@users.noreply.github.com> * Update docs/sources/alerting/set-up/configure-alertmanager/index.md Co-authored-by: brendamuir <100768211+brendamuir@users.noreply.github.com> * use `Alertmanager` - capitalize the first letter * Small copy changes * Minor cosmetic updates to `Add an Alertmanager` section * Update docs/sources/alerting/set-up/configure-alertmanager/index.md Co-authored-by: brendamuir <100768211+brendamuir@users.noreply.github.com> --------- Co-authored-by: brendamuir <100768211+brendamuir@users.noreply.github.com>	2024-09-25 12:44:17 +02:00
brendamuir	5a1a3204c9	Alerting docs: adds ha-advertise-address to alerting docs (#93303 ) * Alerting docs: adds ha-advertise-address to alerting docs * Change description of setting [ha_advertise_address] --------- Co-authored-by: Yuri Tseretyan <yuriy.tseretyan@grafana.com>	2024-09-23 19:14:34 +02:00
Alexander Akhmetov	e59ea00518	Alerting: Add TLS, QoS and retain options to the MQTT receiver (#92331 )	2024-09-17 21:11:16 +02:00
brendamuir	98f766a50d	Alerting docs: adds recording rule info (#93204 ) * Alerting docs: adds recording rule info * ran prettier * Updates with feedback from pepe and removes external reference * couple of minor edits * removes reference * feedback from sonia * adds links per gilles * adds correct version link	2024-09-12 12:06:23 +02:00
Pepe Cano	7eb7b51dce	Docs: update IAM links for Cloud docs (#93147 )	2024-09-10 09:33:45 +02:00
Pepe Cano	15de549093	Alerting docs: HA - configure alertmanager to prevent duplicated notifications (#92611 ) * Alerting docs: HA - configure shared alertmanager * Update docs/sources/alerting/set-up/configure-high-availability/_index.md Co-authored-by: brendamuir <100768211+brendamuir@users.noreply.github.com> * Update docs/sources/alerting/set-up/configure-high-availability/_index.md Co-authored-by: brendamuir <100768211+brendamuir@users.noreply.github.com> * Apply minor content suggestion --------- Co-authored-by: brendamuir <100768211+brendamuir@users.noreply.github.com>	2024-09-03 06:31:09 +02:00
brendamuir	667cbd626c	Alerting docs: adds display names to fixed roles for RBAC (#92357 ) * Alerting docs: adds display names to fixed roles for RBAC * ran prettier * updates descriptions * preposition update * ran prettier	2024-08-27 09:23:04 +02:00
Pepe Cano	2ba930ab1f	Alerting Docs: Monitor your high availability setup (#92063 ) * Alerting Docs: Monitor your high availability setup * Update docs/sources/alerting/set-up/configure-high-availability/_index.md Co-authored-by: brendamuir <100768211+brendamuir@users.noreply.github.com> * Update docs/sources/alerting/set-up/configure-high-availability/_index.md Co-authored-by: brendamuir <100768211+brendamuir@users.noreply.github.com> * Update docs/sources/alerting/set-up/configure-high-availability/_index.md Co-authored-by: brendamuir <100768211+brendamuir@users.noreply.github.com> * Update docs/sources/alerting/set-up/configure-high-availability/_index.md Co-authored-by: brendamuir <100768211+brendamuir@users.noreply.github.com> * Shorten links * Update/reorder a bit the description about alertmanager gossiping * Update `alertmanager_peer_position` description --------- Co-authored-by: brendamuir <100768211+brendamuir@users.noreply.github.com>	2024-08-27 08:01:14 +02:00
Alexander Akhmetov	832bb01f36	Alerting: Add MQTT notifications receiver (#91487 ) * Alerting: Add MQTT notifications receiver * Update alerting to 9daa6239cc41dc42bff0e916c8d0d27766caa8b9 (main) --------- Co-authored-by: Jack Baldry <jack.baldry@grafana.com> Co-authored-by: brendamuir <100768211+brendamuir@users.noreply.github.com>	2024-08-22 16:47:48 +02:00
brendamuir	f833b782b6	Alerting docs: adds silence RBAC 11.1 (#89176 ) * Alerting docs: adds silence RBAC 11.1 * ran prettier * Improve docs with new rule-specific silence RBAC information * Apply suggestions from code review Co-authored-by: Jack Baldry <jack.baldry@grafana.com> * Apply suggestions from code review Co-authored-by: Jack Baldry <jack.baldry@grafana.com> * prettier --------- Co-authored-by: Matt Jacobson <matthew.jacobson@grafana.com> Co-authored-by: Jack Baldry <jack.baldry@grafana.com>	2024-06-27 10:10:34 +02:00
Jacob Valdemar	eb76ea47a0	Alerting: Add ha_reconnect_timeout configuration option (#88823 ) * Docs: Update "Configure high availability" guide with ha_reconnect_timeout configuration --------- Co-authored-by: Christopher Moyer <35463610+chri2547@users.noreply.github.com>	2024-06-11 13:25:48 -04:00
Pepe Cano	3af72bdeee	Alerting docs: fix `Alertmanagers` title (#88625 )	2024-06-04 09:10:28 +02:00
Pepe Cano	68c44f1dd9	Alerting docs: Update Alertmanager docs (#88567 ) * Remove alertmanager page and set redirects * Update internal alertmanager links * Update `Alertmanager` docs * Change heading to `Configure Alertmanagers`	2024-06-03 11:00:52 +02:00

1 2 3 4

171 Commits