grafana

Author	SHA1	Message	Date
Moustafa Baiou	0d2ee90ff1	Alerting: Fix copying of recording rule fields Recording rule fields were not being copied correctly when duplicating an alert rule. This manifests as missing `TargetDataSourceUID` fields from the `Record` part of the rule when rules in a group are re-ordered. Added some additional tests to ensure we cover the generation of recording rules in tests and fixed the copying logic to ensure all fields are copied correctly. (cherry picked from commit `c73b3ccf6e`)	2025-09-03 10:00:22 -04:00
grafana-delivery-bot[bot]	b9ee6bae38	[release-12.0.1] Alerting: Ensure field validators return the proper type (#104243) Alerting: Ensure field validators return the proper type (#104050) * Ensure field validators return the proper type This ensures correct error propagation through services up to the API layer. * Move error wrapping up to call site (cherry picked from commit `820c338414`) Co-authored-by: William Wernert <william.wernert@grafana.com>	2025-04-21 21:17:40 +01:00
Mariell Hoversholm	757be6365a	CI: Bump golangci-lint to 2.0.2 (#103572)	2025-04-10 14:42:23 +02:00
Yuri Tseretyan	dc0083d879	Alerting: Sequential evaluation of rules in group (#98829) * introduce RulesGroupComparer * extract runJob method * implement sequential evaluation * Make sequence building testable & add comments * Also run callback in recording rules + add tests * Improve tests * Address PR comments --------- Co-authored-by: William Wernert <william.wernert@grafana.com>	2025-04-02 23:10:32 +03:00
maicon	d8c5c2d3b8	K8s: Folders: Modify GetChildren to return only Folder References (#103072) * Return FolderReference instead of Folder on GetChildren Signed-off-by: Maicon Costa <maiconscosta@gmail.com> --------- Signed-off-by: Maicon Costa <maiconscosta@gmail.com>	2025-04-02 01:30:17 -03:00
Alexander Akhmetov	f49a88ab72	Alerting: Add MissingSeriesEvalsToResolve to the APIs (#102150) What is this feature? A follow-up for #101184, adds AlertRule.MissingSeriesEvalsToResolve to the APIs. missing_series_evals_to_resolve must be specified too and it must be > 0. POST /api/ruler/grafana/api/v1/rules/{folderUID} works in the following way: if missing_series_evals_to_resolve is not sent or is null, the rule keeps its existing value; if missing_series_evals_to_resolve > 0, the rule is updated to that value; if missing_series_evals_to_resolve = 0, the value is reset to the default (nil). AlertRule.MissingSeriesEvalsToResolve can't be 0, so 0 is used as the reset signal. In the Provisioning API, the value is set only if it is present and > 0; otherwise it is reset. PUT to /api/v1/provisioning/alert-rules/{UID}: if missing_series_evals_to_resolve is nil, it is reset to the default value; if missing_series_evals_to_resolve > 0, it is updated.	2025-03-26 13:34:53 +01:00
Alexander Akhmetov	f7aa17f2e4	Alerting: Add default values to AlertRule.Data queries in Prometheus conversion (#102843) What is this feature? Prometheus conversion: ensures that AlertRule.Data queries always have default parameters set (intervalMs, maxDataPoints). Without this, updates of the same rule can cause version increments. Why do we need this feature? Currently, when converting Prometheus rules to Grafana alerts, some default parameters are not explicitly set in the query model. This creates a problem during rule updates: When a user updates a rule that hasn't changed, we still detect differences in the AlertQuery.Model because the newly converted rules are missing the default fields, such as intervalMs and maxDataPoints. This causes unnecessary version increments of alert rules.	2025-03-26 11:46:49 +01:00
Yuri Tseretyan	e39b17d701	Alerting: Remove constraints for uniqueness of rule title (#102067) * fix having duplicated names in same group in the UI --------- Co-authored-by: Sonia Aguilar <soniaaguilarpeiron@gmail.com>	2025-03-18 13:27:44 -04:00
Alexander Akhmetov	695ac91290	Alerting: Add backend support for keep_firing_for (#100750) What is this feature? This PR introduces a new alert rule configuration option, keep_firing_for (Prometheus documentation). keep_firing_for prevents alerts from resolving immediately after the alert condition returns to normal. Instead, they transition into a "Recovering" state and are not considered resolved by the Alertmanager. Once the recovery period ends (or after the next evaluation if it is bigger than keep_firing_for), the alert transitions to "Normal" if it doesn't start alerting again. Before: Alerting ----> Normal. After: Alerting ----> Recovering ----> Normal. Why do we need this feature? This feature prevents flapping alerts by adding a recovery period. This helps avoid false resolutions caused by brief alert	2025-03-18 11:24:48 +01:00
Alexander Akhmetov	7dd6f52630	Alerting: Add MissingSeriesEvalsToResolve option to the AlertRule (#101184)	2025-03-11 22:12:06 +01:00
Steve Simpson	b7dcfcedcb	Alerting: Extend recording rule definitions/interfaces with data source. (#101678) Extend the recording rule definition to include the target data source, allowing configuration of where the output of the recording rule is written to. Also extends the relevant interfaces in preparation for the next set of changes.	2025-03-06 14:09:17 +01:00
Alexander Akhmetov	d44728f4e5	Alerting: Metric to count imported from Prometheus rules (#100847)	2025-03-05 14:02:28 +01:00
Yuri Tseretyan	879b121136	Alerting: Add GUID to alert rule tables (#101321) * add column guid to alert rule table and rule_guid to rule version table + populate the new field with UUID * update storage and domain models * patch GUID * ignore GUID in fingerprint tests	2025-02-28 09:47:25 -05:00
Alexander Akhmetov	ae2074ef55	Alerting: Fix updating Prometheus definition in the metadata (#101440) Initially, Metadata had only the EditorSettings, and HasMetadata was used to understand if the incoming update request had metadata in the body because it could be omitted if it was empty. For example, when the rule is updated via the provisioning API or has only false values. If it was in the request, we used that; if not, we used the metadata from the existing rule from the database. If the rule was updated via the AlertRuleService, we didn't change Metadata at all if the rule already existed. But now, Metadata also has the Prometheus rule definition, and we always need to update it with the new version of the AlertRuleService when the rule exists in the DB and has the same UID. HasMetadata is renamed to HasEditorSettings to keep the old behaviour only for EditorSettings. Now, the provisioning API and the conversion API will overwrite everything except EditorSettings with the new data.	2025-02-28 13:11:49 +02:00
Alexander Akhmetov	6eb335a8ce	Alerting: API to read rule groups using mimirtool (#100674)	2025-02-25 15:49:08 +01:00
Alexander Akhmetov	b641fd64f9	Alerting: API to create rule groups using mimirtool (#100558) What is this feature? Adds an API endpoint to create alert rules with mimirtool: - POST /convert/prometheus/config/v1/rules/{NamespaceTitle} - Accepts a single rule group in a Prometheus YAML format and creates or updates a Grafana rule group from it. The endpoint uses the conversion package from #100224. Key parts The API works similarly to the provisioning API. If the rule does not exist, it will be created, otherwise updated. Any rules not present in the new group will be deleted, ensuring the group is fully synchronized with the provided configuration. Since the API works with namespace titles (folders), the handler automatically creates a folder in the root based on the provided title if it does not exist. It also requires a special header, X-Grafana-Alerting-Datasource-UID. This header specifies which datasource to use for the new rules. If the rule group's evaluation interval is not specified, it uses the DefaultRuleEvaluationInterval from settings.	2025-02-25 11:26:36 +01:00
Matthew Jacobson	b78a63b0ad	Alerting: Use new image TokenProvider and send image url in annotation (#99989) * Send new annotation containing image url * Use new image TokenProvider with TokenStore New abstraction GetImage no longer needs to support parsing both token and url from annotations, as remote AM will use the new URLProvider. Instead, we use the new generic TokenProvider and give it a TokenStore backed by the grafana database. That means we revert back to always using token simplifying code and security considerations. * Upgrade grafana/alerting to merged commit SHA	2025-02-20 12:47:40 -05:00
Alexander Akhmetov	3cc4320aa9	Alerting: Add rule conversion package (#100224)	2025-02-12 19:38:48 +02:00
Yuri Tseretyan	4cac3158c7	Alerting: Fix alert rule copy to include metadata (#100212) * copy metadata * add tests for copy and generator * extract copy rule to a production method and update usages * fix tests	2025-02-11 09:46:02 -05:00
Moustafa Baiou	7dee4d1808	Alerting: Allow specifying uid for new rules added to groups (#99858) When modifying rule groups the `uid` can be specified but only if the rule already existed in the DB. If the rule is new the update would be rejected. This updates the RuleGroup provisioning apis to allow specifying the `uid` when creating/updating rule groups. Additionally, the RuleGroupIdx was not being updated when rules were reordered in the group. Context: https://github.com/grafana/terraform-provider-grafana/pull/1971#issuecomment-2599223897 Relates to: https://github.com/grafana/terraform-provider-grafana/issues/1928 Fixes: #98283	2025-02-10 10:28:34 -05:00
Yuri Tseretyan	1b8db233a7	Alerting: Rule Version API to Ignore versions without diff (#100093)	2025-02-10 09:20:35 -05:00
Yuri Tseretyan	68f1730461	Alerting: set updated_by for system owned operations (#100068)	2025-02-04 14:23:15 -05:00
Yuri Tseretyan	ac41c19350	Alerting: Rule version history API (#99041) * implement store method to read rule versions * implement request handler * declare a new endpoint * fix fake to return correct response * add tests * add integration tests * rename history to versions * apply diff from swagger CI step Signed-off-by: Yuri Tseretyan <yuriy.tseretyan@grafana.com> --------- Signed-off-by: Yuri Tseretyan <yuriy.tseretyan@grafana.com>	2025-02-03 13:26:18 -05:00
Garret Wyman	cf177776bf	Alerting: Adding color option for slack receiver (#99615)	2025-01-30 00:12:16 +02:00
Moustafa Baiou	b820fd6bef	Alerting: Fix Alertmanager configuration updates (#99610) * Alerting: Fix Alertmanager configuration updates Alertmanager configuration updates would behave inconsistently when performing no-op updates with `mysql` as the store. In particular this bug manifested as a failure to reload the provisioned alertmanager configuration components with no changes to the configuration itself. This would result in a 500 error with mysql store only. The core issue is that we were relying on the number of rows affected by the update query to determine if the configuration was found in the db or not. While this behavior works for certain sql dialects, mysql does not return the number of rows matched by the update query but rather the number of rows actually updated. Also discovered and fixed the mismatched `xorm` tag for the `CreatedAt` field to match the actual column name in the db. References: https://dev.mysql.com/doc/refman/8.4/en/update.html	2025-01-29 23:00:45 +02:00
Yuri Tseretyan	92d6762a3a	Alerting: Store information about user that created\updated alert rule (#99395) * introduce new fields created_by in rule tables * update domain model and compat layer to support UpdatedBy * add alert rule generator mutators for UpdatedBy * ignore UpdatedBy in diff and hash calculation * Add user context to alert rule insert/update operations Updated InsertAlertRules and UpdateAlertRules methods to accept a user context parameter. This change ensures auditability and better tracking of user actions when creating or updating alert rules. Adjusted all relevant calls and interfaces to pass the user context accordingly. * set UpdatedBy in PreSave because this is where Updated is set * Use nil userID for system-initiated updates This ensures differentiation between system and user-initiated changes for better traceability and clarity in update origins. --------- Signed-off-by: Yuri Tseretyan <yuriy.tseretyan@grafana.com>	2025-01-24 12:09:17 -05:00
Matthew Jacobson	a6dffd7552	Upgrade grafana/alerting to 209e052dba64 (#99118) Update grafana/alerting to 209e052dba64 Includes: - Add NoopDecode function for non-base64-encoded secrets (#264) - Log duplicated receivers (#265)	2025-01-17 21:53:41 +02:00
Santiago	f60caf6932	Alerting: Fix alert rules unpausing after moving rule to different folder (#97580) Alerting: Fix alert rules unpaused after moving rule to different folder	2024-12-06 14:33:13 -03:00
Nihal	e73bb34cc0	Alerting: Fix Conflicting Alert Rule Response Has Wrong 'rule_uid' (#95013) * change to return the right conflicting alert rule uid. see https://github.com/grafana/grafana/issues/89755 Signed-off-by: wasim-nihal <sswasim64@gmail.com> * correcting the code comment Signed-off-by: wasim-nihal <sswasim64@gmail.com> * changes to return the conflicting uid for both insert and update operations Signed-off-by: wasim-nihal <sswasim64@gmail.com> * changes to return verbose conflicting alert rule response payload Signed-off-by: wasim-nihal <sswasim64@gmail.com> * changes to return verbose conflicting alert rule response payload Signed-off-by: wasim-nihal <sswasim64@gmail.com> * Update pkg/services/ngalert/store/alert_rule.go Co-authored-by: Matthew Jacobson <JacobsonMT@gmail.com> --------- Signed-off-by: wasim-nihal <sswasim64@gmail.com> Co-authored-by: Matthew Jacobson <JacobsonMT@gmail.com>	2024-11-26 15:13:31 -05:00
Matthew Jacobson	64c93217ff	Alerting: Fix incorrect 500 code on missing alert rule dashboardUID / panelID (#96491)	2024-11-14 21:24:48 +02:00
Alexander Akhmetov	324503ee8b	Alerting: Add simplified_notifications_section field to the alert rule metadata (#95988)	2024-11-14 12:55:54 +01:00
Alexander Akhmetov	4ce1abc6f9	Alerting: Fix saving advanced mode toggle state in the alert rule editor (#95924)	2024-11-06 18:39:15 +01:00
William Wernert	0920e8bcc6	Alerting: Clear ignored fields of recording rules for API response (#95004) * Clear ignored fields of recording rules for API response * Move field clearing to compat function * Run make update-workspace * Cleanup changes	2024-10-19 01:03:12 +03:00
Alexander Akhmetov	0b804e720f	Alerting: Add RuleGroup field to ListAlertInstancesQuery struct (#94615) Alerting: add RuleGroup field to ListAlertInstancesQuery struct	2024-10-18 09:44:16 +02:00
Yuri Tseretyan	18e66d22b1	Alerting: Add more tracing for receivers service (#94572)	2024-10-11 11:41:13 -04:00
Alexander Akhmetov	0a4e6ff86b	Alerting: Add SaveAlertInstancesForRule instance store method (#94505) Alerting: Add SaveAlertInstancesForRule method to the InstanceStore interface	2024-10-11 13:47:44 +02:00
Matthew Jacobson	099055e8a5	Alerting: Verify receiver permission read on rule create/update (#94286) * Alerting: Verify receiver permission read on rule create/update	2024-10-04 23:52:38 +03:00
Alexander Weaver	393faa8732	Alerting: Move rule evaluation status logic out of prometheus API and into scheduler (#89141) * Add health fields to rules and an aggregator method to the scheduler * Move health, last error, and last eval time in together to minimize state processing * Wire up a readonly scheduler to prom api * Extract to exported function * Use health in api_prometheus and fix up tests * Rename health struct to status * Fix tests one more time * Several new tests * Handle inactive rules * Push state mapping into state manager * rename to StatusReader * Rectify cyclo complexity rebase * Convert existing package local status implementation to models one * fix tests * undo RuleDefs rename	2024-09-30 16:52:49 -05:00
Alexander Akhmetov	b9964865cb	Alerting: Copy alert rule metadata when the rule is updated via provisioning API (#93723) Alerting: Copy alert rule metadata when the rule is updated	2024-09-25 22:31:02 +02:00
Matthew Jacobson	1ede1e32b8	Alerting: Receiver resource permissions service (#93552)	2024-09-20 18:31:42 -04:00
William Wernert	f1ba7deff5	Alerting: Also clear fields in model/store validation for recording rules (#93506) * Fix model validation * Remove validation from provisioning service	2024-09-20 00:27:37 +03:00
Alexander Akhmetov	9f5b05f936	Alerting: Add metadata field with editor_settings to alert rule (#93245)	2024-09-19 16:43:41 +02:00
Matthew Jacobson	3bf77d2e05	Alerting: Include in-use metadata in k8s receiver LIST & GET (#93016) * Include in-use metadata in k8s receiver List & Get	2024-09-13 20:20:09 +03:00
Matthew Jacobson	ff6a20f54a	Alerting: Include access control metadata in k8s receiver LIST & GET (#93013) * Include access control metadata in k8s receiver List & Get * Add tests for receiver access * Simplify receiver access provisioning extension - prevents edge case infinite recursion - removes read requirement from create	2024-09-12 20:57:53 +03:00
Yuri Tseretyan	f8fa5286a1	Alerting: Introduce alert rule models in storage (#93187) * introduce storage model for alert rule tables * remove AlertRuleVersion from models because it's not used anywhere other than in storage * update historian xorm store to use alerting store to fetch rules * fix folder tests --------- Co-authored-by: Matthew Jacobson <matthew.jacobson@grafana.com>	2024-09-12 13:20:33 -04:00
Yuri Tseretyan	cb372d3fa8	Alerting: Support secrets in contact points nested fields (#92035) Back-end: * update alerting module * update GetSecretKeysForContactPointType to extract secret fields from nested options * Update RemoveSecretsForContactPoint to support complex settings * update PostableGrafanaReceiverToEmbeddedContactPoint to support nested secrets * update Integration to support nested settings in models.Integration * make sigv4 fields optional Front-end: * add UI support for encrypted subform fields * allow emptying nested secure fields * Omit non touched secure fields in POST payload when saving a contact point * Use SecretInput from grafana-ui instead of the new EncryptedInput * use produce from immer * rename mapClone * rename sliceClone * Don't use produce from immer as we need to delete the fields afterwards --------- Co-authored-by: Gilles De Mey <gilles.de.mey@gmail.com> Co-authored-by: Sonia Aguilar <soniaaguilarpeiron@gmail.com> Co-authored-by: Matt Jacobson <matthew.jacobson@grafana.com>	2024-09-10 22:26:23 -04:00
Matthew Jacobson	32f06c6d9c	Alerting: Receiver API complete core implementation (#91738) * Replace global authz abstraction with one compatible with uid scope * Replace GettableApiReceiver with models.Receiver in receiver_svc * GrafanaIntegrationConfig -> models.Integration * Implement Create/Update methods * Add optimistic concurrency to receiver API * Add scope to ReceiversRead & ReceiversReadSecrets migrates existing permissions to include implicit global scope * Add receiver create, update, delete actions * Check if receiver is used by rules before delete * On receiver name change update in routes and notification settings * Improve errors * Linting * Include read permissions are requirements for create/update/delete * Alias ngalert/models to ngmodels to differentiate from v0alpha1 model * Ensure integration UIDs are valid, unique, and generated if empty * Validate integration settings on create/update * Leverage UidToName to GetReceiver instead of GetReceivers * Remove some unnecessary uses of simplejson * alerting.notifications.receiver -> alerting.notifications.receivers * validator -> provenanceValidator * Only validate the modified receiver stops existing invalid receivers from preventing modification of a valid receiver. * Improve error in Integration.Encrypt * Remove scope from alert.notifications.receivers:create * Add todos for receiver renaming * Use receiverAC precondition checks in k8s api * Linting * Optional optimistic concurrency for delete * make update-workspace * More specific auth checks in k8s authorize.go * Add debug log when delete optimistic concurrency is skipped * Improve error message on authorizer.DecisionDeny * Keep error for non-forbidden errutil errors	2024-08-26 10:47:53 -04:00
Yuri Tseretyan	135f6571a9	Alerting: Update Time Interval service to support renaming of resources (#91856) * add RenameTimeIntervalInNotificationSettings to storage * update dependencies when the time interval is renamed --------- Co-authored-by: William Wernert <william.wernert@grafana.com>	2024-08-16 20:55:03 +03:00
Alexander Weaver	34ab5fe1f3	Alerting: Restart rule routines if the type changes (#90867) * Restart when types change * Wire up test hooks correctly * testing	2024-08-14 14:57:47 -05:00
Alexander Akhmetov	149f02aebe	Alerting: Add rule_group label to grafana_alerting_rule_group_rules metric (#88289) * Alerting: Add rule_group label to grafana_alerting_rule_group_rules metric (#62361) * Alerting: Delete rule group metrics when the rule group is deleted This commit addresses the issue where the GroupRules metric (a GaugeVec) keeps its value and is not deleted when an alert rule is removed from the rule registry. Previously, when an alert rule with orgID=1 was active, the metric was: grafana_alerting_rule_group_rules{org="1",state="active"} 1 However, after deleting this rule, subsequent calls to updateRulesMetrics did not update the gauge value, causing the metric to incorrectly remain at 1. The fix ensures that when updateRulesMetrics is called it also deletes the group rule metrics with the corresponding label values if needed.	2024-08-13 13:27:23 +02:00

253 Commits
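
Note on commit f49a88ab72 (MissingSeriesEvalsToResolve): the update rules listed for POST /api/ruler/grafana/api/v1/rules/{folderUID} amount to a three-way merge on an optional integer. The sketch below restates them in Go; `mergeMissingSeriesEvals` is a hypothetical helper, not a function from the Grafana codebase, and rejecting negative values is an assumption (the commit only states that the stored value cannot be 0).

```go
package main

import "errors"

// mergeMissingSeriesEvals is a hypothetical helper that mirrors the ruler API
// semantics described in the commit message:
//   - incoming == nil (not sent or null): keep the existing value
//   - *incoming == 0: reset to the default, represented as nil
//   - *incoming > 0: use the new value
//
// Rejecting negative values is an assumption made for this sketch.
func mergeMissingSeriesEvals(existing, incoming *int) (*int, error) {
	switch {
	case incoming == nil:
		return existing, nil
	case *incoming < 0:
		return existing, errors.New("missing_series_evals_to_resolve must not be negative")
	case *incoming == 0:
		return nil, nil
	default:
		return incoming, nil
	}
}
```

Per the same commit message, the Provisioning API differs in one case: a nil value in a PUT to /api/v1/provisioning/alert-rules/{UID} resets the field to the default rather than keeping the existing value.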