ObserveNow
Alerting
Overview
in observability systems, alerting serves a critical function as the proactive element that connects data collection to action opsverse observenow uses grafana as the primary alerting tool support for prometheus alertmanager too is available here are some important features supported by the alerting component comprehensive telemetry support the alerting component is designed to support alerts based on all types of telemetry data, providing a unified alerting experience across various observability signals metrics based alerts alerts can be configured using time series data from various metrics data sources this includes support for threshold based alerts, rate of change alerts, and alerts based on anomalies log based alerts alerts can be triggered based on log data, allowing for detection of specific events or patterns in log streams this functionality supports alerting on log volume, specific log messages, or complex log patterns use cases may include alerting on application errors, security events, or system level issues captured in logs apm (application performance monitoring) alerts alerts can be set up based on application performance data, providing insights into the behavior and health of applications this includes alerting on metrics such as response times, error rates, and transaction volumes apm alerts can help identify performance bottlenecks, slow database queries, or degraded user experiences trace based alerts while less common, alerts can also be configured based on distributed tracing data this allows for alerting on service dependencies, latency between services, or specific error conditions in trace spans since opsverse uses clickhouse as the storage engine for traces data, sql can be used to access this data for creating alerts multi signal alerts grafana's alerting system allows for the creation of complex alerts that combine multiple types of telemetry for example, an alert could be triggered based on a combination of high error rates in metrics, specific error messages in logs, and slow transaction times in apm data notification system the alerting system integrates with a wide range of notification channels to ensure timely and effective communication of alerts out of the box support is provided for common platforms such as email, slack, pagerduty, opsgenie, microsoft teams, and telegram additionally, the webhook functionality allows for integration with virtually any system capable of receiving http post requests, enabling custom notifications to fit diverse operational workflows this extensive notification support ensures that alerts can be seamlessly incorporated into existing processes, regardless of the communication tools an organization employs advanced routing policies the alerting system provides extensive support for complex routing policies, enabling organizations to create sophisticated notification workflows these policies can be based on various factors including alert labels, severity, time of day, and team assignments the system supports hierarchical routing trees, allowing for granular control over alert escalation and distribution anomaly based alerting support for anomaly based alerts based alerts is available on top of the alerting system please contact your opsverse customer success manager to learn more about anomaly detection