#incident-management

[ follow ]
#ai
fromInfoQ
3 months ago
Artificial intelligence

Datadog Employs LLMs for Assisting with Writing Accident Postmortems

fromInfoQ
3 months ago
Artificial intelligence

Datadog Employs LLMs for Assisting with Writing Accident Postmortems

fromInfoQ
4 weeks ago

Security or Convenience - Why Not Both?

You start the company-issued laptop. You really dislike this machine. It's slow, clunky. You don't like the operating system. You can't even install an ad blocker for your browser.
Software development
Artificial intelligence
fromInfoQ
1 month ago

Logz.io and Dynatrace Innovations Shift Observability Into the AI Age

AI integration into observability platforms is automating operational tasks to enhance efficiency.
Logz.io's AI Agents and Dynatrace's Davis AI significantly reduce incident resolution times.
fromSilicon Canals
1 month ago

AI tool of the week: Netdata Insights, a tool that helps engineers find and fix system issues faster

Netdata Insights revolutionizes incident reporting by automating processes and delivering actionable intelligence from complex telemetry data.
Remote teams
fromNew Relic
2 months ago

Team collaboration speeds incident response

New Relic Teams enhances incident troubleshooting by centralizing ownership information, improving team coordination and reducing response times.
fromDevOps.com
2 months ago

Causely Extends Reach of Observability to Grafana Dashboards - DevOps.com

"Causely’s integration of Grafana dashboards enhances root cause analysis for DevOps teams, providing better visibility and actionable intelligence in IT workflows."
Artificial intelligence
fromSecuritymagazine
2 months ago

Automate or Fall Behind - Crisis Response at the Speed of Risk

Most businesses still treat crisis response like it's 2015. A ransomware alert goes out. Emails fly. Group chats explode. Someone digs out the playbook.
DevOps
#cybersecurity
Privacy professionals
fromSecuritymagazine
3 months ago

The Oracle breach and the case for transparent cyber response

The Oracle Cloud breach highlights the importance of responsiveness in cybersecurity, showcasing that initial denial can exacerbate damage.
Timely communication post-breach is critical to maintain trust and facilitate organizational responses.
Privacy professionals
fromSecuritymagazine
3 months ago

The Oracle breach and the case for transparent cyber response

The Oracle Cloud breach highlights the importance of responsiveness in cybersecurity, showcasing that initial denial can exacerbate damage.
Timely communication post-breach is critical to maintain trust and facilitate organizational responses.
Artificial intelligence
fromDevOps.com
3 months ago

Next-Generation Observability: Combining OpenTelemetry and AI for Proactive Incident Management - DevOps.com

Modern systems necessitate advanced monitoring solutions like OpenTelemetry due to the inadequacies of traditional tools.
fromIrish Independent
3 months ago

Marks and Spencer pauses online orders and contactless payments in stores as ongoing cyber security incident persists

“As part of our proactive management of a cyber incident, we have made the decision to pause taking orders via our M&S.com websites and apps.”
E-Commerce
Information security
fromInfoQ
5 months ago

How a Manual Remediation for a Phishing URL Took Down Cloudflare R2

Human error led to Cloudflare's R2 Gateway service outage, affecting multiple other services for over an hour.
Artificial intelligence
fromInfoQ
5 months ago

Resilience, Observability and Unintended Consequences of Automation

Courtney Nash's extensive background integrates cognitive science with technology to improve resilience engineering in DevOps.
fromDevOps.com
6 months ago

Navigating System Failures: Best Practices for Incident Management and Rapid Recovery in 2025 - DevOps.com

System failures are inevitable; robust incident management and preparation are essential to minimize downtime and mitigate impacts on businesses.
[ Load more ]