- Lead critical incidents — coordinate multi-disciplinary response efforts across Databricks' cloud-based services to rapidly mitigate impact and restore operations.
- Drive technical root cause analysis and Reliability improvements:
- collaborate with engineering teams to trace and document underlying causes across distributed systems, services, and data stores.
- Summarize key learnings, clearly communicate action items, and ensure that technical and procedural improvements are followed through.
- Own communications during incidents — deliver frequent, high-quality updates to internal stakeholders (executives, engineering leadership, support) and compose and publish customer-facing notifications that are accurate, timely, and empathetic.
- Mentor and train peers in both incident communication and technical response disciplines to raise the overall quality of Databricks' incident response.
- 5+ years of experience in incident management, site reliability engineering, or production operations supporting large-scale, cloud-native systems.
- Proven ability to lead and coordinate high-severity incidents, including identifying impact, isolating fault domains, and managing multi-team response efforts.
- Strong understanding of cloud infrastructure (AWS, Azure, or GCP) — including compute, networking, storage, and observability components.
- Deep expertise in log analysis and debugging.
- Familiarity with log aggregation and search tools (e.g., Datadog, Elasticsearch, Splunk, Cloud Logging, or OpenTelemetry).
- Hands‑on experience with observability systems — metrics, logging, and tracing frameworks (Prometheus, Grafana, OpenTelemetry, etc.).
- Proficiency in at least one major programming or scripting language (Python, Go, or Bash) for automating diagnostics, data collection, or analysis.
- Experience developing and maintaining incident playbooks and communication templates to ensure consistent, timely updates.
- Excellent contextual interpretation and writing skills, as well as the ability to effectively summarize and communicate to both technical and business audiences, are required.
- BS, Master's or other advanced degree in Computer Science or Computer Engineering, or related Engineering field.
-
Operations Management focuses on efficiently meeting the needs of our clients across various lines of business. If your passion is managing and developing staff to ensure quality care to help our clients live their best life, we encourage you to apply today. · A comprehensive ran ...
CLARKSBURG1 month ago
-
We're hiring immediately for our respected Management Training Program. · ...
Clarksburg2 weeks ago
-
You could be the one who changes everything for our 28 million members. Centene is transforming the health of our communities, one person at a time. · ...
Northampton $107,700 - $199,300 (USD) Full time1 month ago
-
You could be the one who changes everything for our 28 million members by using technology to improve health outcomes around the world. As a diversified, national organization, Centene's technology professionals have access to competitive benefits including a fresh perspective on ...
Northampton $87,000 - $161,300 (USD) Full time1 month ago
-
You could be the one who changes everything for our 28 million members by using technology to improve health outcomes around the world. · ...
Cacao $102,900 - $190,500 (USD) Full time2 weeks ago
-
+ Job summary · You could be the one who changes everything for our members by using technology to improve health outcomes around the world. · + Lead a Production Support team responsible for monitoring, triaging and resolving incidents for batch processes across shifts. · Ensure ...
Northampton $102,900 - $190,500 (USD) Full time1 month ago
-
You could be the one who changes everything for our 28 million members by using technology to improve health outcomes around the world. · This is a diversified, national organization that has access to competitive benefits including a fresh perspective on workplace flexibility. · ...
Northampton $102,900 - $190,500 (USD) Full time1 month ago
-
You could be the one who changes everything for our 28 million members. Centene is transforming the health of our communities, one person at a time. · ...
Cacao $87,700 - $157,800 (USD) Full time1 month ago
-
You could be the one who changes everything for our 28 million members by using technology to improve health outcomes around the world. · As a diversified national organization Centene's technology professionals have access to competitive benefits including a fresh perspective on ...
Northampton $87,000 - $161,300 (USD) Full time1 month ago
-
The Senior Health Physicist will support medical facilities, research institutions, and industrial clients by overseeing and implementing radiation protection programs to ensure compliance with federal and state regulations, accreditation standards, · and best practices in radiat ...
Sodankylä $75,000 - $125,000 (USD) Full time5 days ago
-
+Job summary+Bold in how we dream and innovate Responsive to feedback challenges and opportunities Accountable for results best in class outcomes Visionary in future focused problem-solving Exceptional in execution impact · +Responsibilities+ · Managed Threat Support: Actively co ...
Prior Lake $78,800 - $155,430 (USD) Full time1 month ago
-
Sell Cybersecurity Services to customers in the West Region. · ...
Alamosa $100,000 - $120,000 (USD) Full time1 month ago
-
The EHS Manager is responsible for developing, implementing, and overseeing environmental, health, and safety programs that ensure compliance with ISO 9001 standards. · The role involves fostering a culture of safety and environmental responsibility, · conducting regular audits, ...
Tipton, MO1 month ago
-
Allied Universal is an Equal Opportunity Employer. · All qualified applicants will receive consideration for employment without regard to race/ethnicity, age, color, religion, sex, sexual orientation, gender identity, · national origin, genetic information, · disability, · prote ...
Russellville2 weeks ago
-
Nurse Residency Position at Mercy Hospital South · In this exciting opportunity at Mercy Hospital South Emergency Department . As part of our team, you'll collaborate closely with ED physicians to ensure optimal patient care. With numerous educational opportunities available with ...
Jamestown19 hours ago
-
The EHS Manager is responsible for developing, implementing and overseeing environmental, health and safety programs to ensure compliance with ISO 9001 standards. · ...
Tipton, MO1 week ago
-
The EHS Manager is responsible for developing, implementing and overseeing environmental health and safety programs that ensure compliance with ISO 9001 standards. · Align EHS policies and procedures with ISO 9001 standards to ensure consistent quality management practices across ...
Tipton Full time1 month ago
-
The IT Support Specialist will provide computer onsite support execution for factory or Client energy site's owned/managed assets and operational support of both office IT and shop floor IT/OT assets that are not supported by the OSS techs. · Provide computer onsite support of e ...
Jefferson City1 month ago
-
We're Hiring: Risk Management Specialist – Join Our Growing Team Are you ready to make an impact in a fast-paced dynamic environment We're looking for a Risk Management Specialist to support our Risk Management department in addressing property damage claims vendor relationships ...
Columbia $49,000 - $54,000 (USD) Full time1 month ago
-
The part-time Campus Security Officer/Shuttle Driver plays a critical role in maintaining a safe and secure environment for students, faculty, staff, and visitors. · ...
Columbia Part time1 week ago
-
The Project Health and Safety Manager is responsible for planning, implementing, and overseeing safety programs to ensure compliance with all local, state, federal and client-based health and safety regulations on the construction project site. · ...
Columbia1 month ago
Senior Incident Manager - California - Databricks Inc.
Description
At Databricks, we are passionate about empowering data teams to tackle the world's most challenging problems — from bringing the next mode of transportation to reality to accelerating the development of medical breakthroughs. We achieve this by building and operating the world's best data and AI infrastructure platform, enabling our customers to leverage deep data insights and enhance their business. Founded by engineers — and customer-obsessed — we leap at every opportunity to tackle technical challenges, from designing next-gen UI/UX for interfacing with data to scaling our services and infrastructure across millions of virtual machines. And we're only getting started.
As a Senior Incident Manager, you will lead Databricks' most critical production incidents while providing clear, accurate, and timely communication to customers, executives, and engineers. You'll serve as both incident commander and reliability engineer; orchestrating multi-team responses, driving real-time status updates, and partnering with engineering to analyze and prevent failures. Your work will ensure Databricks maintains not only technical resilience but also customer and stakeholder confidence during high-impact events.
This role combines operational leadership, technical systems knowledge, and exceptional communication skills. You will be at the intersection of engineering depth and operational clarity, ensuring that every major incident is managed with precision, transparency, and continuous improvement.
The impact you will have here:
What are we looking for?
Zone 1 Pay Range
$143,300 — $200,600 USD
About Databricks
Databricks is the data and AI company. More than 10,000 organizations worldwide — including Comcast, Condé Nast, Grammarly, and over 50% of the Fortune 500 — rely on the Databricks Data Intelligence Platform to unify and democratize data, analytics and AI. Databricks is headquartered in San Francisco, with offices around the globe and was founded by the original creators of Lakehouse, Apache Spark, Delta Lake and MLflow. To learn more, follow Databricks on Twitter, LinkedIn and Facebook.
Benefits
At Databricks, we strive to provide comprehensive benefits and perks that meet the needs of all of our employees. For specific details on the benefits offered in your region, please visit
Our Commitment to Diversity and Inclusion
At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics.
Compliance
If access to export-controlled technology or source code is required for performance of job duties, it is within Employer's discretion whether to apply for a U.S. government license for such positions, and Employer may decline to proceed with an applicant on this basis alone.
#J-18808-Ljbffr
-
Area Supervisor
Only for registered members CLARKSBURG
-
Management Trainee
Only for registered members Clarksburg
-
Senior Manager, Privacy Compliance
Full time Only for registered members Northampton
-
Senior Site Reliability Engineer
Full time Only for registered members Northampton
-
Lead Systems Engineer
Full time Only for registered members Cacao
-
Manager, Data Engineering
Full time Only for registered members Northampton
-
Lead Site Reliability Engineer
Full time Only for registered members Northampton
-
Senior Security Compliance Analyst
Full time Only for registered members Cacao
-
Sr Power Platform Development Engineer
Full time Only for registered members Northampton
-
Health Physicist
Full time Only for registered members Sodankylä
-
Security Solution Engineer
Full time Only for registered members Prior Lake
-
Cybersecurity Sales Account Executive
Full time Only for registered members Alamosa
-
Environmental Health and Safety Manager
Only for registered members Tipton, MO
-
Security Officer Mobile Foot Patrol
Only for registered members Russellville
-
Emergency Department Nurse Residency
Mercy- Jamestown
-
Environmental Health and Safety Manager
Only for registered members Tipton, MO
-
Environmental Health and Safety Manager
Full time Only for registered members Tipton
-
Information Technology Support Specialist
Only for registered members Jefferson City
-
Home Office
Full time Only for registered members Columbia
-
Part-Time Security Officer/Shuttle Driver
Part time Only for registered members Columbia
-
Project Health and Safety Manager
Only for registered members Columbia
