Splunk Observability Engineer

TEKsystems

Posted Thursday, October 30, 2025

Posting ID: JP-005640482

Morrisville, NC

**Cannot support C2C or sponsorship at this time**

Description

Job Description: Splunk Observability Engineer

Role Summary

We are seeking a highly skilled Splunk Observability Engineer with a strong System Administration and Infrastructure Automation background. The ideal candidate will design, build, and manage large-scale observability solutions across hybrid cloud environments, leveraging Splunk Observability Cloud, OpenTelemetry, and automation tools to enable end-to-end visibility, incident response, and performance insights.

Technical Expertise

• Education & Experience:

◦ Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field.

◦ 8–10 years of relevant experience in infrastructure operations, observability, or DevOps engineering.

◦ Proven experience managing environments with 10,000+ compute systems across RHEL, OpenStack, and VMware.

• Core Infrastructure Expertise:

◦ Strong hands-on experience with Red Hat Enterprise Linux — build, configuration, hardening, patching, and lifecycle operations.

◦ Working knowledge of VMware vSphere, Cisco UCS compute infrastructure, and OpenStack environments.

◦ Deep understanding of networking fundamentals, firewalls, iptables, and system security best practices.

• Automation & Configuration Management:

◦ Expertise with Ansible, Terraform, and Puppet for OS provisioning, configuration, and lifecycle automation.

◦ Experience with CI/CD pipelines using Git, Jenkins, or similar tools for automated delivery and testing.

◦ Strong scripting and automation background in Python or Ruby.

• Observability & Monitoring:

◦ Experience designing and managing Splunk Observability Cloud, Splunk Enterprise, or equivalent monitoring platforms.

◦ Ability to instrument applications and systems using OpenTelemetry, Telegraf, or custom agents.

◦ Knowledge of correlating metrics, logs, traces, and events to build actionable insights and alerts.

Key Responsibilities

• Participate actively in Agile scrum ceremonies and sprint planning.

• Design and implement automated provisioning pipelines for RHEL and observability agents using Ansible, Terraform, and CI/CD workflows.

• Manage the Splunk Observability platform — ingestion pipelines, detectors, dashboards, and alerting.

• Monitor, diagnose, and resolve complex infrastructure performance issues.

• Drive observability adoption across systems and applications, improving MTTR and SLO compliance.

• Maintain documentation, runbooks, and configuration repositories for infrastructure and observability systems.

Non-Technical / Behavioral Skills

• Proven experience working in Agile environments with globally distributed teams.

• Strong communication and collaboration skills, with the ability to influence cross-functional stakeholders.

• Demonstrated problem-solving and troubleshooting ability in complex distributed environments.

• Self-motivated and proactive with a strong sense of ownership and accountability.

• Experience with open-source ecosystems and community-driven development practices.

Preferred Qualifications

• Splunk certifications (e.g., Splunk Certified Admin / Observability Engineer / Core Certified Power User).

• Exposure to Kubernetes, AWS CloudWatch, or Grafana/Prometheus integration.

• Experience implementing AIOps, anomaly detection, or predictive alerting solutions.

Experience Level

Expert Level

Compensation: $72

Contact Information

Email: ryanhughes@teksystems.com

The company is an equal opportunity employer and will consider all applications without regard to race, sex, age, color, religion, national origin, veteran status, disability, sexual orientation, gender identity, genetic information, or any other characteristic protected by law.

Hybrid

Communication

Operations

Workflow Management

Information Technology

Automation

Accountability

Self-Motivation

Operating Systems

Dashboard

Python (Programming Language)

Agile Methodology

Scripting

Influencing Skills

Computer Science

Technology Ecosystems

Problem Solving

Troubleshooting (Problem Solving)

DevOps

Amazon Web Services

CI/CD

Ruby (Programming Language)

Observability

Puppet (Configuration Management Tool)

Ansible

Terraform

Splunk

Kubernetes

Git (Version Control System)

Firewall

System Administration

Collaboration

Incident Response

Scrum (Software Development)

Jenkins

Hardening

Configuration Management

VMware vSphere

Virtual Teams

Sprint Planning

Grafana

Prometheus (Software)

Red Hat Enterprise Linux

Hybrid Cloud Computing

OpenStack

Open Source Technology

Infrastructure Automation

Anomaly Detection

Amazon CloudWatch

AIOps (Artificial Intelligence For IT Operations)

Telegraf

iptables

OpenTelemetry
