Skip to main content
CareerCircle Home
Log in
Join
Search for and find Splunk Observability Engineer jobs and TEKsystems jobs at CareerCircle.com
TEKsystems jobs, learn more at CareerCircle.com

Splunk Observability Engineer

TEKsystems

Posted Thursday, October 30, 2025

Posting ID: JP-005640482

Morrisville, NC
Share:
FacebookTwitterLinkedin

**Cannot support C2C or sponsor at this time**



Description

Job Description: Splunk Observability Engineer


Role Summary

We are seeking a highly skilled Splunk Observability Engineer with a strong System Administration and Infrastructure Automation background. The ideal candidate will design, build, and manage large-scale observability solutions across hybrid cloud environments, leveraging Splunk Observability Cloud, OpenTelemetry, and automation tools to enable end-to-end visibility, incident response, and performance insights.


Technical Expertise


• Education & Experience:

◦ Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field.

◦ 8–10 years of relevant experience in infrastructure operations, observability, or DevOps engineering.

◦ Proven experience in managing environments with 10,000+ compute systems across RHEL, OpenStack, and VMware.


• Core Infrastructure Expertise:

◦ Strong hands-on experience with Red Hat Enterprise Linux — build, configuration, hardening, patching, and lifecycle operations.

◦ Working knowledge of VMware vSphere, Cisco UCS compute infrastructure, and OpenStack environments.

◦ Deep understanding of networking fundamentals, firewalls, iptables, and system security best practices.


• Automation & Configuration Management:

◦ Expertise with Ansible, Terraform, and Puppet for OS provisioning, configuration, and lifecycle automation.

◦ Experience with CI/CD pipelines using Git, Jenkins, or similar tools for automated delivery and testing.

◦ Strong scripting and automation background in Python or Ruby.


• Observability & Monitoring:

◦ Experience designing and managing Splunk Observability Cloud, Splunk Enterprise, or equivalent monitoring platforms.

◦ Ability to instrument applications and systems using OpenTelemetry, Telegraf, or custom agents.

◦ Knowledge of metrics, logs, traces, and events correlation to build actionable insights and alerts.


Key Responsibilities:

• Participate actively in Agile scrum ceremonies and sprint planning.

• Design and implement automated provisioning pipelines for RHEL and observability agents using Ansible, Terraform, and CI/CD workflows.

• Manage the Splunk Observability platform — ingestion pipelines, detectors, dashboards, and alerting.

• Monitor, diagnose, and resolve complex infrastructure performance issues.

• Drive observability adoption across systems and applications, improving MTTR and SLO compliance.

• Maintain documentation, runbooks, and configuration repositories for infrastructure and observability systems.


Non-Technical / Behavioral Skills

• Proven experience working in Agile environments with globally distributed teams.

• Strong communication and collaboration skills, with the ability to influence cross-functional stakeholders.

• Demonstrated problem-solving and troubleshooting ability in complex distributed environments.

• Self-motivated and proactive with a strong sense of ownership and accountability.

• Experience with open-source ecosystems and community-driven development practices.


Preferred Qualifications

• Splunk certifications (e.g., Splunk Certified Admin / Observability Engineer / Core Certified Power User).

• Exposure to Kubernetes, AWS CloudWatch, or Grafana/Prometheus integration.

• Experience implementing AIOps, anomaly detection, or predictive alerting solutions.





Experience Level

Expert Level

Compensation:$72

Contact Information

Email: ryanhughes@teksystems.com

The company is an equal opportunity employer and will consider all applications without regards to race, sex, age, color, religion, national origin, veteran status, disability, sexual orientation, gender identity, genetic information or any characteristic protected by law.
Hybrid
Communication
Operations
Workflow Management
Information Technology
Automation
Accountability
Self-Motivation
Operating Systems
Dashboard
Python (Programming Language)
Agile Methodology
Scripting
Influencing Skills
Computer Science
Technology Ecosystems
Problem Solving
Troubleshooting (Problem Solving)
DevOps
Amazon Web Services
CI/CD
Ruby (Programming Language)
Observability
Puppet (Configuration Management Tool)
Ansible
Terraform
Splunk
Kubernetes
Git (Version Control System)
Firewall
System Administration
Collaboration
Incident Response
Scrum (Software Development)
Jenkins
Hardening
Configuration Management
VMware VSphere
Virtual Teams
Sprint Planning
Grafana
Prometheus (Software)
Red Hat Enterprise Linux
Hybrid Cloud Computing
OpenStack
Open Source Technology
Infrastructure Automation
Anomaly Detection
Amazon CloudWatch
AIOps (Artificial Intelligence For IT Operations)
Telegraf
Iptables
OpenTelemetry

Blog