Site Reliability Engineer (SRE) Job at Tekaccel Inc, Texas

QW4yQlhzUEJRRS8xaDVVaFVFd25iV2x5Rnc9PQ==
  • Tekaccel Inc
  • Texas

Job Description

Role: Site Reliability Engineer (SRE)
Location: Dallas, TX or San Antonio, TX
Duration: Long-Term Contract

Job Description:

We are seeking a Site Reliability Engineer (SRE) with expertise in Datadog and ELK (Elasticsearch, Logstash, Kibana) technologies to enhance our logging, alerting, and monitoring systems . This role will play a critical part in improving application reliability, performance, and observability by implementing best-in-class monitoring solutions.

The engineer will work closely with the automation team to streamline processes, ensuring system health is proactively managed through intelligent alerts and dashboards. The goal is to create an optimized monitoring environment that aligns with the Automation Objectives and Key Results (OKR) strategy.

Key Responsibilities:
  • Design, develop, and maintain observability and monitoring solutions using Datadog and ELK stack .
  • Enhance application logging and alerting systems to improve incident detection and resolution time .
  • Collaborate with the automation team to integrate monitoring solutions with automated workflows.
  • Develop user-friendly dashboards that provide real-time insights into application performance and system health .
  • Implement proactive monitoring strategies to prevent outages and improve service reliability.
  • Optimize system alerts to reduce noise and ensure actionable insights for DevOps and engineering teams.
  • Work closely with developers, infrastructure teams, and security teams to ensure application stability and performance.
  • Contribute to the development and execution of SRE best practices , including site reliability automation, self-healing mechanisms, and incident response frameworks .
  • Participate in on-call rotations to ensure high availability and quick incident resolution.
Qualifications & Skills: Required Expertise:
  • Strong hands-on experience with Datadog and ELK stack (Elasticsearch, Logstash, Kibana) .
  • Expertise in log management, metrics collection, alerting, and distributed monitoring .
  • Experience in designing dashboards for real-time monitoring and analytics .
  • Proficiency in automating monitoring workflows to support DevOps and SRE initiatives .
  • Ability to optimize alerting mechanisms to improve operational efficiency.
Preferred Qualifications:
  • Experience with cloud platforms such as AWS, Azure, or Google Cloud .
  • Familiarity with Infrastructure as Code (IaC) tools like Terraform or Ansible .
  • Strong programming or scripting skills in Python, Bash, or Golang .
  • Knowledge of CI/CD pipelines and how monitoring integrates with DevOps workflows .
  • Understanding of Kubernetes, Docker, and containerized application monitoring .
  • Strong problem-solving skills with the ability to troubleshoot complex system issues .

Job Tags

Contract work,

Similar Jobs

Peraton

Site Reliability Engineer (SRE) Job at Peraton

 ...Responsibilities: Manage, support and maintain a reliable environment for the site to ensure the stability and security...  ...to obtain GSA Public Trust ~ The SRE requires minimum of 5 years of...  ...in working within software engineer team who leveraged DevOps with development... 

Interior Talent

Recruitment Manager Job at Interior Talent

 ...discriminate on the basis of race, color, creed, religion, gender, gender identity, pregnancy, marital status, partnership status, domestic violence victim status, sexual orientation, age, national origin, alienage or citizenship status, veteran or military status,... 

HSM

Truck Driver Job at HSM

 ...Overview HSM Spiller is looking for a CDL-A truck driver. No experience needed but must hold a CDL Class-A. At HSM Spiller we pride ourselves on being home weekly. Along with runs that last 2-3 days at a time. At HSM Spiller there is a 99% no touch freight along with... 

OysterLink

Cafe Manager Job at OysterLink

About the job OysterLink is the go-to website for sourcing top-tier jobs in the hospitality industry. Were looking for a Caf Manager to join the team at Kwench Juice Caf in Apex, North Carolina. If you are passionate about health and wellness and committed to providing...

Company Confidential

Chef Job at Company Confidential

 ...histories consistent with applicable law. Pursuant to state and local pay disclosure requirements, the pay range for this role, with...  ...in accordance with applicable plan documents. Benefits for Union represented employees will be in accordance with the applicable...