Job Title: Datacenter Infrastructure Engineer
Duration: 6 Months (Possibility of Extension)
Location Preference: Dallas, TX area (Preferred)
Overview
We are looking for an Infrastructure Development Engineer to design, operate, and scale foundational datacenter services that power bare-metal, virtualization, and cloud-adjacent platforms.
This role owns the automation to boot and manage critical services such as corporate IPAM/DDI, CMDB, and datacenter bootstrapping systems.
The engineer will work across hardware, networking, and platform teams to ensure infrastructure is discoverable, automated, reliable, and ready for self-service consumption.
Key Responsibilities
Automation & Development
• Build automation and tools using Python
• Develop Python-based tools and services for provisioning, configuration, monitoring, and self-service workflows
• Automate operational tasks such as imaging, deployments, health checks, and remediation
• Integrate internal and external APIs to orchestrate infrastructure workflows across compute, storage, network, and cloud
• Developers with experience in C++ or Java will also be considered
Software Defined Network Services (IPAM, DDI & CMDB)
• Own and operate corporate IP Address Management (IPAM) and DDI (DNS, DHCP, IPAM) platforms
• Design scalable IP allocation, DNS, and DHCP strategies across datacenters
• Integrate IPAM/DDI systems with provisioning, bootstrapping, and CMDB workflows
• Act as a steward of the CMDB ensuring accuracy and automation-driven updates
• Define standards for asset discovery, lifecycle state, and dependency mapping
Monitoring, Observability & Reliability
• Implement monitoring, alerting, and dashboards for infrastructure health using tools such as Prometheus, Grafana, ELK, Nagios
• Track key metrics such as availability, latency, capacity, and error rates
• Participate in incident response and root cause analysis
• Implement long-term fixes and operational runbooks
Required Skills & Experience
Core Technical Skills
• Experience with bare-metal provisioning and hypervisor deployment
• Hands-on experience with OpenStack, VMware, KubeVirt, or similar virtualization platforms
• Deep understanding of IPAM, DNS, and DHCP
• Experience with CMDB systems
• Knowledge of datacenter networking concepts including Fibre Channel
• Proficiency with Linux systems and troubleshooting
Automation & Systems Thinking
• Experience building infrastructure automation and onboarding pipelines
• Familiarity with API-driven integrations and workflow orchestration
• Ability to understand infrastructure as a platform
Collaboration & Ownership
• Experience working with hardware, network, storage, and SRE teams
• Strong operational mindset focused on reliability and supportability
• Ability to solve complex problems with automated solutions
Nice to Have
• Experience with large-scale internal platforms or infrastructure as a product
• Background in Site Reliability Engineering (SRE)
• Experience with self-service infrastructure platforms
• Experience in multi-datacenter or hybrid environments
Server Bootstrapping & Provisioning Automation
• Experience with datacenter bootstrapping services including PXE, imaging, and OS/hypervisor provisioning
• Ensure seamless transition from hardware arrival to production-ready infrastructure
• Improve time-to-serve metrics for new racks, clusters, and testbeds