RunPod

Develop, train, and scale AI models. All in one cloud.

Manager, Datacenter Network Engineering

Network EngineerNetwork EngineerFull TimeRemoteTeam 51-200Since 2022H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

44 days ago

Salary

$150K - $240K / year

Bachelor Degree8 yrs expEnglishCloudLinux

Job Description

• Manage and grow a team of network engineers responsible for datacenter fabrics, interconnects, and global WAN connectivity. Provide mentorship, technical guidance, and clear ownership boundaries. • Define and evolve network designs for GPU-heavy clusters, including spine-leaf topologies, ECMP routing, and high-bandwidth east-west traffic patterns. • Oversee design and operation of InfiniBand and RoCE-based fabrics supporting distributed training and inference workloads. Ensure performance, loss characteristics, and congestion control meet AI workload requirements. • Guide implementation and operations of encapsulation technologies such as VXLAN, EVPN, Geneve, or similar, enabling scalable multi-tenant isolation and flexible network provisioning. • Lead strategy and execution for global WAN connectivity, including private backbone links, IX connectivity, and hybrid connectivity with cloud providers and partners. • Establish operational best practices for monitoring, capacity planning, change management, incident response, and post-mortems across the network stack. • Partner closely with Infrastructure, SRE, Hardware, and Product Engineering teams to ensure network capabilities align with platform and customer requirements. • Work with hardware vendors, colocation providers, and transit partners on network design, procurement, deployment timelines, and escalations. • Ensure network designs support secure isolation, DDoS resilience, and compliance requirements without compromising performance.

Job Requirements

  • 3+ years managing network or infrastructure engineering teams, with experience scaling teams and systems in production environments.
  • 8+ years designing and operating large-scale datacenter networks, including spine-leaf architectures, BGP-based routing, and high-throughput fabrics.
  • Strong hands-on experience with VXLAN/EVPN or equivalent encapsulation protocols, including control-plane and data-plane considerations.
  • Proven experience with InfiniBand and/or RoCE, including congestion management, lossless Ethernet concepts, and performance tuning for GPU workloads.
  • Deep familiarity with global WAN technologies, including private backbone design, inter-region connectivity, routing policy, and traffic engineering.
  • Comfortable working with Linux-based systems, network operating systems, and automation tooling.
  • Strong background in network observability, incident management, capacity forecasting, and change control.
  • Clear written and verbal communication skills, with the ability to align stakeholders and lead teams through complex technical challenges.
  • Successful completion of a background check.

Benefits

  • Meaningful equity in a fast-growing company- everyone on the team receives stock options — your impact drives our growth, and you share in the upside.
  • Generous medical, dental & vision plans — we cover 100% for all employees and partial for dependents.
  • Flexible PTO- take the time you need to recharge
  • Most roles are remote work first with an inclusive, collaborative teams utilizing slack as the main form of internal communication
  • Join a passionate team on the cutting edge of AI infrastructure — where culture, learning, and ownership are at the heart of how we scale.

Related Categories

Related Job Pages

More Network Engineer Jobs

Principal Network Engineer, Voice and Unified Communications, Virtual

Providence

Providence has a long history of serving Alaska, beginning when the Sisters of Providence first brought health care to Nome in 1902 during the Gold Rush. This pioneering spirit set the standard for modern health care in Alaska and formed the foundation for Providence's growth as the state's largest private employer and leading health care provider. Award-winning and comprehensive medical centers located in Anchorage, Eagle River, Kodiak Island, Mat-Su, Seward, and Valdez. Not-for-profit network providing a full spectrum of care with leading-edge diagnostics and treatment, outpatient health centers, physician groups and clinics, outreach programs, and hospice and home care.

Network Engineer44 days ago
Full TimeRemoteTeam 10,001+Since 1856H1B Sponsor

Network Engineer designing and operating mission-critical services at Providence

AnsiblePerlPythonSQL
Alaska + 5 moreAll locations: Alaska, California, Montana, Oregon, Texas, Washington
$68 - $136 / hour

Network Engineer

The AME Group

Managed IT Services | Cybersecurity | Business Resilience| Backup and Recovery | Compliance Assist | SOC 2 Type 2

Network Engineer44 days ago
Full TimeRemoteTeam 201-500H1B No Sponsor

IT Network Engineer supporting clients remotely at The AME Group

CloudDNSFirewalls
Tennessee

Member of Technical Staff, Network Engineer

Anchorage Digital

Trusted institutional partner in crypto and first federally chartered crypto bank

Network Engineer48 days ago
Full TimeRemoteTeam 201-500Since 2017H1B Sponsor

Building advanced digital asset platform for institutions to participate in crypto

AWSCloudDockerFirewallsGoogle Cloud PlatformLinuxTerraformUnix
United States
Network Engineer48 days ago
ContractRemoteTeam 51-200Since 2006H1B No Sponsor

Senior Network Engineer in data centre networking using VXLAN and Cisco ACI technologies

Switching
United States