Andromeda Cluster

Andromeda Cluster was founded by Nat Friedman and Daniel Gross to give early-stage startups access to the kind of scaled AI infrastructure once reserved only for hyperscalers. We began with a single managed cluster — but it filled almost instantly. Today, Andromeda works with leading AI labs, data centers, and cloud providers to deliver compute when and where it’s needed most. Our long-term vision is to build the liquidity layer for global AI compute. We are expanding to new frontiers to find the brightest that work in AI infrastructure, research and engineering.

Software Engineer - AI Infrastructure

Software EngineerSoftware EngineerFull TimeRemoteTeam 11-50

Location

United States

Posted

8 days ago

Salary

Not specified

No structured requirement data.

Job Description

Software Engineer - AI Infrastructure

Location: North America Remote / San Francisco · Full-Time

About Andromeda

Andromeda Cluster, founded by Nat Friedman and Daniel Gross, is on a mission to democratize access to cutting-edge AI infrastructure previously reserved for hyperscalers. What began with a single managed cluster has quickly evolved into a global platform, connecting leading AI labs, data centers, and cloud providers. Our orchestration layer seamlessly routes training and inference jobs across the world, unlocking flexibility and efficiency in one of the fastest-growing sectors on earth. Our long-term vision is to establish a global marketplace for AI compute—powering AGI with the same fluidity as world financial markets.

We are scaling rapidly and seeking exceptional talent in AI infrastructure, research, and engineering.

The Role

As an Infrastructure Product Engineer, you will play a pivotal role in building the backbone of Andromeda’s platform. You'll transform complex, real-world infrastructure challenges into scalable product capabilities that benefit our customers.

Positioned at the intersection of infrastructure and product engineering, this role is deeply technical and systems-oriented, yet laser-focused on building solutions with broad leverage.

What You'll Do

  • Design and develop core platform components, including infrastructure orchestration, provisioning, and lifecycle management solutions.

  • Build robust APIs, services, and control planes that abstract over diverse infrastructure types (VMs, Kubernetes, bare metal, schedulers).

  • Translate customer usage patterns into product requirements, delivering impactful features and improvements.

  • Create automation and internal tooling to eliminate manual or ad-hoc operational work.

  • Enhance reliability, performance, and observability at the platform level, emphasizing durable improvements over quick fixes.

  • Collaborate with peer teams to define clear ownership boundaries between platform capabilities and customer-specific solutions.

  • Write clean, maintainable, and well-documented code with a focus on long-term sustainability.

  • Participate in technical design discussions and contribute to the architectural evolution of our platform.

What We're Looking For

  • 5+ years of experience in Infrastructure, Platform, or Backend Engineering roles.

  • Strong systems fundamentals: deep understanding of Linux, networking, storage, and distributed systems.

  • Proven expertise with Kubernetes, VMs, or bare-metal environments.

  • Advanced software engineering skills; capable of building production-grade APIs and services (Python, Go, or similar).

  • Extensive experience with infrastructure as code and automation tools (Terraform, Ansible, Helm, etc.).

  • Demonstrated ability to navigate ambiguity and distill complex problems into clear, maintainable abstractions.

  • Product-focused mindset: care about interfaces, defaults, reliability, and sustainable operations.

  • Excellent written and verbal communication skills; effective collaborator across engineering and product functions.

Nice to Have:

  • Hands-on experience with GPU or AI infrastructure.

  • Experience with control-plane or orchestration systems.

  • Background spanning both infrastructure and application/backend engineering.

  • Experience architecting multi-tenant systems.

  • Strong skills in technical writing and design documentation.

  • Early-stage startup experience.

Why You’ll Love It Here

This is a true builder’s opportunity: you’ll have ownership and autonomy to shape our systems, engage directly with customers and providers, and lay the foundations for scalable, reliable AI infrastructure. Join us at Andromeda and help power the future of AI.

Related Job Pages

More Software Engineer Jobs

opsi Software Developer

GoTab

GoTab is fully committed to Equal Employment Opportunity and to attracting, retaining, developing, and promoting the most qualified employees without regard to their race, color, religion, creed, sex, gender, sexual orientation, gender identity, gender expression, age, national origin, genetic information, marital/familial status, disability, military status, veteran status, or any other protected status. We are dedicated to providing a work environment free from discrimination and harassment, where employees are treated with respect and dignity. As a company, we are not able to sponsor employment visas at this time, including but not limited to F-1 OPT and H1-B.

Software Engineer8 days ago
Full TimeRemoteTeam 51-200

The developer will focus on maintaining platform stability, leading new partner integration projects, and building features like cost of goods control systems and automated reporting tools for restaurant operations.

United States
$70K - $80K / year
Software Engineer8 days ago
Full TimeRemoteTeam 11-50H1B No Sponsor

The stack: Node.js/Express, TypeScript, React, Postgres, Clickhouse, MongoDB, Heroku                              &am...

United States
Full TimeRemote

Sr. Software Engineer – 5G RAN Location: Aberdeen Proving Ground (APG), MD Type: Remote (Travel as needed) Salary Range: $150,000 – $200,000 annually About Fairwinds Technologies Fairwinds Technologies is a U.S.-based engineering firm specializing in Satellite Communications (SAT...

United States
Full TimeRemote

cFocus Software seeks an Application Developer II to join our program supporting the Executive Office of the President (EOP). This position is remote and requires a TS/SCI clearance. Compare the number of records and number of bytes of data, in aggregate, for a specific data type...

United States