Network Architect
Role Summary
Firmus is seeking a skilled Network Architect / Senior Network Architect to join our Engineering and Technology team. The ideal candidate will play a crucial role in leading the design and deployment of both our physical network infrastructure and network architecture for our AI infrastructure projects.
This role offers an exciting opportunity to work at the forefront of AI networking technology and contribute to the growth of Firmus’ AI infrastructure capabilities. Deep hands-on experience with network hardware is required, particularly with fibre optic systems. Also necessary to succeed in the role is a strong architectural mindset to guide the evolution of scalable, secure and high-performance networks for AI.
Key Responsibilities
Network Architecture & Design
- Architect and maintain low-latency, high-throughput interconnects (e.g. InfiniBand 100/200/400/800GbE) for HPC and AI workloads.
- Collaborate with other network engineers and cross-functional teams to develop network infrastructure roadmaps, aligned with business and technical strategy.
- Lead the design of our layer 1/2/3 network infrastructure in our data centre deployments, our AI Factories, and the cluster interconnects with considerations for redundancy, scalability, and performance.
- Evaluate new technologies, architectures, and design patterns to improve network performance and efficiency.
Network Hardware & Physical Infrastructure
- Lead the design, configuration, and deployment of highly scalable physical networks optimised for AI workloads.
- Oversee the planning and implementation of fibre optic cabling systems (single-mode & multi-mode), including backbone connections, patch panels, and structured cabling.
- Plan the capacity and integration of optical technologies (DWDM, CWDM) and long-haul fibre for intersite connectivity.
- Ensure physical infrastructure aligns with architectural standards and supports scalability, availability, and security goals.
- Create and maintain accurate and up-to-date documentation of network architecture, hardware and cabling.
- Work closely with other engineering disciplines to coordinate the network infrastructure with other services (e.g. mechanical, electrical, security etc.) within the data centre.
- Participate in the operations standby roster and on-call support from time to time.
Network Security and Policy
- Plan and implement firewall and security devices. Apply firewall rules, VLAN segmentation, ACLs adhering to zero-trust principles to safeguard internal and external communications.
- Collaborate with SMC Security and Risk team to enforce policies and respond to security incidents.
Operation Support
- Respond and resolve escalate network issues, outages and performance degradations across the SMC Corporate and Compute network infrastructure.
- Analyze logs, run diagnostics and coordinate with vendors, carriers as needed.
- Work with internal observability team to setup and maintain monitoring tools to proactively identify bottlenecks, errors and abnormal behaviours.
- Analyze trends for bandwidth, hardware utilization, and growth to inform scaling and make recommendation to procurement decisions.
- Design and rest redundant paths, failover mechanisms and DR playbooks to ensure uninterrupted connectivity during outages or maintenance.
- Participate in operation standby roster and on-call for time to time.
Project Management
- Support the deployment team with defining project timelines and resource allocation for the network portion of AI cluster installations.
- Create Bill of Materials and develop budgets for network deployments.
- Coordinate with cross-functional teams to ensure successful project delivery.
- Technology Expertise
- Maintain and expand expertise in physical network hardware and advanced networking technologies, including:
- Optical Transport Network
- NVIDIA InfiniBand
- Spectrum Ethernet Platform
- RDMA over Converged Ethernet (RoCE)
- Familiarity with open-source network operating systems such as Cumulus Linux and Sonic.
- Provide technical support and troubleshooting for advanced networking technologies, escalating to vendors as needed.
- Mentor junior network engineers, assisting with their technical development.
Stakeholder Management & Collaboration
- Work closely with both Firmus Engineering and Commissioning teams to align network infrastructure with customers’ requirements.
- Facilitate knowledge sharing and communication between teams and create and maintain comprehensive technical documentation.
- Maintain and build strong relationships with key technology partners and vendors and proactively manage and coordinate partner engagement on site.
Skills & Experience
- Bachelor’s degree in Network Engineering, Computer Science, or a related technical field.
- 5+ years of experience in network engineering, with a focus on AI infrastructure.
- Strong project management skills and experience leading complex technical projects.
- Solid understanding of advanced networking technologies, particularly those related to AI.
- Hands-on experience with NVIDIA InfiniBand, Spectrum Ethernet Platform, and RoCE.
- Strong experience with network cabling systems, both fibre optic and copper.
- Excellent problem-solving and analytical skills.
- Ability to work independently and as part of a team.
- Strong communication skills, both written and verbal.
- Willingness to travel domestically and internationally for on-site deployments and commissioning as required.
Key Competencies
- AI/HPC network architecture (InfiniBand, 100–800GbE)
- L1/L2/L3 data centre and cluster design
- Fibre & optical infrastructure (SM/MM, DWDM/CWDM)
- Advanced networking tech (RoCE, Spectrum, Cumulus/SONiC)
- Network security, resilience & DR
- Incident response, monitoring & capacity planning
- Project delivery, BoMs & vendor management
- Clear communication and team mentoring
Success Metrics
- Networks meet latency, throughput & scalability targets
- AI cluster deployments delivered on time and within budget
- High availability with minimal unplanned outages
- Fast incident resolution and proactive risk identification
- Accurate, current network documentation
- Strong stakeholder and vendor feedback
Location & Reporting
- Singapore
- Reporting to Senior Manager, Networking
Employment Basis
Full-time
Diversity
At Firmus, we are committed to building a diverse and inclusive workplace. We encourage applications from candidates of all backgrounds who are passionate about creating a more sustainable future through innovative engineering solutions.
Join us in our mission to revolutionize the AI industry through sustainable practices and cutting-edge engineering. Apply now to be part of shaping the future of sustainable AI infrastructure.
About Sustainable Metal Cloud
Our vision is to move cloud computing towards net zero, with solutions forged through advanced technology. Partnering with NVIDIA to provide large-scale GPU AI infrastructure.
WHY YOU'LL LOVE WORKING HERE
Our team shares a passion for possibility, knowing that our technology enables ideas across the world. Ideas that can reshape the course of progress and break down traditional boundaries.