About Company
NVIDIA is a pioneer in accelerated computing, standing at the nexus of artificial intelligence, high-performance computing, and graphics. For over two decades, we’ve been relentlessly pushing the boundaries of what’s possible, revolutionizing industries from gaming and professional visualization to data centers and autonomous machines. Our innovative GPU technology powers the world’s most advanced AI research, enabling breakthroughs in healthcare, science, and smart cities. Joining NVIDIA means becoming part of a global team of visionaries and engineers dedicated to solving the most complex problems and shaping the future of technology. We foster a culture of innovation, continuous learning, and collaborative excellence, providing an environment where brilliant minds can thrive and make a tangible impact. This is an opportunity to contribute directly to the next generation of cloud infrastructure that leverages NVIDIA’s unparalleled capabilities.
Job Description
We are actively seeking a highly skilled and passionate NVIDIA Cloud Engineer to join our dedicated team in Langley, British Columbia, for an immediate start. This critical role is designed for an individual with a deep understanding of cloud platforms and a fervent interest in building, deploying, and managing robust, scalable cloud infrastructure optimized specifically for NVIDIA’s cutting-edge GPU and AI technologies. You will play a pivotal role in designing, implementing, and maintaining sophisticated cloud solutions that underpin our complex software and hardware ecosystems. Your day-to-day will involve close collaboration with diverse internal teams, including software developers, data scientists, and research engineers, ensuring our cloud environments are not only performant and secure but also highly available and cost-efficient. We’re looking for someone who can drive automation, enforce best practices, and contribute significantly to our cloud strategy. If you thrive in a dynamic, fast-paced, and highly innovative environment, possess a strong commitment to operational excellence, and are eager to make an immediate and lasting impact on the future of AI and accelerated computing, this is your chance to shine.
Key Responsibilities
- Design, implement, and manage scalable and secure cloud infrastructure (IaaS, PaaS, SaaS) across various public and private cloud environments, with a primary focus on leveraging NVIDIA technologies for optimal performance.
- Automate the deployment, scaling, and management of cloud resources using Infrastructure as Code (e.g., Terraform, Ansible, Pulumi).
- Proactively monitor cloud infrastructure performance, identify potential bottlenecks, and implement solutions for optimization, cost efficiency, and reliability.
- Collaborate closely with development teams to seamlessly integrate NVIDIA AI/ML frameworks and GPU-accelerated applications into robust cloud platforms.
- Develop and maintain efficient CI/CD pipelines for cloud-native applications and core infrastructure components.
- Ensure the highest standards of security are met in all cloud deployments, including rigorous access control, data encryption, network segmentation, and compliance.
- Provide expert technical support and advanced troubleshooting for cloud-related issues, ensuring high availability and system reliability.
- Stay abreast of emerging cloud technologies, new NVIDIA products, and industry best practices to continuously drive innovation and improvement within our cloud architecture.
Required Skills
- Minimum of 3 years of professional experience in Cloud Engineering, DevOps, or Site Reliability Engineering roles.
- Demonstrated proficiency with at least one major cloud platform (AWS, Azure, GCP), including hands-on experience with core compute, storage, and networking services.
- Strong understanding of containerization technologies (Docker, Kubernetes) and experience with orchestration platforms.
- Proven experience with Infrastructure as Code (e.g., Terraform, CloudFormation, ARM Templates) for automated provisioning and configuration.
- Solid scripting and programming skills (e.g., Python, Bash, Go) for automation and tool development.
- Familiarity with CI/CD tools and practices (e.g., Jenkins, GitLab CI, Azure DevOps, GitHub Actions).
- Comprehensive understanding of networking concepts (VPC, VPN, DNS, Load Balancing) and robust security best practices in cloud environments.
- Experience with monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, ELK Stack, Splunk).
Preferred Qualifications
- Bachelor's or Master's degree in Computer Science, Computer Engineering, or a closely related technical field.
- Possession of relevant professional cloud certifications (e.g., AWS Certified Solutions Architect – Professional, Azure Solutions Architect Expert, Google Cloud Professional Cloud Architect).
- Direct hands-on experience with NVIDIA GPU technologies, CUDA, and deep learning frameworks (e.g., TensorFlow, PyTorch).
- Experience with High-Performance Computing (HPC) environments or managing large-scale data processing pipelines.
- Knowledge of SRE (Site Reliability Engineering) principles and their application to cloud operations.
- Prior experience working in a fast-paced, agile development environment with a focus on rapid iteration and deployment.
Perks & Benefits
- Competitive salary and performance-based bonuses designed to reward exceptional contributions.
- Comprehensive health, dental, and vision insurance coverage for employees and their families.
- Generous paid time off, including vacation, sick leave, and company-recognized holidays.
- Extensive opportunities for professional development, including access to training courses, certifications, and conferences.
- Access to cutting-edge NVIDIA technology, development kits, and powerful computing resources.
- A vibrant, collaborative, and innovative work environment that encourages creativity and problem-solving.
- Robust retirement savings plan with a substantial company match to support your long-term financial goals.
- Employee stock purchase program (ESPP) allowing you to invest in NVIDIA's future success.
- Wellness programs and employee assistance initiatives to support overall well-being.
How to Apply
Interested candidates are strongly encouraged to click on the application link below to submit their resume and a compelling cover letter. Please take this opportunity to highlight your specific experience with cloud platforms, NVIDIA technologies, and any immediate start availability.