Description
Invent the future with us.
Recognized by Fast Company’s 2023 100 Best Workplaces for Innovators List, Ampere is a semiconductor design company for a new era, leading the future of computing with an innovative approach to CPU design focused on high-performance, energy efficient, sustainable cloud computing.
By providing a new level of predictable performance, efficiency, and sustainability Ampere is working with leading cloud suppliers and a growing partner ecosystem to deliver cloud instances, servers and embedded/edge products that can handle the compute demands of today and tomorrow.
Join us at Ampere and work alongside a passionate and growing team — we’d love to have you apply!
About the role:
In this role as an AI Accelerator Software Engineer-Runtime Library, you will drive the development and optimization of cutting-edge AI frameworks. You will be at the forefront of advancing AI capabilities, helping to pave the way for high-performance and efficient computing solutions that will meet future AI demands.
What you’ll achieve:
- In this role, you will build a runtime library accelerator that will enable multiple frameworks and serving platforms for the Ampere deep learning accelerator.
- Go deep into to the entire SW/HW stack to accelerate the deep learning including but not limited to inference serving, framework integration, compiler, runtime library, communication and compute kernel development, and performance tuning.
- In this role, you will work on deep learning model enabling with performance and accuracy for popular frameworks like PyTorch and Llama.cpp and for serving platforms like vLLM and SGLang, positioning you at the forefront of AI innovation.
- HW/SW codesign to optimize existing AI architectures to enhance computational efficiency, increase throughput, reduce latency, and improve the scalability, pushing the boundaries of what's possible in AI technology.
- Be a key team member in building state-of-the-art software and hardware AI co-processors/accelerators, contribute to a collaborative and dynamic work environment, supporting continuous improvement and excellence.
- Collaborate with cross-functional teams to integrate AI solutions into Ampere's cloud-native processor platforms and accelerators.
About you:
- BS Computer Science, Mathematics or a related technical field & 12 years of related experience; or MS degree & 8 years; or PhD & 5 years
- Previous work experience developing user mode driver or runtime library for any GPUs or deep learning accelerator in Linux environment.
- This position requires strong expertise in programming languages such as Python, C/C++ with a strong background in performance tuning.
- Previous software development with a focus on AI frameworks – PyTorch, llama.cpp, ONNX, etc is a big plus.
- Solid understanding of AI and machine learning concepts, including neural networks and data processing frameworks is also preferred.
- Experience with high-performance computing systems and cloud-based architectures.
What we’ll offer:
At Ampere we believe in taking care of our employees and providing a competitive total rewards package that includes base pay, bonus (i.e., variable pay tied to internal company goals), long-term incentive, and comprehensive benefits.
Benefits highlights include:
- Premium medical health care, so that you and your family members can feel secure in your health
- A generous paid time off policy so that you can embrace a healthy work-life balance
- A wide variety of office amenities including nutritious snacks and refreshing drinks, free gym and sauna access to keep you fueled and healthy
- Flexible working hours and a remote work policy that includes reimbursement of connectivity costs and equipment to work from home
- Sports card fully financed by Ampere
And there is much more than compensation and benefits. At Ampere, we foster an inclusive culture that empowers our employees to do more and grow more. We are passionate about inventing industry leading cloud-native designs that contribute to a more sustainable future. We are excited to share more about our career opportunities with you through the interview process.
#LI-DR
#LI-Remote
Ampere is an inclusive and equal opportunity employer and welcomes applicants from all backgrounds. All qualified applicants will receive consideration for employment without regard to race, color, national origin, citizenship, religion, age, veteran and/or military status, sex, sexual orientation, gender, gender identity, gender expression, physical or mental disability, or any other basis protected by federal, state or local law.
Report job