Opportunity Description
Project descriptionWorking on GPU support for openai/triton — a language and compiler for writing highly efficient custom deep‐learning primitives.Work with the open‐source community to analyze, develop, test, and deploy performance improvements for neural networks implemented with triton on GPUs with ROCm.ResponsibilitiesNew features development, support and optimization of openai/triton project for GPUs.Communication with other developers, customers and project managers.Test implementation, project documentation and verification of system with unit/component/functional tests.Mandatory skills descriptionStrong C/C++ programming skill.Experience with compiler internals (LLVM, GCC or any other).
Basic Python programming skill.Experience in performance analysis.Nice-to-have skills descriptionBasic understanding of ML technology.Experience with GPGPU (general purpose GPU) computing (HIP, CUDA, OpenCL, etc.).
Experience with Python.Experience with LLVM and MLIR compiler infrastr...
Interested in this opportunity? Apply now through Expertini.
Apply for this Position