
Sr. Power Attainment Engineer
- Austin, TX
- Permanent
- Full-time
- Actively participate in the analysis of post-silicon performance and power data to ensure the integrity of results and to summarize findings and conclusions
- Learn and execute power attainment test plans during post-silicon phases in support of the Data Center GPU product roadmap
- Proactively drive continuous improvement for post-silicon power attainment activities
- Participate in developing the automation environment, writing scripts that automate workloads and enhance execution capabilities in Linux, Python, and other supporting software tools
- Hands-on experience, locally or remotely, with computers, systems, or data center hardware, providing practical knowledge of server, data center, or thermal equipment as a means to accomplish power attainment work
- Develop and execute characterization test plans for data center GPUs covering power attainment and feature tuning for performance optimization
- Analyze data from workload or execution output datalogs using Excel or other analysis tools, either manually or with developed automation
- Optimize power and performance features for AI, machine learning, and high-performance computing
- Work in a fast-paced, constrained environment
- Become a key stakeholder in the product performance validation process
- Analyze and debug interactions between various power management features
- Develop and execute performance validation test plans for HPC/ML frameworks
- Configure and set up test and customer-based ML/AI data center GPU systems for data collection, experiments, and post-silicon activities
- Work in Windows and Linux environments
- Support prototyping experiments for new GPU features that impact performance and power
- Troubleshoot system-level issues that may occur in test environments and platforms
- Proactively drive continuous improvement for post-silicon power and performance activities
- Experience in a data center environment preferred
- Excellent grasp of computer organization/architecture and power management
- Knowledge of power-limited performance methodologies and control theory
- Knowledge of memory partitioning and access
- Extensive experience in platform optimization and solid knowledge of computer I/O
- Experience with tools for performance analysis
- Strong programming skills; scripting experience in Python preferred
- Proficiency in the Linux command-line environment and shell scripting is desirable
- Deep knowledge of power management techniques such as deep sleep and clock gating
- Experience with container technologies (e.g., Docker)
- Strong analytical and problem-solving skills with keen attention to detail
- Experience in data analysis, summarization, and presentation
- Excellent presentation and communication skills
- Experience with debug and lab tools such as oscilloscopes, DAQs, and power measurement equipment