
System Engineer_TC27603
20 hours ago
New Taipei City New Taipei City, Taiwan
Supermicro
Full time
NT$900,000 - NT$1,200,000 per year
Responsibilities
- Familiar with the day-to-day operational support for Cluster, Storage, HPC, AI, Data Center and Cloud infrastructures.
- Builds Cluster, Storage, HPC, AI, Data Center and Cloud infrastructures in-house and onsite testing, deployment, and platforms accordingly to meet customer's requirement.
- Troubleshoot hardware and software issues in rack cabinet. Provide fixes in a timely manner.
- Documents complex test procedures and troubleshooting procedures related to servers/networks/clusters software and hardware.
- Familiar with Intel/AMD/NVIDIA development toolkits like CUDA, oneAPI, ROCm.
- Conduct tests and benchmarks against server hardware, storage, network, applications, HPC and AI/ML/DL workflows.
- Programming experience with web applications, including frontend or backend.
- Collect, visualize, and analyze test and benchmark results.
- Programming experience with Python, Ansible and Linux shell scripting.
- Write technical documentation including test reports and standard operating procedure (SOP).
Qualifications
- Bachelor's or Master degree in Computer Science or equivalent work experience preferred.
- 3+ years of proven experience in a HPC/AI or Cloud/Network management.
- In-depth knowledge of Cloud/HPC/AI deployment and testing.
- Strong problem-solving and decision-making abilities, with a proactive approach to identifying and resolving issues.
- Excellent communication skills, both verbal and written, with the ability to collaborate and build strong relationships with stakeholders at all levels.
- It's a plus if you have CCNA/CCNP, AWS, RHCE or RHCA certificates.
- Positive attitude, desire to learn, time management, and strong interpersonal skills are a plus