System Engineer_TC27603

20 hours ago


New Taipei City New Taipei City, Taiwan Supermicro Full time NT$900,000 - NT$1,200,000 per year

Responsibilities

  • Familiar with the day-to-day operational support for Cluster, Storage, HPC, AI, Data Center and Cloud infrastructures.
  • Builds Cluster, Storage, HPC, AI, Data Center and Cloud infrastructures in-house and onsite testing, deployment, and platforms accordingly to meet customer's requirement.
  • Troubleshoot hardware and software issues in rack cabinet. Provide fixes in a timely manner.
  • Documents complex test procedures and troubleshooting procedures related to servers/networks/clusters software and hardware.
  • Familiar with Intel/AMD/NVIDIA development toolkits like CUDA, oneAPI, ROCm.
  • Conduct tests and benchmarks against server hardware, storage, network, applications, HPC and AI/ML/DL workflows.
  • Programming experience with web applications, including frontend or backend.
  • Collect, visualize, and analyze test and benchmark results.
  • Programming experience with Python, Ansible and Linux shell scripting.
  • Write technical documentation including test reports and standard operating procedure (SOP).

Qualifications

  • Bachelor's or Master degree in Computer Science or equivalent work experience preferred.
  • 3+ years of proven experience in a HPC/AI or Cloud/Network management.
  • In-depth knowledge of Cloud/HPC/AI deployment and testing.
  • Strong problem-solving and decision-making abilities, with a proactive approach to identifying and resolving issues.
  • Excellent communication skills, both verbal and written, with the ability to collaborate and build strong relationships with stakeholders at all levels.
  • It's a plus if you have CCNA/CCNP, AWS, RHCE or RHCA certificates.
  • Positive attitude, desire to learn, time management, and strong interpersonal skills are a plus