AI HPC Manager
1 week ago
Established in 1987 and headquartered in Taiwan, TSMC pioneered the pure-play foundry business model with an exclusive focus on manufacturing its customers' products. As of 2024, TSMC serves more than 500 customers and manufactures over 11,000 products for high-performance computing, smartphones, the Internet of Things (IoT), automotive, and digital consumer electronics. It is the world's largest provider of logic ICs, with an annual capacity of 16 million 12-inch equivalent wafers. TSMC operates fabs in Taiwan as well as manufacturing subsidiaries in Washington State, Japan and China, and the Company began construction on a specialty technology fab in Dresden, Germany, in 2024. In Arizona, TSMC is building three fabs, with the first starting 4nm production in 2025, the second by 2028, and the third by the end of the decade.
Job Responsibilities:
An AI (Artificial Intelligence) HPC (High-Performance Computing) architect is responsible for designing, developing, and implementing high-performance computing solutions with a specific focus on artificial intelligence and machine learning workloads. Their role involves a combination of expertise in AI algorithms, HPC technologies, and system architecture.
Job description of AI HPC Architect:
Solution Design: Collaborate with stakeholders, including data scientists, researchers, and IT teams, to understand their requirements and translate them into effective AI HPC solutions. Design architectures that optimize performance, scalability, and efficiency for AI workloads.
HPC Infrastructure Planning: Evaluate and select appropriate hardware, networking, and storage resources to support AI computations at scale. This involves considering factors such as parallel processing, GPU (Graphics Processing Unit) acceleration, interconnect technologies, and storage systems.
Algorithm Optimization: Work closely with data scientists and machine learning experts to identify opportunities for algorithm optimization. Apply techniques like parallel processing, distributed computing, and GPU acceleration to improve the performance and efficiency of AI models.
Performance Tuning: Optimize the performance of AI workloads by fine-tuning system configurations, resource allocation, and workload management. Identify and resolve bottlenecks related to computation, memory, storage, or network bandwidth.
Scalability and Resilience: Design solutions that can scale to handle large-scale AI workloads and accommodate future growth. Ensure high availability and fault tolerance by implementing redundancy, load balancing, and failover mechanisms.
Integration and Deployment: Collaborate with software engineers and DevOps teams to integrate AI models and algorithms into production environments. Develop deployment strategies and workflows for efficient deployment and management of AI HPC systems.
Research and Innovation: Stay up to date with the latest advancements in AI, machine learning, and HPC technologies. Identify and evaluate emerging technologies, frameworks, and tools to enhance the performance and capabilities of AI HPC systems.
Documentation and Communication: Document system designs, configurations, and performance optimizations. Communicate complex technical concepts and recommendations effectively to both technical and non-technical stakeholders.
Job Qualifications:
Strong Background in AI and Machine Learning: A deep understanding of artificial intelligence and machine learning concepts, algorithms, and frameworks is essential. Experience in developing and deploying AI models is highly valuable.
Expertise in High-Performance Computing: In-depth knowledge of high-performance computing architectures, technologies, and best practices is crucial. Familiarity with HPC frameworks, parallel processing, distributed computing, and GPU acceleration is important for optimizing AI workloads.
System Architecture and Design: Proficiency in designing scalable and efficient system architectures is necessary. Experience in selecting and configuring hardware components, networking technologies, and storage systems for high-performance computing is desirable.
Programming and Scripting Skills: Proficiency in programming languages such as Python, C++, or Java is important for implementing and optimizing AI algorithms. Knowledge of scripting languages like Bash or PowerShell is beneficial for automation and system management tasks.
HPC Tools and Frameworks: Familiarity with HPC tools and frameworks, such as MPI (Message Passing Interface), OpenMP, CUDA, or OpenCL, is valuable. Understanding how to leverage these tools for parallel computing and GPU acceleration is advantageous.
Performance Optimization: Experience in performance tuning and optimization techniques for large-scale computing systems is essential. Knowledge of profiling tools, benchmarking, and workload management is valuable for identifying and resolving performance bottlenecks.
System Administration: Understanding system administration principles and practices is beneficial. Knowledge of Linux or UNIX-based operating systems, system monitoring tools, and cluster management frameworks is advantageous.
Communication and Collaboration: Strong communication skills are crucial for collaborating with cross-functional teams, including data scientists, researchers, software engineers, and IT professionals. The ability to convey complex technical concepts to both technical and non-technical stakeholders is important.
Continuous Learning: The field of AI and HPC is rapidly evolving, so a passion for continuous learning and staying updated with the latest advancements is important. Keeping up with research papers, attending conferences, and participating in relevant communities can contribute to professional growth.
Fostering a global inclusive workplace reflects TSMC's core values and business philosophy and is essential for our future success. Our commitment to global inclusive workplace allows us to create an environment where every employee, regardless of gender, age, disability, religion, race, ethnicity, nationality, political affiliation, or sexual orientation, can bring their unique perspective and experiences to work, enabling us to drive profitability, increase productivity, and unleash innovation. We strive to create a workplace that is equitable and accessible to all employees. We are committed to fostering an inclusive culture where every employee feels valued and empowered to contribute to our mission and provide excellent service to our global customers.
-
Intern- AI Software Platform Engineer
2 weeks ago
Hsinchu City,, Taiwan Qualcomm Full time NT$40,000 - NT$80,000 per yearCompanyQualcomm Semiconductor LimitedJob AreaInterns Group, Interns Group > Interim InternQualcomm OverviewQualcomm is a company of inventors that unlocked 5G ushering in an age of rapid acceleration in connectivity and new possibilities that will transform industries, create jobs, and enrich lives. But this is just the beginning. It takes inventive minds...
-
AI Technical Manager
1 week ago
Hsinchu City,, Taiwan TSMC Full time $120,000 - $240,000 per yearEstablished in 1987 and headquartered in Taiwan, TSMC pioneered the pure-play foundry business model with an exclusive focus on manufacturing its customers' products. As of 2024, TSMC serves more than 500 customers and manufactures over 11,000 products for high-performance computing, smartphones, the Internet of Things (IoT), automotive, and digital consumer...
-
AI Optimization Manager
1 week ago
Hsinchu City,, Taiwan TSMC Full time NT$120,000 - NT$180,000 per yearEstablished in 1987 and headquartered in Taiwan, TSMC pioneered the pure-play foundry business model with an exclusive focus on manufacturing its customers' products. As of 2024, TSMC serves more than 500 customers and manufactures over 11,000 products for high-performance computing, smartphones, the Internet of Things (IoT), automotive, and digital consumer...
-
AI Product Manager
3 days ago
Hsinchu City,, Taiwan Winbond Full time NT$900,000 - NT$1,200,000 per year作為華邦的(竹北) AI Product Manager ,你將協助內部需求部門建置完善的資訊系統平台,促進企業內部運作並提升生產力。工作內容包含:
-
Manager Sys/Test Validation Eng
1 week ago
Hsinchu City, Taiwan Advanced Micro Devices, Inc Full time $80,000 - $120,000 per yearWHAT YOU DO AT AMD CHANGES EVERYTHINGAt AMD, our mission is to build great products that accelerate next-generation computing experiences – from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create...
-
CAD AI Program Manager
2 weeks ago
Hsinchu City,, Taiwan Winbond Full time NT$900,000 - NT$1,200,000 per year作為華邦的(竹北) CAD AI Program Manager ,[職位簡介]主導 CAD 團隊內部的 AI 應用發展策略,負責分析工作流程痛點、規劃 AI 導入計畫、整合跨部門資源,並協助帶領 AI 工程團隊實作與部署。[工作內容]規劃 AI 在 CAD 領域的導入策略,建立技術...
-
AAID AI Product Manager
2 weeks ago
Hsinchu County,, Taiwan TSMC Full time $104,000 - $130,878 per yearWe are looking for an AI Product Manager to leverage their semiconductor equipment expertise and drive the development of innovative AI solutions, significantly enhancing our products and delivering value to equipment engineers.ResponsibilitiesLeverage strong knowledge of semiconductor equipment to identify AI opportunities and define a compelling product...
-
Site Reliability Engineer-AI Cloud
1 week ago
Bade District, Taoyuan City, Taiwan Supermicro Full time NT$280,000 - NT$1,440,000 per year*Job Req ID: 26896About Supermicro*Supermicro is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop/ Big Data, Hyperscale, HPC and IoT/Embedded customers worldwide. We are the #5 fastest growing company among the Silicon Valley Top 50 technology firms. Our unprecedented global...
-
Manufacturing Manager
1 day ago
Hsinchu City,, Taiwan Graphcore Full time NT$1,800,000 - NT$2,500,000 per yearAbout GraphcoreGraphcore is one of the world's leading innovators in Artificial Intelligence compute.It is developing hardware, software and systems infrastructure that will unlock the next generation of AI breakthroughs and power the widespread adoption of AI solutions across every industry.As part of the SoftBank Group, Graphcore is a member of an elite...
-
Manufacturing Manager
1 day ago
Hsinchu City, Taiwan Graphcore Full time NT$1,200,000 - NT$2,400,000 per yearAbout GraphcoreGraphcore is one of the world's leading innovators in Artificial Intelligence compute. It is developing hardware, software and systems infrastructure that will unlock the next generation of AI breakthroughs and power the widespread adoption of AI solutions across every industry. As part of the SoftBank Group, Graphcore is a member of...