
AI Evaluation Specialist
2 weeks ago
Binance is a leading global blockchain ecosystem behind the world's largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100 countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance offerings range from trading and finance to education, research, payments, institutional services, Web3 features, and more. We leverage the power of digital assets and blockchain to build an inclusive financial ecosystem to advance the freedom of money and improve financial access for people around the world.
We are seeking a dedicated AI Evaluation Specialist responsible for designing, implementing, and managing comprehensive evaluation frameworks that span the entire lifecycle of LLM agents—from pre-deployment testing to post-deployment monitoring and iterative refinement. Your work will directly influence Binance's AI adoption journey by ensuring the reliability, adaptability, and governance compliance of AI agents operating across various domains such as Customer Service, Growth, and Compliance.
Responsibilities:- Participate in the entire software development lifecycle, encompassing all stages from requirements analysis to test planning, execution, defect tracking, through to product release and maintenance.
- Go to person in relation to A.I Agents evaluation and continuously monitoring.
- Create comprehensive and effective test strategies and hands-on testing to ensure the accuracy, reliability, and performance of AI and data applications .
- Root cause analysis of test failures and product issues in an effective manner, and drive optimization for future enhancements.
- Design and develop internal tools leveraging AI technology to improve engineering and testing work efficiency.
- Bachelor's or Master's degree in Computer Science, Artificial Intelligence, Data Science, or a related field.
- Strong understanding of Large Language Models (LLMs), autonomous AI agents, and their system architectures.
- Experience with AI evaluation methodologies, including offline benchmarking, online monitoring, and hybrid human-AI evaluation approaches.
- Familiarity with software engineering best practices such as Test-Driven Development (TDD), Behavior-Driven Development (BDD), and their limitations in AI contexts.
- Proficiency in designing adaptive, lifecycle-spanning evaluation frameworks that incorporate both quantitative and qualitative metrics.
- Experience with evaluation tools and frameworks (e.g., Opik,LangSmith) is a plus.
- Ability to analyze complex system-level behaviors, including reasoning pipelines, tool integrations, and emergent agent actions.
- Strong analytical skills with experience in data-driven diagnostics and root cause analysis.
- Excellent communication skills to document evaluation plans, results, and recommendations clearly.
- Experience working in cross-functional teams and managing feedback loops between evaluation and development.
- Experience collaborating with infrastructure or platform teams to improve AI tooling and automation platforms.
Why Binance
• Shape the future with the world's leading blockchain ecosystem
• Collaborate with world-class talent in a user-centric global organization with a flat structure
• Tackle unique, fast-paced projects with autonomy in an innovative environment
• Thrive in a results-driven workplace with opportunities for career growth and continuous learning
• Competitive salary and company benefits
• Work-from-home arrangement (the arrangement may vary depending on the work nature of the business team)
Binance is committed to being an equal opportunity employer. We believe that having a diverse workforce is fundamental to our success.
By submitting a job application, you confirm that you have read and agree to our Candidate Privacy Notice.
-
Taiwan Crypto Full time $90,000 - $120,000 per yearis seeking a highly skilled Localization Technical Specialist to join our team. The ideal candidate will have a passion for languages and technology, along with a strong attention to detail. In this role, you will have the opportunity to elevate our automated workflow to new heights, refining AI prompts to achieve even more precise translation outcomes. You...
-
Workflow Automation Specialist
6 days ago
Taiwan PicCollage Full time $70,000 - $120,000 per yearAbout Us: We are a profitable and growing company, originating in Silicon Valley and now headquartered in Taiwan. We combine intuitive design with Creative AI tech to create inspiring products for millions of people worldwide We offer a fun, creative, and international workplace with competitive compensation, stock options, flexible hybrid work, free...
-
AI Investment Partner
2 weeks ago
Taiwan Appier Full time $150,000 - $200,000 per yearAppier is a technology company which aims to provide artificial intelligence platforms to help enterprises solve their most challenging business problems. Appier was established in 2012 by a passionate team of computer scientists and engineers with expertise in AI, data analysis, distributed systems, and marketing. About the Role: We are looking for a...
-
AI Trainer for Mandarin
2 weeks ago
Taiwan Alignerr Full time $15 - $150 per yearis a community of subject matter experts from several disciplines who align AI models by creating high-quality data in their field of expertise to build the future of Generative AI. Alignerr is operated by Labelbox. Labelbox is the leading data-centric AI platform for building intelligent applications. Teams looking to capitalize on the latest advances in...
-
Shape the Future of AI — On Your Terms: Chinese
2 weeks ago
Taiwan Welocalize Full time $40,000 - $60,000 per yearWelo Data works with technology companies to provide datasets that are high-quality, ethically sourced, relevant, diverse, and scalable to supercharge their AI models. As a Welocalize brand, WeloData leverages over 25 years of experience in partnering with the world's most innovative companies and brings together a curated global community of over 500,000 AI...
-
Supply Chain Specialist
2 weeks ago
Banqiao, Taiwan (TW) NZXT, Inc. Full time $90,000 - $120,000 per yearJob Title: Supply Chain SpecialistLocation: Taiwan, TaipeiWorkplace Type: Onsite JOB SUMMARY We are seeking an experienced Buyer to join NZXT, a multinational leader in gaming computer technology. This role requires expertise in managing contract manufacturers (CMs) and driving cross-functional alignment with internal stakeholders. The ideal candidate will...
-
Senior RTB Product Manager
2 weeks ago
Taiwan Appier Full time $90,000 - $120,000 per yearAbout Appier Appier is a software-as-a-service (SaaS) company that uses artificial intelligence (AI) to power business decision-making. Founded in 2012 with a vision of democratizing AI, Appier's mission is turning AI into ROI by making software intelligent. Appier now has 17 offices across APAC, Europe and U.S., and is listed on the Tokyo Stock Exchange...
-
Senior Machine Learning Scientist, LLM
2 weeks ago
Taiwan Appier Full time $120,000 - $150,000 per yearAbout Appier Appier is a software-as-a-service (SaaS) company that uses artificial intelligence (AI) to power business decision-making. Founded in 2012 with a vision of democratizing AI, Appier's mission is turning AI into ROI by making software intelligent. Appier now has 17 offices across APAC, Europe and U.S., and is listed on the Tokyo Stock Exchange...
-
Research Scientist
2 weeks ago
Taiwan Appier Full time $90,000 - $120,000 per yearAbout Appier Appier is a software-as-a-service (SaaS) company that uses artificial intelligence (AI) to power business decision-making. Founded in 2012 with a vision of democratizing AI, Appier's mission is turning AI into ROI by making software intelligent. Appier now has 17 offices across APAC, Europe and U.S., and is listed on the Tokyo Stock Exchange...
-
Security IT Operations
2 weeks ago
Taiwan Crypto Full time $90,000 - $120,000 per yearWe are seeking a highly motivated and skilled Security IT Operations - AI Champion to drive the integration of AI solutions within our IT operations, with a strong focus on enhancing security. This role will be instrumental in leveraging AI tools and technologies to improve efficiency, automate processes, and bolster our security posture. The ideal candidate...