Data Scientist, Reinforcement Learning

2 weeks ago


Taipei, Taiwan Binance Full time $150,000 - $200,000 per year

Binance is a leading global blockchain ecosystem behind the world's largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance offerings range from trading and finance to education, research, payments, institutional services, Web3 features, and more. We leverage the power of digital assets and blockchain to build an inclusive financial ecosystem to advance the freedom of money and improve financial access for people around the world.

About the Role

You will develop and optimize RL models for enterprise-scale applications such as customer service, token reporting, compliance, and Web3 domain reasoning. You will explore and evaluate advanced algorithms including PPO, GRPO, DPO, RLHF, RLAIF, and Agentic RL to enhance the capabilities of LLMs, VLMs, and Agentic AI at Binance. The role requires a strong theoretical foundation in RL—covering policy optimization, reward modeling, and planning—paired with the engineering skills to build scalable production systems. You will take full ownership from research through deployment, driving experimentation with systematic evaluation and benchmarking. Collaboration across research, infrastructure, and application teams will be key to delivering impactful AI solutions.

Responsibilities:
  • Research and develop state-of-the-art RL algorithms, focusing on large model optimization and alignment techniques.
  • Design and implement RL training pipelines, including environment simulation, data generation, and reward function design.
  • Apply RL methods to enhance LLM/VLM/Agentic AI capabilities in reasoning, planning, and autonomous decision-making.
  • Collaborate with engineers and researchers to integrate RL solutions into enterprise AI platforms.
  • Monitor model performance in production and continuously improve through iterative training and fine-tuning.
Requirements:
  • Master's degree in Computer Science, Applied Mathematics, Machine Learning, or related fields.
  • 3+ years of hands-on experience in RL or LLM/VLM/Agentic AI optimization.
  • Strong coding skills in Python, with experience in ML frameworks and RL libraries.
  • Experience with large-scale distributed training and optimization.
  • Self-driven, ownership mindset, and strong problem-solving skills. Excellent communication skills for cross-functional collaboration.

Why Binance


• Shape the future with the world's leading blockchain ecosystem


• Collaborate with world-class talent in a user-centric global organization with a flat structure


• Tackle unique, fast-paced projects with autonomy in an innovative environment


• Thrive in a results-driven workplace with opportunities for career growth and continuous learning


• Competitive salary and company benefits


• Work-from-home arrangement (the arrangement may vary depending on the work nature of the business team)

Binance is committed to being an equal opportunity employer. We believe that having a diverse workforce is fundamental to our success.

By submitting a job application, you confirm that you have read and agree to our Candidate Privacy Notice.


  • Data Scientist

    1 week ago


    Taipei, Taipei City, Taiwan Binance Full time $70,000 - $120,000 per year

    Binance is a leading global blockchain ecosystem behind the world's largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance...


  • Taipei, Taipei City, Taiwan Appier Full time $80,000 - $120,000 per year

    About AppierAppier is a software-as-a-service (SaaS) company that uses artificial intelligence (AI) to power business decision-making. Founded in 2012 with a vision of democratizing AI, Appier's mission is turning AI into ROI by making software intelligent. Appier now has 17 offices across APAC, Europe and U.S., and is listed on the Tokyo Stock Exchange...

  • Research Scientist

    2 weeks ago


    Taipei, Taiwan Binance Full time $150,000 - $200,000 per year

    Binance is a leading global blockchain ecosystem behind the world's largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 280 million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance...


  • Taipei, Taipei City, Taiwan Appier Full time $104,000 - $130,878 per year

    About AppierAppier is a software-as-a-service (SaaS) company that uses artificial intelligence (AI) to power business decision-making. Founded in 2012 with a vision of democratizing AI, Appier's mission is turning AI into ROI by making software intelligent. Appier now has 17 offices across APAC, Europe and U.S., and is listed on the Tokyo Stock Exchange...

  • Data Scientist

    3 days ago


    Taipei City, Taipei City, Taiwan Aimazing Full time $104,000 - $130,878 per year

    Aimazing is looking for a Data Scientist to join our team. You will partner with engineers, designers and product managers to identify growth and scaling opportunities, help with hypothesis design, pre and post analysis and communication of results and recommendations. You will also be responsible to develop potential new methodologies and build platforms...

  • Sr. Data Scientist

    2 weeks ago


    Taipei, Taipei City, Taiwan Trend Micro Full time $90,000 - $120,000 per year

    Join Trend ‧ Join New Generation趨勢科技 - 全球雲端資安領航者 / 全亞洲最大軟體公司 / 企業版圖橫跨五大洲 / 趨勢全球研發基地在台灣===============================================================AILAB is responsible to monitor the latest trending of AI development and application, we are looking for Data Scientist with...

  • Senior Data Scientist

    2 weeks ago


    Taipei, Taiwan KKCompany Technologies Full time $90,000 - $120,000 per year

    Team Segment : Cloud & Ai Solutions KKCompany Technologies, Asias leading AI multimedia technology group is dedicated to creating values for customers with core businesses of multimedia technologies, digital cloud, and AI applications. At KKCompany, we believe in Innovation Made Simple, and technology is the answer to the struggles faced by every...


  • Taipei, Taipei City, Taiwan Appier Full time $100,000 - $150,000 per year

    About AppierAppier is a software-as-a-service (SaaS) company that uses artificial intelligence (AI) to power business decision-making. Founded in 2012 with a vision of democratizing AI, Appier's mission is turning AI into ROI by making software intelligent. Appier now has 17 offices across APAC, Europe and U.S., and is listed on the Tokyo Stock Exchange...

  • Sr. Data Scientist

    2 weeks ago


    Taipei, Taipei City, Taiwan Trend Micro Full time $104,000 - $130,878 per year

    Join Trend ‧ Join New Generation趨勢科技 - 全球雲端資安領航者 / 全亞洲最大軟體公司 / 企業版圖橫跨五大洲 / 趨勢全球研發基地在台灣===============================================================OverviewAt Trend Micro, we are on a mission to make the world safer for exchanging digital information.Join us to tackle the...


  • Taipei, Taipei City, Taiwan Netskope Full time $120,000 - $200,000 per year

    About NetskopeToday, there's more data and users outside the enterprise than inside, causing the network perimeter as we know it to dissolve. We realized a new perimeter was needed, one that is built in the cloud and follows and protects data wherever it goes, so we started Netskope to redefine Cloud, Network and Data Security.Since 2012, we have built the...