Logo of Huzzle

Job

LLM Software Engineer/Researcher (Applied Machine Learning) - 2024 Start (PhD)

Logo of ByteDance

ByteDance

Nov 17

💼 Graduate Job

Seattle

AI generated summary

  • The ideal candidate must have a Ph.D./Master in Computer Science or related field, prior experience in training large language models, strong understanding of cutting-edge LLM research, proficiency in Python or C++, experience with deep learning frameworks, distributed computing, and have published research papers or contributions to the LLM community. Additionally, experience with inference tuning, GPU/AI accelerators, large-scale machine learning systems, deployment of AI models, and LLM application development is desirable.
  • The candidate will lead the development of advanced techniques for machine learning, work on creating high-capacity platforms, collaborate with cross-functional teams, and contribute to the success of large models at ByteDance.

Graduate Job

Software Engineering, DataSeattle

Description

  • The Applied Machine Learning Enterprise team combines system engineering and machine learning to develop and operate big model service platform that offers businesses Model-as-a-Service solutions (MaaS) to both the big model vendors and users. We are actively seeking talented Software Engineers/Researchers specializing in Large Language Models (LLM) to join our dynamic team.
  • We are looking for talented individuals to join our team in 2024. As a graduate, you will get unparalleled opportunities for you to kickstart your career, pursue bold ideas and explore limitless growth opportunities. Co-create a future driven by your inspiration with ByteDance.

Requirements

  • Ph.D./Master in Computer Science, Artificial Intelligence, or a related field.
  • Have prior experience working with training and inference of large language models.
  • Strong understanding of cutting-edge LLM research (e.g., long context, multi modality, alignment research etc.) and possess practical expertise in effectively implementing these advanced systems.
  • Proficiency in programming languages such as Python or C++ and a track record of working with deep learning frameworks (e.g., pytorch, deepspeed, etc.).
  • Strong understanding of distributed computing framework & performance tuning and verification for training/finetuning/inference. Being familiar with PEFT or MoE is a plus.
  • Preferred Qualifications:
  • Excellent problem-solving skills and a creative mindset to address complex AI challenges. Demonstrated ability to drive research projects from idea to implementation, producing tangible outcomes.
  • Published research papers or contributions to the LLM community would be a significant plus.
  • Experience with inference tuning and Inference acceleration. Have a deep understanding of GPU and/or other AI accelerators, experience with large scale AI networks, pytorch 2.0 and similar technologies.
  • Experience with large scale machine learning systems' scheduling and orchestration, familiar with Kubernetes and Cloud Native technologies.
  • Experience with deploying AI models into production environments, testing and evaluation of AI systems, LLM application & agent development is desirable.

Education requirements

Masters
PhD

Area of Responsibilities

Software Engineering
Data

Responsibilities

  • In this role, you will be at the forefront of cutting-edge research and development of advanced techniques for MaaS solutions including model continuous pretraining, fine-tuning, evaluation, inference capabilities and also LLM application/agent development. Your primary responsibility will be to:
  • lead the creation of next-generation, high-capacity LLM platforms and innovative products.
  • work closely with cross-functional teams to plan and implement projects harnessing LLMs for diverse purposes and vertical domains.
  • Maintain a deep passion for contributing to the success of large models is essential in this innovative and fast-paced team environment.

Details

Work type

Full time

Work mode

office

Location

Seattle