Job

LLM Software Engineer/Researcher (Applied Machine Learning) - 2024 Start (PhD)

ByteDance

•

Nov 17

💼 Graduate Job

Seattle

The ideal candidate must have a Ph.D./Master in Computer Science or related field, prior experience in training large language models, strong understanding of cutting-edge LLM research, proficiency in Python or C++, experience with deep learning frameworks, distributed computing, and have published research papers or contributions to the LLM community. Additionally, experience with inference tuning, GPU/AI accelerators, large-scale machine learning systems, deployment of AI models, and LLM application development is desirable.
The candidate will lead the development of advanced techniques for machine learning, work on creating high-capacity platforms, collaborate with cross-functional teams, and contribute to the success of large models at ByteDance.

Graduate Job

Software Engineering, Data•Seattle

The Applied Machine Learning Enterprise team combines system engineering and machine learning to develop and operate big model service platform that offers businesses Model-as-a-Service solutions (MaaS) to both the big model vendors and users. We are actively seeking talented Software Engineers/Researchers specializing in Large Language Models (LLM) to join our dynamic team.
We are looking for talented individuals to join our team in 2024. As a graduate, you will get unparalleled opportunities for you to kickstart your career, pursue bold ideas and explore limitless growth opportunities. Co-create a future driven by your inspiration with ByteDance.

Ph.D./Master in Computer Science, Artificial Intelligence, or a related field.
Have prior experience working with training and inference of large language models.
Strong understanding of cutting-edge LLM research (e.g., long context, multi modality, alignment research etc.) and possess practical expertise in effectively implementing these advanced systems.
Proficiency in programming languages such as Python or C++ and a track record of working with deep learning frameworks (e.g., pytorch, deepspeed, etc.).
Strong understanding of distributed computing framework & performance tuning and verification for training/finetuning/inference. Being familiar with PEFT or MoE is a plus.
Preferred Qualifications:
Excellent problem-solving skills and a creative mindset to address complex AI challenges. Demonstrated ability to drive research projects from idea to implementation, producing tangible outcomes.
Published research papers or contributions to the LLM community would be a significant plus.
Experience with inference tuning and Inference acceleration. Have a deep understanding of GPU and/or other AI accelerators, experience with large scale AI networks, pytorch 2.0 and similar technologies.
Experience with large scale machine learning systems' scheduling and orchestration, familiar with Kubernetes and Cloud Native technologies.
Experience with deploying AI models into production environments, testing and evaluation of AI systems, LLM application & agent development is desirable.

Masters

PhD

Software Engineering

Data

In this role, you will be at the forefront of cutting-edge research and development of advanced techniques for MaaS solutions including model continuous pretraining, fine-tuning, evaluation, inference capabilities and also LLM application/agent development. Your primary responsibility will be to:
lead the creation of next-generation, high-capacity LLM platforms and innovative products.
work closely with cross-functional teams to plan and implement projects harnessing LLMs for diverse purposes and vertical domains.
Maintain a deep passion for contributing to the success of large models is essential in this innovative and fast-paced team environment.

Work type

Full time

Work mode

office

Location

Seattle