Connecting World's top Talents with Premier Jobs and Networking.
Register
Connecting World's top Talents with Premier Jobs and Networking.

混元大模型训练框架研发工程师-(北京/深圳)

Apply instagram Share link

Job Source

腾讯集团

Location

China, Beijing

Salary

Negotiable

Job Type

Full Time

Language

Job Posted Date

20-06-2025

Job Description

1.参与开发优化大模型训练框架,支持单任务万卡以上规模高效稳定训练;
2.参与NLP、多模态大模型结构设计,并联合业务进行模型训练效率和效果验证;
3.参与文生图、文生视频、文生3D等业务的训练性能加速;
4.参与低精度训练性能优化和业务推广、参与大窗口训练性能优化。

Job Requirements

1.熟练使用PyTorch框架,可对DDP训练的代码进行性能分析和优化;
2.熟练使用主流大模型训练框架DeepSpeed、Megatron,掌握3D并行、ZeRO机制、Flash-Attn等的原理、使用场景、优劣势以及可优化方向;
3.有ViT、SD、DiT模型训练性能优化经验者优先;
4.熟练掌握CUDA性能优化手段,有算子编写优化项目经验者优先;
5.对大模型前沿技术比较敏锐者优先;
6.有实际大模型的训练调参和效果评测项目经验的优先;
7.良好的沟通能力、解决问题能力。。加分项:



腾讯集团




Just one more quick step more to complete your application!

 

Welcome to Linkedtour! Please complete your profile first and then enjoy your trip in Linkedtour!

 

Just one more quick step more to complete your application!

 

Please complete now your information at our partner site and click to apply. Good luck !