Connecting World's top Talents with Premier Jobs and Networking.
Register
Connecting World's top Talents with Premier Jobs and Networking.

大模型推理优化研发工程师-算子优化/编译(深圳/北京/上海/杭州)

Apply instagram Share link

Job Source

腾讯集团

Location

China, Shenzhen

Salary

Negotiable

Designation

Internet/AI

Job Type

Full Time

Language

Job Posted Date

19-09-2025

Job Description

1.针对大模型推理场景,负责GPU/AI芯片底层性能优化与调优;
2.优化和扩展vLLM、SGLang、PyTorch等框架的核心模块,提升计算效率与资源利用率;
3.深入分析GPU/AI芯片的硬件架构特性,设计并实现高性能算子、算法和特性使能组件;
4.探索前沿技术方向(如混合专家模型MOE、动态计算图编译优化等)。

Job Requirements

1.熟练掌握C/C++、Python编程语言,具备良好的coding和调试能力;
2.熟悉GPU/AI芯片编程,如CUDA,OpenCL,Ascend C等;
3.熟悉Cublas,Cutlass,CK等高性能算子开发工具;
4.熟悉Torch-Compile等AI编译模块者优先;
5.熟悉主流大模型推理框架,有实际性能调优经验(如KV Cache优化、动态批处理、Attention算子定制等);
6.扎实的高性能计算基础,熟悉并行计算、内存优化、通信优化等技术。。加分项:1.机器学习或者体系结构相关顶会论文,开源项目贡献者;
2.熟悉Attention结构MHA/MQA/GQA/MLA,以及MOE结构等。



腾讯集团




Just one more quick step more to complete your application!

 

Welcome to Linkedtour! Please complete your profile first and then enjoy your trip in Linkedtour!

 

Just one more quick step more to complete your application!

 

Please complete now your information at our partner site and click to apply. Good luck !